Gene PHATRDRAFT_43585 details

Gene Information

Locus tag: PHATRDRAFT_43585
Symbol:
ID: 7197315
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011670
Strand:
Start bp: 882047
End bp: 885579
Gene Length: 3533 bp
Protein Length: 1105 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002177709
Protein GI: 219111915
COG category:
COG ID:
TIGRFAM ID:
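
The listed gene length follows from the start and end coordinates if both are taken as 1-based and inclusive, which is an assumption that matches the 3533 bp shown above. A minimal Python sketch of that arithmetic, together with a GC-percentage helper for sequences written in space-separated blocks, might look like the following. The function names gene_length and gc_percent are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide sequence.

    Whitespace (the 10-base block spacing and line breaks used in the
    record) is ignored before counting.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the record: Start bp 882047, End bp 885579.
print(gene_length(882047, 885579))  # 3533
```

Passing the full gene sequence from the Sequence section to gc_percent would give a value to compare against the listed GC content; the result is not reproduced here.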


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGTCGG ATAGAAAAGA ATACACAAAG AACGACGAGG AGCAAGGGGA CCAGGCGCAA 
GCTTTGCCTC CTTCCGAATA CTCGACGACC GAACGGAATC CACCGACACC AAATTGCGCC
CTGCGTCAGG ACTTGTGCTC CGAAAACAGC TCATTAGTGT CCGACCTTAT TCAATCCGCT
GCTCGGTCCG TCTTGGAAGA CGAGCACTAT GAGCAGGAAA GGAACGGTTC GTCGTGCTCG
GGCTCTTACA CCGACATCGA CGGCGGTCGA AGAGGATACC ATACCTCTGG AATGACACCG
AATTCACGGA AAAAGGGTCC ACTATCTCCA ACCTTGCAAT CCTTACAAAC TAAAACGCTA
CCCGCCATGC CCGATGAACA AGATCGCAAG CGTTTCGTGG TGCGTACTTT TCCACATTCC
ATTGTGCTCC CCTACAGGTG CAGGTTCTGA TCGTTTGGGG AGATTTTGGC TTACAAAAAC
TCGGATCAAG TGTATTTCTC ATATACTTTT TGAGTTTGTC TCCTTTTTTG TTTCTAAAAT
AGGGTTGTTT AGCAGCAGTT TTGGCGTCTC TTTACGATTT CGATGTGGTT GATGACGACG
AAGACTTGTC TCAAGCCGAT AAAGTCCTGA GCATGGCATA CCTTGATAGA AGTGAGCAAG
ACGACGAGGA CGAGCATAGT CTTAGTCCGA CCAAAGCAAG TCGTAACAAT ACAAAAGACA
GCAGCAGCTT ATATTCGCGC TCAGTCGACG GTTTTGACAC ACCAGCCAGA GCGCAACAAT
ACCGATCGAT GATTTCGTCT CGATCATCCA TGCAAATCGA TCGGGGCACC CGGGATCAGC
TTCAAAAGGC TCGATCACGC CATCGCAAAC GGCGGTACGA TATATTGTCG GATCTTCTTT
TGGCGTCAGG AGACTATTTG CAACTCGAAA GTGGCCAGGT CAAGGCTTTT TTACCTATGC
TAGCCAAACT CCTGGTGCCG AACAGTGATA AAAAAGAGGC ATCGCATGCA TCTCAACTTT
CAGCTCAACA TGGTCAGCGG CCGCACTCCA ATAGTAGCAA CACGAGCAGC AATTTGGCTG
CTATGGACAA TCAGGAAAAG ACTATATTGC AACGCTCAGG GATCTCTGGC GCCAATGTGA
ATGGCTTTAC AAATTCCGAA GAAATGGTGC ATCTTGAATT GGATGACATC GAGTATTTGC
GGCCGTTCTT GGAATCACTG ACTCCTGGTG CCGGGTTGCG ATGTGTTGCG TTACTGCTCC
TTCAGTATTT ATTGCTGCAC AGCCGTCAAA CGGGCTATGA CGCTCGTGTA CGACATGCCA
TAAAAACGCT TGGTGTCCTG GTTCTGGTTC ATGACATGCA ACATGACCCC GTTGATGTGT
ACATTGATGA TGAATTGAAA AAGCCTTCTT CTTCACCAAG GACTCGACGA CATCGAGGTC
ATTTGAAGAC GTCGCATCCC GATTTGGTCG TACTGGCCAC CCGCAAGTTT GAATCGCTTG
AACACTTTAT TGCAGCAAAA CTAATCGTGT TGTCACGTGA ACAGCAGGCA CATAAAGTTC
ATAGGGGCGC CCGGAGTGCT GGTGCTCGCT CGTCTCAGAC TCAACAGACA CCAGCATCAA
AAGGCCTGAC CCGGGAACAG TGGATGCGAG GGATTAAGAT TGGTGGCACG GCGATGGCAG
CCGGTACCTT ATTTGCAATA ACGGGAGGAC TCGCCGCTCC AGGTATTGCC GCAGGGGTTG
CGGCGATTGC GGGAGGGACG GCAGTGACAG CGGCCGCCGC GGCTGTCTTA ACAAGTACGG
CAGCTGTGAC GACAATCTTT GGAGTGGGGG GAGGAGGATT GGCAGCGTAC AAAATGCAGC
GGCGGACACA AGGTTTAACC GAGTTCGAAT TTCGTAAAGA AACTGGAAAG GCAAGTCGGG
AGAAAGAGGG TCAAATAGAC ACAGTAGACG CTGAGCTGTT TAGTACAATC TGCATATCAG
GTTGGCTCCG GGACAAATTC GATTTTCAAC GACCTTGGGG GGTCTCCCCA TCACGACCTG
AGTTGACTGA TCGACAAGAG CTGTTGGAAA GATTCTACAC GATCCATAGT CCATCGCATA
TATCACGTTG TGCCAAAATT TTGGACCATT GGAAAGGTGA AGAAAAGGAT CTTTGGGGTT
TGCTCAGGCA AAAGTACGGG CAAGATCCAG ACCATTTATT TCCTTTGGAG AAAGGTCCTC
GATTACACGC CTCGTTGACT CTTGAGCAGA AGGAGGTCAT AGATCAGTTG TTTGTAGAGC
TGGGATACAC GCCCAAATCT CTGGACGAAA TAAAAACGCA GCCTACGCCT TTCGAAAGAA
TTAGGAAGGG CTGGAATAAA CAAGCCGCTG GACCTCGACG CGATGAAAAT TTATCTACTT
CACACATTCC TGTCGGTCCT GCACATCGAT CTCTTGCAGA TTCCTTACAA AGTCCTGAGA
GTGTCGAGAC ATACGTTGGG TCTAGAGCTG AGGTTACATC GTCGGGATTT GAGAGCTTTT
CTACTGCGCT GTCAATGCTT CCACCGGACA AGCGATCAGA TGAGTCGACA GAGAAAGTTG
AGTTGCCAAG GCACATTGCT ACTGTTTGGG ACTATCCATC TATATATGGA GGGGAGCAGT
ATACGGTACA ATGGGAAAGT GAACTGCTGA CTGAGTTGTG CGACTCTGTC AATGACCTTG
CGCGAGATTT GGTAAGCGGT GGAACCGCTC AGATCTTAAA GCATACTGCT TTGTCAACGC
TAATATCGGC CTTTGCTTGG CCGTACGCGC TTGTAAACGC CGCAAACATG ATTGATGGGA
CGTGGACGCT AGCAGTTGAA CGATCCGATG AAGCGGGGAG AGAGTTGGCC AGAAGCTTGC
TCCTCAGCCG GGCAGGCCAT CGTCCTGTTA CTCTCGTAGG ATTCTCCTTT GGCGCACGAG
CAATCTATTC TTGCTTGAAA GAGCTCGCTC GCCTTCAGGA AAAATGGGAA GATTTTTGTG
AAGACGAGGA TTCCTCTCGG AGCGGAAAAG TGTTGCAAAA CCAATCAGTC GCCGATTTAG
AGTTAGACGA ATCAAACAAG GACTATTTCA GGTACATGCG AGAGCCGGCA AGCATAGTTG
AAGATGTGGT ACTAATGGGA CTTCCAAACC ATCTTAGCTT ATCTTCTTGG AAGGCATGTC
GCCAAGTTGT GGCCGGGAGG CTTATCAACT GCTTTTCTCA GAAGGATTTG ATCCTTTCAC
TGATGTTTCA ATTCAAAAGG CTCGGGCTTA AGCCGGTATG TGGAACTTGT CCAGTTAACG
TACCTGGGGT GGAGAATATT GATGTATCCG ATTTGGTATC CGGTCACCAG GATTACACTC
TCGTTAACGG AGATATTTTG AAACGCGTGA GGCATTGTCA ACCTTTTCGA TCCAGGCACA
CTCGTATATT TGTGCCGGAA GTCGCTGCAT CAAGCATGTA AATAAAAACT CTTGATGGAG
TCGAGGCTCA GCGAAAAGTG CAAGTTTTAT TTCAGAGTAT TAAGCGAATC AAT
 
Protein sequence
MESDRKEYTK NDEEQGDQAQ ALPPSEYSTT ERNPPTPNCA LRQDLCSENS SLVSDLIQSA 
ARSVLEDEHY EQERNGSSCS GSYTDIDGGR RGYHTSGMTP NSRKKGPLSP TLQSLQTKTL
PAMPDEQDRK RFVGCLAAVL ASLYDFDVVD DDEDLSQADK VLSMAYLDRS EQDDEDEHSL
SPTKASRNNT KDSSSLYSRS VDGFDTPARA QQYRSMISSR SSMQIDRGTR DQLQKARSRH
RKRRYDILSD LLLASGDYLQ LESGQVKAFL PMLAKLLVPN SDKKEASHAS QLSAQHGQRP
HSNSSNTSSN LAAMDNQEKT ILQRSGISGA NVNGFTNSEE MVHLELDDIE YLRPFLESLT
PGAGLRCVAL LLLQYLLLHS RQTGYDARVR HAIKTLGVLV LVHDMQHDPV DVYIDDELKK
PSSSPRTRRH RGHLKTSHPD LVVLATRKFE SLEHFIAAKL IVLSREQQAH KVHRGARSAG
ARSSQTQQTP ASKGLTREQW MRGIKIGGTA MAAGTLFAIT GGLAAPGIAA GVAAIAGGTA
VTAAAAAVLT STAAVTTIFG VGGGGLAAYK MQRRTQGLTE FEFRKETGKA SREKEGQIDT
VDAELFSTIC ISGWLRDKFD FQRPWGVSPS RPELTDRQEL LERFYTIHSP SHISRCAKIL
DHWKGEEKDL WGLLRQKYGQ DPDHLFPLEK GPRLHASLTL EQKEVIDQLF VELGYTPKSL
DEIKTQPTPF ERIRKGWNKQ AAGPRRDENL STSHIPVGPA HRSLADSLQS PESVETYVGS
RAEVTSSGFE SFSTALSMLP PDKRSDESTE KVELPRHIAT VWDYPSIYGG EQYTVQWESE
LLTELCDSVN DLARDLVSGG TAQILKHTAL STLISAFAWP YALVNAANMI DGTWTLAVER
SDEAGRELAR SLLLSRAGHR PVTLVGFSFG ARAIYSCLKE LARLQEKWED FCEDEDSSRS
GKVLQNQSVA DLELDESNKD YFRYMREPAS IVEDVVLMGL PNHLSLSSWK ACRQVVAGRL
INCFSQKDLI LSLMFQFKRL GLKPVCGTCP VNVPGVENID VSDLVSGHQD YTLVNGDILK
RVRHCQPFRS RHTRIFVPEV AASSM