Gene Tery_3487 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3487 
Symbol 
ID4244487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp5367357 
End bp5372069 
Gene Length4713 bp 
Protein Length1570 aa 
Translation table11 
GC content42% 
IMG OID638108461 
Productfilamentous haemagglutinin outer membrane protein 
Protein accessionYP_723050 
Protein GI113476989 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2831] Hemolysin activation/secretion protein
[COG3210] Large exoproteins involved in heme utilization or adhesion 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGAT GTCTTATGGC TTTTCCTTTC CTTTCCATAG CCTTTCCTTT CCTTTCCGTC 
TGCACTTTCC TTGTTTTTTC CCCCTTCGCC CTCGCTCAAC TCCTGCCAGA TAATACATTG
GGAGAAGAAA ATTCTGTCGT CACACCCAAT AATATAAAAG GGAAAGATCG CATCATAATA
GATGGCGGGG CCCAAAGAGG GGCCAACTTA TTCCACAGTT TTCAAGAATT CAACATTCAA
CAGGGACGAG GGGTTTACTT TAGCAACCCT GATGGAGTTC AAAATATCCT GACGAGGGTC
ACAGGAAACA AGGCTTCTCA GATATTTGGC ACCTTGGGTG TAGATGGAAA AGCTGATTTA
TTTTTCATTA ACCCCAATGG CATAATTTTT GGGCCAGAGG CTAAACTAGA TGTCCAAGGT
TCATTCTTTG GAGCTACTGC GGATAGTATC TTATTTAACA ATGGATTTGA ATTTGCTAGT
TCAGACCCTC AAGCACCCCC ACTACTAACA GTTAATGTTC CCATTGGTTT GAGGTTGCCC
GAAAACCCTG GTAGTATTTT GGTTGAGGGT CTAGGTCATA ACATAAATAT GTCAGAGTAC
CTAGAAACGA TATTCGACGG AAAGGAAAAA GGCCCTCGAG ACGGAAAAGA GGACCGGAAT
CTTGGGTTGG AAGTTTCCCA AGGCCAAACC TTGGGGTTAG TTGGTGGTGA TATATTCTTA
AGGGGGGGAA ACCTCACATC AATTGGAGGA CGTGTTGAAT TAGGAAGTGT TCACAATGGG
GTAGTAGGAA TCAGGAAAGA CCAAACCCTT AGCTATGATC CCAACCTGAA ATTTGGCCAA
ATACAAATAT CTAAAGAGAG CTCTATAGAT GTAAGTGGAA ATGGAAGAGT TGAATTTCAA
ATTCAAGGAG AAGAGATAAA AGTGCGAGAT GCTTCAATAA TTACAGGAGA GATTAAAGGC
GAAGGCGAAG GCGAAGGCGA AAAAGGGGGA AATTCAATTA TTAAAGCGTC TAAGTCTATT
AAAATAGAAG CCGGAGAGTA TTCGGAACTA ACTTTAAGGT TAACTTCCGA TAAACTTCCG
AAACAAGATT TTAATTTTAA TAAGACTTAC AGTGCTATTA TTCTAAGCAC TTTCGGTAAG
AGAGATGTCG GGGAGTTGAA GGTGGAAACA GAAAGTTTAA CGCTGCAAGA TGGAGCCTTT
ATTTCTACGA CTACTTCTGG TCAGGGAAAT GCAGGGAATA TTACAGTCTC TGCTAGAGAC
ATAGAAATGA ATGGTGGCGG TGACGGTGCT ACTTTGGTTA CAATGACATT TGCATTTGCA
AAAGGTAACG CAGGGAATTT AACGCTGAAA GCTGCAGAAA GTTTAACGCT GCACGACGCT
ATCATTTCTA CGGGATCTTT TGGTTCAGCT CAGGGAAATA CAGGGAAATT AACAGTCTCC
GCTAATGCTA TAGAAATAAA TGCCGTAGCG AGCGAGGGGG CTGGTTTTTT TAGCGAGCGA
GGCAGTGGCA AAGGTAAGGG AGGGAATTTA ACGGTGAAAG CTGCAGAAAG TTTAACGCTA
GTTGGCAATG GATTTATTTC TACCAGTACT TTTGGTGATG GAGATGCAGG GAATTTAACA
GTGTCTGCTA GAGATATAGA AATGAGTGGG ACAAGTCGGA TGGCGGCACA AACAGGTACA
AGGCAAACAG GTGGCAAAGG TAAGGGAGGG ACTTTAATGG TGAAAGCTGC AGAAAGTTTA
ACGCTGCAAG ATAAAGCCTT TATTTCTACC GATACTAAAG GTGATGGAGA TGCAGGGAAT
TTAACAGTGT CTGCTAGAGA TATAGAAATG AACAGTGTCT TTGGTTTGCT TGCACAGACA
GAAAACAAAG GTAATGGAGG GAATTTAACG GTGAAAGCTG CAGAAAGTTT AACGCTAGTT
GGCAATGGAG TTATTTCTAC CAGTGCTTTT GGTGATGGAG ATGCAGGGAA TTTAACAGTG
TCTGCTAGAG ATATAGAAAT GAGTGGGAAA AGTCGGATGG TGGCACAAAC AGGTAGAAGG
CAAACAGGTG GCAAAGGTAA GGGAGGGACT TTAATGGTGA AAGCTGCAGA AAGTTTAACG
CTGCAAGATG AAGCCTTTAT TTCTACCACT ACTTTTGGTG ATGGAGATGC AGGGAATTTA
ACAGTGTCTG CTAGAGATAT AGAAATGAGT AGTATTTTTG GTTCTGGTGT GTTTGCACAG
ACAGAAAACA AAGGTAATGG AGGGAATTTA ACGGTGGAAG TTGCAGAAAG TTTAACACTG
CAAGATGGAG CCATTATTTC TACCAGAAGT CTTGGTGATG GAGATGCAGG AAATATAACT
ATCTCTGCTA GAGATATAGA AATAATTGGC ACATCTCCTG ATGGTAAATA CTTCAGTAGC
TTAACCGCAG AAGGCAGAGC AAAAGGAGCA GCAGGAAGCA TAACCATCAA CAGCAACAAC
ATCCGCCTCG AAAACCAAGC TAACCTCAAA GCTGAAACCC AAGCAGGGGG TGAAGGCAGC
ATTATCATCA ATAATAACAA AGACCTCATC CTGCGCAACA AAAGCAAAAT AACCACTAGT
GCTAAAGGCA GGGCCACTGG AGGTAATATT ACTATTAAGA CGGAAAACTT AGTCGCCCCA
GATAACAGCG ACATCTCTGC TGACGCTGAA AAAGCATCTG GAGGCCAAGT CGACATCACC
GCAGCAGGTA TATTCGGCAT AAAATTCCGT CCTGAAAAAA CAGAAAGAAG TGATATCACA
GTCACCTCAG AATTTGACAG AGATGGAGAA GTCAATATCA ACACCCCCGA TGTAGATCCT
AGTGACGGAC TAGTAGAATT TGACGATACC ATCCCAGATA TCTCCAACCT CCTCGACCAA
AACCCTTGCC GAAAAGGACA AAACAGCAAA TTCATCATTA CAGGAAGAGG TGGCTTACCA
CCAACTCTCG AAGATCCCTT TACTCCCACC CACATCTGGC AAGGCTCAGA AACCTCAGTT
ACTCCTCCAA AAGCCATCAC TCCATCAGAG CAAGAATTCA TTGAAGCACA AGGTTGGATC
TCTAATTCTC AAGGTATTGA ACTGATCACC AACCCCGAAA CTGCTACTCC CACCTCCCCT
TGGTTAATAC CTCCCAACTG CAAACAATTA GACGCCTTTG AACCCCCACC CTATCTACTC
GCTAGCAGTA AGGGAAATAT TCCCACCTCA ACTTCTGAAC CTCTAAAAGT TACTATCAAA
CAATTCAACT TTGAGGGTAA CACAAAATTT AGCAACCAAG AGCTACAACA ACAACTCACA
CCTTATCTAA ATAAACCCAT TACCTTTGCT CAACTCATAG CAGCACGCAC TGCTATTACA
AAATATTATA CCAAAAACAA CTACATCACA TCGGGAGCAT TTATTCCTCC CCAAACTATG
TCTGACAATG GCACAGTTAC CCTCCGAGTA GTAGAAGGAA AAGTTGGTGA AATCAACGTC
AATATCCAAG GTAGATTAAA TGAAAATTAT ATCAAAAGTC GGTTAGAAAA AGCTACCACA
GCTCCTCTGA ATCAAGAAAA ATTGGTGTCT GCCCTACAAC TATTACAAAT AGACCCTTTA
GTCAAAACTA TATCTGCGGA ACTTTCATCC GGAGTCCGCC CAGATACAAG TAGGTTAGAC
ATAAGAATAG AAACAGCTAA CCCCTGGCAA ATAGAAACTA TCAGTAATAA TGGCAGGGCA
CCAAGTGTGG GAACATTCAG GCGAGGGGTA GAAGTAGGTC ATAGGAATGT TACAGGTATT
GGAGACAGTT TAAATGCTCT CTACACGAAT ACTGATGGCA GCGACATGGT AGAAGTATCT
TATGCTATTC CTGTGAACTC CAGTAATGGT CGTATCGAAC TATTTTATCG CCACAGAGAT
AATAAGGTAG TAGAATCACC TTTTGAAAGA CTCGATATTG AGTCAAATTC AAATACTTAC
AAGTTATCAT TTAGTCAACC AATAATGCAA ACACCCTGGC AAACTTTAAC TTTGGGTTTA
TCTGCAATAA AGCGAGATAG TCAAACTTCC ATATTAGGAG AGAATTATCC ATTATCTGAG
GGTGCAGATG AAAATGGGGA AACGAAATTA TCAATTTTGC AGCTTTTCCA AGATTATGAA
CTACGGGGAA AAAATCAAGT TTTGGCGTTT AATTCTCAGT TGAATGTAGG GTTGGGAATA
TTGGATGCCA CAAAGAATAG TAGTGAACCG GATAGTCGAT TTTTGTATTG GCGGGGTCAG
GGACAATGGG TGCGTGAGTT AGCCAAAAAT ACTTTGTTGG TGGTTGGAGC AGATCTACAG
TTATCTCCAT TTGATTTGGT GCCTAAAGAG CAGTTTGGTT TAGGAGGTTA TCGAAGCGTG
AGAGCCTATC GTCAGGATAC TCGTTTAACA GATAATGGAG CGTTGGGAAC AGTGGAGTTG
CGCTTGCCGC TGCCCTGGAT ATCTGGAAAA AATAGATTAT TTCAGGTAGT GCCGTTTATT
GATGGGGGGG TAGCATGGAA TAGTGATGAT GAAGAGGTAG AAGGAAGTAA GGCTTTGGCG
GCGGCCGGGG TGGGGTTGCA GGTAAATTTA TGGGAGAAGA TAAATATGCG CTTAGATTAT
GGAATTCCTT TAGTGGATGT GGACTCGCCA GATAAAACGG CTCAGGAGGA AGGGTTTTAT
TTTTCTTTTT CTACTACTCC TTTTTCTTTT TGA
 
Protein sequence
MNRCLMAFPF LSIAFPFLSV CTFLVFSPFA LAQLLPDNTL GEENSVVTPN NIKGKDRIII 
DGGAQRGANL FHSFQEFNIQ QGRGVYFSNP DGVQNILTRV TGNKASQIFG TLGVDGKADL
FFINPNGIIF GPEAKLDVQG SFFGATADSI LFNNGFEFAS SDPQAPPLLT VNVPIGLRLP
ENPGSILVEG LGHNINMSEY LETIFDGKEK GPRDGKEDRN LGLEVSQGQT LGLVGGDIFL
RGGNLTSIGG RVELGSVHNG VVGIRKDQTL SYDPNLKFGQ IQISKESSID VSGNGRVEFQ
IQGEEIKVRD ASIITGEIKG EGEGEGEKGG NSIIKASKSI KIEAGEYSEL TLRLTSDKLP
KQDFNFNKTY SAIILSTFGK RDVGELKVET ESLTLQDGAF ISTTTSGQGN AGNITVSARD
IEMNGGGDGA TLVTMTFAFA KGNAGNLTLK AAESLTLHDA IISTGSFGSA QGNTGKLTVS
ANAIEINAVA SEGAGFFSER GSGKGKGGNL TVKAAESLTL VGNGFISTST FGDGDAGNLT
VSARDIEMSG TSRMAAQTGT RQTGGKGKGG TLMVKAAESL TLQDKAFIST DTKGDGDAGN
LTVSARDIEM NSVFGLLAQT ENKGNGGNLT VKAAESLTLV GNGVISTSAF GDGDAGNLTV
SARDIEMSGK SRMVAQTGRR QTGGKGKGGT LMVKAAESLT LQDEAFISTT TFGDGDAGNL
TVSARDIEMS SIFGSGVFAQ TENKGNGGNL TVEVAESLTL QDGAIISTRS LGDGDAGNIT
ISARDIEIIG TSPDGKYFSS LTAEGRAKGA AGSITINSNN IRLENQANLK AETQAGGEGS
IIINNNKDLI LRNKSKITTS AKGRATGGNI TIKTENLVAP DNSDISADAE KASGGQVDIT
AAGIFGIKFR PEKTERSDIT VTSEFDRDGE VNINTPDVDP SDGLVEFDDT IPDISNLLDQ
NPCRKGQNSK FIITGRGGLP PTLEDPFTPT HIWQGSETSV TPPKAITPSE QEFIEAQGWI
SNSQGIELIT NPETATPTSP WLIPPNCKQL DAFEPPPYLL ASSKGNIPTS TSEPLKVTIK
QFNFEGNTKF SNQELQQQLT PYLNKPITFA QLIAARTAIT KYYTKNNYIT SGAFIPPQTM
SDNGTVTLRV VEGKVGEINV NIQGRLNENY IKSRLEKATT APLNQEKLVS ALQLLQIDPL
VKTISAELSS GVRPDTSRLD IRIETANPWQ IETISNNGRA PSVGTFRRGV EVGHRNVTGI
GDSLNALYTN TDGSDMVEVS YAIPVNSSNG RIELFYRHRD NKVVESPFER LDIESNSNTY
KLSFSQPIMQ TPWQTLTLGL SAIKRDSQTS ILGENYPLSE GADENGETKL SILQLFQDYE
LRGKNQVLAF NSQLNVGLGI LDATKNSSEP DSRFLYWRGQ GQWVRELAKN TLLVVGADLQ
LSPFDLVPKE QFGLGGYRSV RAYRQDTRLT DNGALGTVEL RLPLPWISGK NRLFQVVPFI
DGGVAWNSDD EEVEGSKALA AAGVGLQVNL WEKINMRLDY GIPLVDVDSP DKTAQEEGFY
FSFSTTPFSF