Gene Tery_3489 details


Gene Information

Locus tag: Tery_3489
Symbol:
ID: 4244489
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 5375265
End bp: 5379440
Gene Length: 4176 bp
Protein Length: 1391 aa
Translation table: 11
GC content: 41%
IMG OID: 638108463
Product: filamentous haemagglutinin outer membrane protein
Protein accession: YP_723052
Protein GI: 113476991
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2831] Hemolysin activation/secretion protein
        [COG3210] Large exoproteins involved in heme utilization or adhesion
TIGRFAM ID: [TIGR01901] filamentous haemagglutinin family N-terminal domain
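
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the record for Tery_3489.
print(cds_lengths(5375265, 5379440))  # (4176, 1391)
```

Under these assumptions the result matches the listed Gene Length (4176 bp) and Protein Length (1391 aa).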


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTTTT ATTTTGTTGT AACTCGTCAA TATTTACTTA AATCTCACAC AAATTTTACA 
ATAACGCTAC GCCGCAAATC ATCTACAATA AATCGCTATT TAAGAACAGA ATCTTTAAAT
TCAGAAGATT TATGGCGAAA TCTTACTCTT AGTGTTTTTC TCCTCTTTGG ATTTTTTTCC
CCCTCTGCCC TCGCTTTTTT TATTCATTTT CTCCTCTTTG GATTTTTTTC CCCCTCCGCC
CTCGCTCAAC TCCAACCAGA TAATACATTG GGAGAAGAAA ATTCAGTCGT CACACCGAAT
ATCAATATTA AAGGGATAGA AAGCGATCGC ATAGATGGCG GGGCCCAAAG AGGGGCCAAC
TTATTCCACA GTTTTCAAGA ATTCAACATT CAACAGGGAC GAGGGGTTTA CTTTAGCAAC
CCTGATGGAG TTCAAAATAT CCTGACTCGG GTCACAGGAA ACAATGGCTC TAACATATTA
GGAACCTTGG GTGTAGATGG AAAAGCTGAT TTATTTTTCA TTAACCCCAA TGGCATAATC
TTTGGACCAG AGGCTAAACT AGATGTCCAA GGTTCATTCT ATGGAGCTAC TGCCGATAGT
ATCTTATTTA AAAATGGATT TGAATTTGCT AGTTCAGACC CTCAAGCGCC CCCACTACTT
ACAGTTAATG TTCCCATTGG TTTGAGGTTG CCCGAAAAGC CAGGTAGTAT TCTGGTTGAG
GGTCAAGGTC ATAACATAAA TATGTCAGAG TACCTAGAAA CATATGGATA TAGAAACAGA
AAAGAGGACC GGAATCTAGG GTTGGAAGTT TCCCAAGGCC AAACCTTAGG GTTAGTTGGT
GGTGATATAT TCTTAAGGGG GGGAAACCTC ACATCAGATG GAGGACGTGT TGAATTAGGA
AGTGTTCAGA ATGGTGTAGT AGAAATCACT ACCGACCGTA CCCTTAGCTA TGATAAAAAT
CTGGAATTTG GCCAAATCAT ACTATCAGAA GCGAGTTCTA TAGATATAAG TGGAAATGGA
AGAGTAGAAT TTCAAATTCA AGGAGAAGAG ATCAACGTGC AAGATAATTC ACTAATTTTA
GGAAAGATAT TGGGTGATGA GGACGGAGGA AATTCAATTA TTAGAGCATC TAAGTCTATT
AAAATAGGAC CTACAAAACC TTTTGAGAAG ATTCCTGTGA AAATAAAAAT TGTTAATTTT
ACCTATACTT TTAATAAGAC TATGAGTTCT ATTTTTCTAA ACACTTATGG TAAGGGAGAT
GTCGGAGAGT TGAGGGTGAA AACAGAAAGT TTAACGCTGC AAGATGGAGC CGTTATTTAT
ACCGGTACTG TGGGTCAGGG AAATGCAGGG AATTTAACAG TCTCTGCTAG AGACATAGAA
ATAAATGGTG GGAATTATGC GACTGGATTT TTTCCAACGA CATTAGCAGA AGGTAACGGA
GGGAATTTGA CGGTGGAAGT TGCAGAAAGT TTAACGCTGC AAGATGGAGC CGTTATGTCT
GCCGGTACTG GGTATCAGGG AAATGCAGGG AATTTAACAG TCTCTGCTAG AGACATAGAA
ATGAGTGGTT TTGTTGTTGT TACAACAATC ACTGAAACAA CAGGTAACGG AGGGAATTTG
ACGGTGGAAG TTGCAGAAAG TTTAACGCTG CAAGATGGAG CCGTTATGTC TGCCGGTACT
GTGGGTCAGG GAAATGCAGG GAATTTAATA GTCTCTGCTA CAGACATAGA AATAATAGGC
AAATCTCCTG ATGGTAAATC CTTCAGTCGC TTAACCGCAG AAGCAACAAA TGAATCAACA
GGAGCAGCAG GAACCATAAC CATCAACACC GAAACCCTCA ACCTCCGGAA CGAAGCCGAA
ATTATAGTCA CCTCTGCTAC CCAACAAGCT GCAGGTGACT TAATAATAAA CAGCAACAAC
ATCCGCATCG AAAACCAAGC TAGTCTCAAA GCTGAAACCA AAGGAGGTCA AGGCAGCATT
ACCATCAATA ATAATAAAGA CCTCATCCTG CGTAACAACA GCAACATCAC CACTAATTCT
GAAGGCACGG CTACTGGAGG TAATATTACC ATTAATACGG AAAACCTAGT TGCCCGAGAC
AACAGCGACA TCACTGCTAA CGCTGTCGAA GGATTTGGAG GCCGAGTTAA CATCACCGCA
GCCGGCATAT TCGGCACAGA ATTCCGTCCC GAACAAACCG AAAAAAGTGA CATCACCGCC
ACCTCAAAAT TAGGAGCTTC ATTCAGTGGA GAAGTCAATA TCAACACCCC TGATGTAGAT
CCTAGTGACG GGCTAGTAGA ATTTGAGGAT ACCATCCCAG ATATCTCCAA CCTCCTCGAC
CAAAACCCTT GCCGACAAGG ACAAAAAAGC AAATTCATCA TTACAGGAAG AGGCGGCTTA
CCACCAACTC TCGAAGAACC CTTGGCTCCC ACCTACCTCT GGCAAGACTC AGAAACCTCA
GTTACTCCTC CACCAGCTAT CACTCCATCA GAGCAAGAGT TCATTGAAGC ACAAGGTTGG
ATCTCTAATT CTCAAGGTAT TGAACTGATC ACTGACCCCG AAACTGCTAC TCCCACCTCC
CCTTGGTTAA TACCTCCCAA CTGCGAACAA TTAGACACCT TTGAACCCCC ACCCTATCTA
CTCGCAAGCA GTAAGGGAAA TATTCCCACC TCAACTTCTG AACCTCTAAA AGTTACTATC
AAACAATTCA ACTTTGAGGG TAACACAAAA TTTAGCAACC AAGAGCTACA ACAACAACTC
ACACCTTATC TAAATAAACC CATTACCTTT GCTCAACTAA TAGCAGCACG CACTGCCATT
ACAAAATTTT ATACTAAAAA CAACTACATC ACATCGGGAG CATTTATTCC TCCCCAAACT
ATGGCTGAAA ATGGCACAGT CACCCTCCGA GTAGTGGAAG GAAAAGTTGG TGAAATCAAT
GTCAATATCC AAGGTAGATT AAATGAAAAT TATATCAAAA GTCGGTTAGA AAAAGCCACC
ACGGCTCCTC TGAATCAAGA AAAATTGTTG TCTGCCCTAC AAATATTACA AATAGACCCT
TTAGTCAAAA CTCTATCTGC GGAACTTTCA TCCGGAGTCC GCCCAGATAC AAGTAGGTTA
GACATAAGAA TAGAAACAGC TAACCCCTGG CAAATAGAAA CTATCAGTAA TAATGGCAGG
GCACCAAGTG TGGGAACATT CAGGCGAGGG GTAGAGGTAG GCCATAGAAA TGTTACAGGT
ATTGGAGACA GTTTAAGTGC TCTCTATACT AATACTGATG GCAGCGATAT GGTAGAAGTA
TCTTATGCTA TTCCTGTGAA CTCCAGTAAT GGTCGTATCG AACTATTTTA TCGCCACAGA
GATAATAAGG TGGTAGAATC ACCTTTTGAA AGACTTGATA TTGAGTCAAA TTCAAATACT
TACAAGTTAT CATTTAGTCA ACCAATAATG CAAACACCCT GGCAAACTTT AAGTTTGGGT
TTATCTGCAA TAAAGCGAGA TAGTCAAACT TCTATATTAG GAGAGAATTA TCCATTATCT
GAGGGTGCTG ATGAAAATGG GGAAACGAAA TTATCAATTT TGCAGCTTTT CCAAGATTAT
GAACTACGGG GAAAAAATCA AGTTTTGGCG TTTAATTCTC AGTTGAATGT AGGGTTGGGA
ATATTGGATG CTACAAAGAA TAGTAATGAA CCGGATGGTC GATTTTTTTA TTGGCGGGGT
CAAGGACAAT GGGTGCGTGA GTTAGGCAAA AATACTTTGT TGGTGATGGG AGCAGATCTA
CAGTTATCTC CATCTGATTT GGTGCCTAAG GAAAGGTTTG GTTTAGGGGG TTATCGAAGT
GTGAGAGCTT ATCGTCAGGA TACTCGTTTG ACAGATAATG GAGCGTTGGG AACAGTGGAG
TTGCGGTTGC CCGTGCCCTG GATATCTGGA AAAAATAGAT TATTTCAGGT AGTGCCGTTT
ATTGATGGGG GGGTAGCATG GAATAGTGAT GGTAAAGAGG AGGAAGGAAG TAAGGCTTTG
GCGGCGGCCG GGGTGGGGTT GCAGGTAAAT TTATGGGAGA AGATAAATAT GCGCTTAGAT
TATGGAATTC CTTTAGTGGA TGTGGACTCC CTAGATAAAA CGGCTCAGGA GGAAGGGTTT
TATTTTTCTT TTTCTACTAC TCCTTTTTCT TTTTGA
 
Protein sequence
MNFYFVVTRQ YLLKSHTNFT ITLRRKSSTI NRYLRTESLN SEDLWRNLTL SVFLLFGFFS 
PSALAFFIHF LLFGFFSPSA LAQLQPDNTL GEENSVVTPN INIKGIESDR IDGGAQRGAN
LFHSFQEFNI QQGRGVYFSN PDGVQNILTR VTGNNGSNIL GTLGVDGKAD LFFINPNGII
FGPEAKLDVQ GSFYGATADS ILFKNGFEFA SSDPQAPPLL TVNVPIGLRL PEKPGSILVE
GQGHNINMSE YLETYGYRNR KEDRNLGLEV SQGQTLGLVG GDIFLRGGNL TSDGGRVELG
SVQNGVVEIT TDRTLSYDKN LEFGQIILSE ASSIDISGNG RVEFQIQGEE INVQDNSLIL
GKILGDEDGG NSIIRASKSI KIGPTKPFEK IPVKIKIVNF TYTFNKTMSS IFLNTYGKGD
VGELRVKTES LTLQDGAVIY TGTVGQGNAG NLTVSARDIE INGGNYATGF FPTTLAEGNG
GNLTVEVAES LTLQDGAVMS AGTGYQGNAG NLTVSARDIE MSGFVVVTTI TETTGNGGNL
TVEVAESLTL QDGAVMSAGT VGQGNAGNLI VSATDIEIIG KSPDGKSFSR LTAEATNEST
GAAGTITINT ETLNLRNEAE IIVTSATQQA AGDLIINSNN IRIENQASLK AETKGGQGSI
TINNNKDLIL RNNSNITTNS EGTATGGNIT INTENLVARD NSDITANAVE GFGGRVNITA
AGIFGTEFRP EQTEKSDITA TSKLGASFSG EVNINTPDVD PSDGLVEFED TIPDISNLLD
QNPCRQGQKS KFIITGRGGL PPTLEEPLAP TYLWQDSETS VTPPPAITPS EQEFIEAQGW
ISNSQGIELI TDPETATPTS PWLIPPNCEQ LDTFEPPPYL LASSKGNIPT STSEPLKVTI
KQFNFEGNTK FSNQELQQQL TPYLNKPITF AQLIAARTAI TKFYTKNNYI TSGAFIPPQT
MAENGTVTLR VVEGKVGEIN VNIQGRLNEN YIKSRLEKAT TAPLNQEKLL SALQILQIDP
LVKTLSAELS SGVRPDTSRL DIRIETANPW QIETISNNGR APSVGTFRRG VEVGHRNVTG
IGDSLSALYT NTDGSDMVEV SYAIPVNSSN GRIELFYRHR DNKVVESPFE RLDIESNSNT
YKLSFSQPIM QTPWQTLSLG LSAIKRDSQT SILGENYPLS EGADENGETK LSILQLFQDY
ELRGKNQVLA FNSQLNVGLG ILDATKNSNE PDGRFFYWRG QGQWVRELGK NTLLVMGADL
QLSPSDLVPK ERFGLGGYRS VRAYRQDTRL TDNGALGTVE LRLPVPWISG KNRLFQVVPF
IDGGVAWNSD GKEEEGSKAL AAAGVGLQVN LWEKINMRLD YGIPLVDVDS LDKTAQEEGF
YFSFSTTPFS F