Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3487 |
Symbol | |
ID | 4244487 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 5367357 |
End bp | 5372069 |
Gene Length | 4713 bp |
Protein Length | 1570 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 638108461 |
Product | filamentous haemagglutinin outer membrane protein |
Protein accession | YP_723050 |
Protein GI | 113476989 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2831] Hemolysin activation/secretion protein [COG3210] Large exoproteins involved in heme utilization or adhesion |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGAT GTCTTATGGC TTTTCCTTTC CTTTCCATAG CCTTTCCTTT CCTTTCCGTC TGCACTTTCC TTGTTTTTTC CCCCTTCGCC CTCGCTCAAC TCCTGCCAGA TAATACATTG GGAGAAGAAA ATTCTGTCGT CACACCCAAT AATATAAAAG GGAAAGATCG CATCATAATA GATGGCGGGG CCCAAAGAGG GGCCAACTTA TTCCACAGTT TTCAAGAATT CAACATTCAA CAGGGACGAG GGGTTTACTT TAGCAACCCT GATGGAGTTC AAAATATCCT GACGAGGGTC ACAGGAAACA AGGCTTCTCA GATATTTGGC ACCTTGGGTG TAGATGGAAA AGCTGATTTA TTTTTCATTA ACCCCAATGG CATAATTTTT GGGCCAGAGG CTAAACTAGA TGTCCAAGGT TCATTCTTTG GAGCTACTGC GGATAGTATC TTATTTAACA ATGGATTTGA ATTTGCTAGT TCAGACCCTC AAGCACCCCC ACTACTAACA GTTAATGTTC CCATTGGTTT GAGGTTGCCC GAAAACCCTG GTAGTATTTT GGTTGAGGGT CTAGGTCATA ACATAAATAT GTCAGAGTAC CTAGAAACGA TATTCGACGG AAAGGAAAAA GGCCCTCGAG ACGGAAAAGA GGACCGGAAT CTTGGGTTGG AAGTTTCCCA AGGCCAAACC TTGGGGTTAG TTGGTGGTGA TATATTCTTA AGGGGGGGAA ACCTCACATC AATTGGAGGA CGTGTTGAAT TAGGAAGTGT TCACAATGGG GTAGTAGGAA TCAGGAAAGA CCAAACCCTT AGCTATGATC CCAACCTGAA ATTTGGCCAA ATACAAATAT CTAAAGAGAG CTCTATAGAT GTAAGTGGAA ATGGAAGAGT TGAATTTCAA ATTCAAGGAG AAGAGATAAA AGTGCGAGAT GCTTCAATAA TTACAGGAGA GATTAAAGGC GAAGGCGAAG GCGAAGGCGA AAAAGGGGGA AATTCAATTA TTAAAGCGTC TAAGTCTATT AAAATAGAAG CCGGAGAGTA TTCGGAACTA ACTTTAAGGT TAACTTCCGA TAAACTTCCG AAACAAGATT TTAATTTTAA TAAGACTTAC AGTGCTATTA TTCTAAGCAC TTTCGGTAAG AGAGATGTCG GGGAGTTGAA GGTGGAAACA GAAAGTTTAA CGCTGCAAGA TGGAGCCTTT ATTTCTACGA CTACTTCTGG TCAGGGAAAT GCAGGGAATA TTACAGTCTC TGCTAGAGAC ATAGAAATGA ATGGTGGCGG TGACGGTGCT ACTTTGGTTA CAATGACATT TGCATTTGCA AAAGGTAACG CAGGGAATTT AACGCTGAAA GCTGCAGAAA GTTTAACGCT GCACGACGCT ATCATTTCTA CGGGATCTTT TGGTTCAGCT CAGGGAAATA CAGGGAAATT AACAGTCTCC GCTAATGCTA TAGAAATAAA TGCCGTAGCG AGCGAGGGGG CTGGTTTTTT TAGCGAGCGA GGCAGTGGCA AAGGTAAGGG AGGGAATTTA ACGGTGAAAG CTGCAGAAAG TTTAACGCTA GTTGGCAATG GATTTATTTC TACCAGTACT TTTGGTGATG GAGATGCAGG GAATTTAACA GTGTCTGCTA GAGATATAGA AATGAGTGGG ACAAGTCGGA TGGCGGCACA AACAGGTACA AGGCAAACAG GTGGCAAAGG TAAGGGAGGG ACTTTAATGG TGAAAGCTGC AGAAAGTTTA ACGCTGCAAG ATAAAGCCTT TATTTCTACC GATACTAAAG GTGATGGAGA TGCAGGGAAT TTAACAGTGT CTGCTAGAGA TATAGAAATG AACAGTGTCT TTGGTTTGCT TGCACAGACA GAAAACAAAG GTAATGGAGG GAATTTAACG GTGAAAGCTG CAGAAAGTTT AACGCTAGTT GGCAATGGAG TTATTTCTAC CAGTGCTTTT GGTGATGGAG ATGCAGGGAA TTTAACAGTG TCTGCTAGAG ATATAGAAAT GAGTGGGAAA AGTCGGATGG TGGCACAAAC AGGTAGAAGG CAAACAGGTG GCAAAGGTAA GGGAGGGACT TTAATGGTGA AAGCTGCAGA AAGTTTAACG CTGCAAGATG AAGCCTTTAT TTCTACCACT ACTTTTGGTG ATGGAGATGC AGGGAATTTA ACAGTGTCTG CTAGAGATAT AGAAATGAGT AGTATTTTTG GTTCTGGTGT GTTTGCACAG ACAGAAAACA AAGGTAATGG AGGGAATTTA ACGGTGGAAG TTGCAGAAAG TTTAACACTG CAAGATGGAG CCATTATTTC TACCAGAAGT CTTGGTGATG GAGATGCAGG AAATATAACT ATCTCTGCTA GAGATATAGA AATAATTGGC ACATCTCCTG ATGGTAAATA CTTCAGTAGC TTAACCGCAG AAGGCAGAGC AAAAGGAGCA GCAGGAAGCA TAACCATCAA CAGCAACAAC ATCCGCCTCG AAAACCAAGC TAACCTCAAA GCTGAAACCC AAGCAGGGGG TGAAGGCAGC ATTATCATCA ATAATAACAA AGACCTCATC CTGCGCAACA AAAGCAAAAT AACCACTAGT GCTAAAGGCA GGGCCACTGG AGGTAATATT ACTATTAAGA CGGAAAACTT AGTCGCCCCA GATAACAGCG ACATCTCTGC TGACGCTGAA AAAGCATCTG GAGGCCAAGT CGACATCACC GCAGCAGGTA TATTCGGCAT AAAATTCCGT CCTGAAAAAA CAGAAAGAAG TGATATCACA GTCACCTCAG AATTTGACAG AGATGGAGAA GTCAATATCA ACACCCCCGA TGTAGATCCT AGTGACGGAC TAGTAGAATT TGACGATACC ATCCCAGATA TCTCCAACCT CCTCGACCAA AACCCTTGCC GAAAAGGACA AAACAGCAAA TTCATCATTA CAGGAAGAGG TGGCTTACCA CCAACTCTCG AAGATCCCTT TACTCCCACC CACATCTGGC AAGGCTCAGA AACCTCAGTT ACTCCTCCAA AAGCCATCAC TCCATCAGAG CAAGAATTCA TTGAAGCACA AGGTTGGATC TCTAATTCTC AAGGTATTGA ACTGATCACC AACCCCGAAA CTGCTACTCC CACCTCCCCT TGGTTAATAC CTCCCAACTG CAAACAATTA GACGCCTTTG AACCCCCACC CTATCTACTC GCTAGCAGTA AGGGAAATAT TCCCACCTCA ACTTCTGAAC CTCTAAAAGT TACTATCAAA CAATTCAACT TTGAGGGTAA CACAAAATTT AGCAACCAAG AGCTACAACA ACAACTCACA CCTTATCTAA ATAAACCCAT TACCTTTGCT CAACTCATAG CAGCACGCAC TGCTATTACA AAATATTATA CCAAAAACAA CTACATCACA TCGGGAGCAT TTATTCCTCC CCAAACTATG TCTGACAATG GCACAGTTAC CCTCCGAGTA GTAGAAGGAA AAGTTGGTGA AATCAACGTC AATATCCAAG GTAGATTAAA TGAAAATTAT ATCAAAAGTC GGTTAGAAAA AGCTACCACA GCTCCTCTGA ATCAAGAAAA ATTGGTGTCT GCCCTACAAC TATTACAAAT AGACCCTTTA GTCAAAACTA TATCTGCGGA ACTTTCATCC GGAGTCCGCC CAGATACAAG TAGGTTAGAC ATAAGAATAG AAACAGCTAA CCCCTGGCAA ATAGAAACTA TCAGTAATAA TGGCAGGGCA CCAAGTGTGG GAACATTCAG GCGAGGGGTA GAAGTAGGTC ATAGGAATGT TACAGGTATT GGAGACAGTT TAAATGCTCT CTACACGAAT ACTGATGGCA GCGACATGGT AGAAGTATCT TATGCTATTC CTGTGAACTC CAGTAATGGT CGTATCGAAC TATTTTATCG CCACAGAGAT AATAAGGTAG TAGAATCACC TTTTGAAAGA CTCGATATTG AGTCAAATTC AAATACTTAC AAGTTATCAT TTAGTCAACC AATAATGCAA ACACCCTGGC AAACTTTAAC TTTGGGTTTA TCTGCAATAA AGCGAGATAG TCAAACTTCC ATATTAGGAG AGAATTATCC ATTATCTGAG GGTGCAGATG AAAATGGGGA AACGAAATTA TCAATTTTGC AGCTTTTCCA AGATTATGAA CTACGGGGAA AAAATCAAGT TTTGGCGTTT AATTCTCAGT TGAATGTAGG GTTGGGAATA TTGGATGCCA CAAAGAATAG TAGTGAACCG GATAGTCGAT TTTTGTATTG GCGGGGTCAG GGACAATGGG TGCGTGAGTT AGCCAAAAAT ACTTTGTTGG TGGTTGGAGC AGATCTACAG TTATCTCCAT TTGATTTGGT GCCTAAAGAG CAGTTTGGTT TAGGAGGTTA TCGAAGCGTG AGAGCCTATC GTCAGGATAC TCGTTTAACA GATAATGGAG CGTTGGGAAC AGTGGAGTTG CGCTTGCCGC TGCCCTGGAT ATCTGGAAAA AATAGATTAT TTCAGGTAGT GCCGTTTATT GATGGGGGGG TAGCATGGAA TAGTGATGAT GAAGAGGTAG AAGGAAGTAA GGCTTTGGCG GCGGCCGGGG TGGGGTTGCA GGTAAATTTA TGGGAGAAGA TAAATATGCG CTTAGATTAT GGAATTCCTT TAGTGGATGT GGACTCGCCA GATAAAACGG CTCAGGAGGA AGGGTTTTAT TTTTCTTTTT CTACTACTCC TTTTTCTTTT TGA
|
Protein sequence | MNRCLMAFPF LSIAFPFLSV CTFLVFSPFA LAQLLPDNTL GEENSVVTPN NIKGKDRIII DGGAQRGANL FHSFQEFNIQ QGRGVYFSNP DGVQNILTRV TGNKASQIFG TLGVDGKADL FFINPNGIIF GPEAKLDVQG SFFGATADSI LFNNGFEFAS SDPQAPPLLT VNVPIGLRLP ENPGSILVEG LGHNINMSEY LETIFDGKEK GPRDGKEDRN LGLEVSQGQT LGLVGGDIFL RGGNLTSIGG RVELGSVHNG VVGIRKDQTL SYDPNLKFGQ IQISKESSID VSGNGRVEFQ IQGEEIKVRD ASIITGEIKG EGEGEGEKGG NSIIKASKSI KIEAGEYSEL TLRLTSDKLP KQDFNFNKTY SAIILSTFGK RDVGELKVET ESLTLQDGAF ISTTTSGQGN AGNITVSARD IEMNGGGDGA TLVTMTFAFA KGNAGNLTLK AAESLTLHDA IISTGSFGSA QGNTGKLTVS ANAIEINAVA SEGAGFFSER GSGKGKGGNL TVKAAESLTL VGNGFISTST FGDGDAGNLT VSARDIEMSG TSRMAAQTGT RQTGGKGKGG TLMVKAAESL TLQDKAFIST DTKGDGDAGN LTVSARDIEM NSVFGLLAQT ENKGNGGNLT VKAAESLTLV GNGVISTSAF GDGDAGNLTV SARDIEMSGK SRMVAQTGRR QTGGKGKGGT LMVKAAESLT LQDEAFISTT TFGDGDAGNL TVSARDIEMS SIFGSGVFAQ TENKGNGGNL TVEVAESLTL QDGAIISTRS LGDGDAGNIT ISARDIEIIG TSPDGKYFSS LTAEGRAKGA AGSITINSNN IRLENQANLK AETQAGGEGS IIINNNKDLI LRNKSKITTS AKGRATGGNI TIKTENLVAP DNSDISADAE KASGGQVDIT AAGIFGIKFR PEKTERSDIT VTSEFDRDGE VNINTPDVDP SDGLVEFDDT IPDISNLLDQ NPCRKGQNSK FIITGRGGLP PTLEDPFTPT HIWQGSETSV TPPKAITPSE QEFIEAQGWI SNSQGIELIT NPETATPTSP WLIPPNCKQL DAFEPPPYLL ASSKGNIPTS TSEPLKVTIK QFNFEGNTKF SNQELQQQLT PYLNKPITFA QLIAARTAIT KYYTKNNYIT SGAFIPPQTM SDNGTVTLRV VEGKVGEINV NIQGRLNENY IKSRLEKATT APLNQEKLVS ALQLLQIDPL VKTISAELSS GVRPDTSRLD IRIETANPWQ IETISNNGRA PSVGTFRRGV EVGHRNVTGI GDSLNALYTN TDGSDMVEVS YAIPVNSSNG RIELFYRHRD NKVVESPFER LDIESNSNTY KLSFSQPIMQ TPWQTLTLGL SAIKRDSQTS ILGENYPLSE GADENGETKL SILQLFQDYE LRGKNQVLAF NSQLNVGLGI LDATKNSSEP DSRFLYWRGQ GQWVRELAKN TLLVVGADLQ LSPFDLVPKE QFGLGGYRSV RAYRQDTRLT DNGALGTVEL RLPLPWISGK NRLFQVVPFI DGGVAWNSDD EEVEGSKALA AAGVGLQVNL WEKINMRLDY GIPLVDVDSP DKTAQEEGFY FSFSTTPFSF
|
| |