Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3489 |
Symbol | |
ID | 4244489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 5375265 |
End bp | 5379440 |
Gene Length | 4176 bp |
Protein Length | 1391 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 638108463 |
Product | filamentous haemagglutinin outer membrane protein |
Protein accession | YP_723052 |
Protein GI | 113476991 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2831] Hemolysin activation/secretion protein [COG3210] Large exoproteins involved in heme utilization or adhesion |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTTTT ATTTTGTTGT AACTCGTCAA TATTTACTTA AATCTCACAC AAATTTTACA ATAACGCTAC GCCGCAAATC ATCTACAATA AATCGCTATT TAAGAACAGA ATCTTTAAAT TCAGAAGATT TATGGCGAAA TCTTACTCTT AGTGTTTTTC TCCTCTTTGG ATTTTTTTCC CCCTCTGCCC TCGCTTTTTT TATTCATTTT CTCCTCTTTG GATTTTTTTC CCCCTCCGCC CTCGCTCAAC TCCAACCAGA TAATACATTG GGAGAAGAAA ATTCAGTCGT CACACCGAAT ATCAATATTA AAGGGATAGA AAGCGATCGC ATAGATGGCG GGGCCCAAAG AGGGGCCAAC TTATTCCACA GTTTTCAAGA ATTCAACATT CAACAGGGAC GAGGGGTTTA CTTTAGCAAC CCTGATGGAG TTCAAAATAT CCTGACTCGG GTCACAGGAA ACAATGGCTC TAACATATTA GGAACCTTGG GTGTAGATGG AAAAGCTGAT TTATTTTTCA TTAACCCCAA TGGCATAATC TTTGGACCAG AGGCTAAACT AGATGTCCAA GGTTCATTCT ATGGAGCTAC TGCCGATAGT ATCTTATTTA AAAATGGATT TGAATTTGCT AGTTCAGACC CTCAAGCGCC CCCACTACTT ACAGTTAATG TTCCCATTGG TTTGAGGTTG CCCGAAAAGC CAGGTAGTAT TCTGGTTGAG GGTCAAGGTC ATAACATAAA TATGTCAGAG TACCTAGAAA CATATGGATA TAGAAACAGA AAAGAGGACC GGAATCTAGG GTTGGAAGTT TCCCAAGGCC AAACCTTAGG GTTAGTTGGT GGTGATATAT TCTTAAGGGG GGGAAACCTC ACATCAGATG GAGGACGTGT TGAATTAGGA AGTGTTCAGA ATGGTGTAGT AGAAATCACT ACCGACCGTA CCCTTAGCTA TGATAAAAAT CTGGAATTTG GCCAAATCAT ACTATCAGAA GCGAGTTCTA TAGATATAAG TGGAAATGGA AGAGTAGAAT TTCAAATTCA AGGAGAAGAG ATCAACGTGC AAGATAATTC ACTAATTTTA GGAAAGATAT TGGGTGATGA GGACGGAGGA AATTCAATTA TTAGAGCATC TAAGTCTATT AAAATAGGAC CTACAAAACC TTTTGAGAAG ATTCCTGTGA AAATAAAAAT TGTTAATTTT ACCTATACTT TTAATAAGAC TATGAGTTCT ATTTTTCTAA ACACTTATGG TAAGGGAGAT GTCGGAGAGT TGAGGGTGAA AACAGAAAGT TTAACGCTGC AAGATGGAGC CGTTATTTAT ACCGGTACTG TGGGTCAGGG AAATGCAGGG AATTTAACAG TCTCTGCTAG AGACATAGAA ATAAATGGTG GGAATTATGC GACTGGATTT TTTCCAACGA CATTAGCAGA AGGTAACGGA GGGAATTTGA CGGTGGAAGT TGCAGAAAGT TTAACGCTGC AAGATGGAGC CGTTATGTCT GCCGGTACTG GGTATCAGGG AAATGCAGGG AATTTAACAG TCTCTGCTAG AGACATAGAA ATGAGTGGTT TTGTTGTTGT TACAACAATC ACTGAAACAA CAGGTAACGG AGGGAATTTG ACGGTGGAAG TTGCAGAAAG TTTAACGCTG CAAGATGGAG CCGTTATGTC TGCCGGTACT GTGGGTCAGG GAAATGCAGG GAATTTAATA GTCTCTGCTA CAGACATAGA AATAATAGGC AAATCTCCTG ATGGTAAATC CTTCAGTCGC TTAACCGCAG AAGCAACAAA TGAATCAACA GGAGCAGCAG GAACCATAAC CATCAACACC GAAACCCTCA ACCTCCGGAA CGAAGCCGAA ATTATAGTCA CCTCTGCTAC CCAACAAGCT GCAGGTGACT TAATAATAAA CAGCAACAAC ATCCGCATCG AAAACCAAGC TAGTCTCAAA GCTGAAACCA AAGGAGGTCA AGGCAGCATT ACCATCAATA ATAATAAAGA CCTCATCCTG CGTAACAACA GCAACATCAC CACTAATTCT GAAGGCACGG CTACTGGAGG TAATATTACC ATTAATACGG AAAACCTAGT TGCCCGAGAC AACAGCGACA TCACTGCTAA CGCTGTCGAA GGATTTGGAG GCCGAGTTAA CATCACCGCA GCCGGCATAT TCGGCACAGA ATTCCGTCCC GAACAAACCG AAAAAAGTGA CATCACCGCC ACCTCAAAAT TAGGAGCTTC ATTCAGTGGA GAAGTCAATA TCAACACCCC TGATGTAGAT CCTAGTGACG GGCTAGTAGA ATTTGAGGAT ACCATCCCAG ATATCTCCAA CCTCCTCGAC CAAAACCCTT GCCGACAAGG ACAAAAAAGC AAATTCATCA TTACAGGAAG AGGCGGCTTA CCACCAACTC TCGAAGAACC CTTGGCTCCC ACCTACCTCT GGCAAGACTC AGAAACCTCA GTTACTCCTC CACCAGCTAT CACTCCATCA GAGCAAGAGT TCATTGAAGC ACAAGGTTGG ATCTCTAATT CTCAAGGTAT TGAACTGATC ACTGACCCCG AAACTGCTAC TCCCACCTCC CCTTGGTTAA TACCTCCCAA CTGCGAACAA TTAGACACCT TTGAACCCCC ACCCTATCTA CTCGCAAGCA GTAAGGGAAA TATTCCCACC TCAACTTCTG AACCTCTAAA AGTTACTATC AAACAATTCA ACTTTGAGGG TAACACAAAA TTTAGCAACC AAGAGCTACA ACAACAACTC ACACCTTATC TAAATAAACC CATTACCTTT GCTCAACTAA TAGCAGCACG CACTGCCATT ACAAAATTTT ATACTAAAAA CAACTACATC ACATCGGGAG CATTTATTCC TCCCCAAACT ATGGCTGAAA ATGGCACAGT CACCCTCCGA GTAGTGGAAG GAAAAGTTGG TGAAATCAAT GTCAATATCC AAGGTAGATT AAATGAAAAT TATATCAAAA GTCGGTTAGA AAAAGCCACC ACGGCTCCTC TGAATCAAGA AAAATTGTTG TCTGCCCTAC AAATATTACA AATAGACCCT TTAGTCAAAA CTCTATCTGC GGAACTTTCA TCCGGAGTCC GCCCAGATAC AAGTAGGTTA GACATAAGAA TAGAAACAGC TAACCCCTGG CAAATAGAAA CTATCAGTAA TAATGGCAGG GCACCAAGTG TGGGAACATT CAGGCGAGGG GTAGAGGTAG GCCATAGAAA TGTTACAGGT ATTGGAGACA GTTTAAGTGC TCTCTATACT AATACTGATG GCAGCGATAT GGTAGAAGTA TCTTATGCTA TTCCTGTGAA CTCCAGTAAT GGTCGTATCG AACTATTTTA TCGCCACAGA GATAATAAGG TGGTAGAATC ACCTTTTGAA AGACTTGATA TTGAGTCAAA TTCAAATACT TACAAGTTAT CATTTAGTCA ACCAATAATG CAAACACCCT GGCAAACTTT AAGTTTGGGT TTATCTGCAA TAAAGCGAGA TAGTCAAACT TCTATATTAG GAGAGAATTA TCCATTATCT GAGGGTGCTG ATGAAAATGG GGAAACGAAA TTATCAATTT TGCAGCTTTT CCAAGATTAT GAACTACGGG GAAAAAATCA AGTTTTGGCG TTTAATTCTC AGTTGAATGT AGGGTTGGGA ATATTGGATG CTACAAAGAA TAGTAATGAA CCGGATGGTC GATTTTTTTA TTGGCGGGGT CAAGGACAAT GGGTGCGTGA GTTAGGCAAA AATACTTTGT TGGTGATGGG AGCAGATCTA CAGTTATCTC CATCTGATTT GGTGCCTAAG GAAAGGTTTG GTTTAGGGGG TTATCGAAGT GTGAGAGCTT ATCGTCAGGA TACTCGTTTG ACAGATAATG GAGCGTTGGG AACAGTGGAG TTGCGGTTGC CCGTGCCCTG GATATCTGGA AAAAATAGAT TATTTCAGGT AGTGCCGTTT ATTGATGGGG GGGTAGCATG GAATAGTGAT GGTAAAGAGG AGGAAGGAAG TAAGGCTTTG GCGGCGGCCG GGGTGGGGTT GCAGGTAAAT TTATGGGAGA AGATAAATAT GCGCTTAGAT TATGGAATTC CTTTAGTGGA TGTGGACTCC CTAGATAAAA CGGCTCAGGA GGAAGGGTTT TATTTTTCTT TTTCTACTAC TCCTTTTTCT TTTTGA
|
Protein sequence | MNFYFVVTRQ YLLKSHTNFT ITLRRKSSTI NRYLRTESLN SEDLWRNLTL SVFLLFGFFS PSALAFFIHF LLFGFFSPSA LAQLQPDNTL GEENSVVTPN INIKGIESDR IDGGAQRGAN LFHSFQEFNI QQGRGVYFSN PDGVQNILTR VTGNNGSNIL GTLGVDGKAD LFFINPNGII FGPEAKLDVQ GSFYGATADS ILFKNGFEFA SSDPQAPPLL TVNVPIGLRL PEKPGSILVE GQGHNINMSE YLETYGYRNR KEDRNLGLEV SQGQTLGLVG GDIFLRGGNL TSDGGRVELG SVQNGVVEIT TDRTLSYDKN LEFGQIILSE ASSIDISGNG RVEFQIQGEE INVQDNSLIL GKILGDEDGG NSIIRASKSI KIGPTKPFEK IPVKIKIVNF TYTFNKTMSS IFLNTYGKGD VGELRVKTES LTLQDGAVIY TGTVGQGNAG NLTVSARDIE INGGNYATGF FPTTLAEGNG GNLTVEVAES LTLQDGAVMS AGTGYQGNAG NLTVSARDIE MSGFVVVTTI TETTGNGGNL TVEVAESLTL QDGAVMSAGT VGQGNAGNLI VSATDIEIIG KSPDGKSFSR LTAEATNEST GAAGTITINT ETLNLRNEAE IIVTSATQQA AGDLIINSNN IRIENQASLK AETKGGQGSI TINNNKDLIL RNNSNITTNS EGTATGGNIT INTENLVARD NSDITANAVE GFGGRVNITA AGIFGTEFRP EQTEKSDITA TSKLGASFSG EVNINTPDVD PSDGLVEFED TIPDISNLLD QNPCRQGQKS KFIITGRGGL PPTLEEPLAP TYLWQDSETS VTPPPAITPS EQEFIEAQGW ISNSQGIELI TDPETATPTS PWLIPPNCEQ LDTFEPPPYL LASSKGNIPT STSEPLKVTI KQFNFEGNTK FSNQELQQQL TPYLNKPITF AQLIAARTAI TKFYTKNNYI TSGAFIPPQT MAENGTVTLR VVEGKVGEIN VNIQGRLNEN YIKSRLEKAT TAPLNQEKLL SALQILQIDP LVKTLSAELS SGVRPDTSRL DIRIETANPW QIETISNNGR APSVGTFRRG VEVGHRNVTG IGDSLSALYT NTDGSDMVEV SYAIPVNSSN GRIELFYRHR DNKVVESPFE RLDIESNSNT YKLSFSQPIM QTPWQTLSLG LSAIKRDSQT SILGENYPLS EGADENGETK LSILQLFQDY ELRGKNQVLA FNSQLNVGLG ILDATKNSNE PDGRFFYWRG QGQWVRELGK NTLLVMGADL QLSPSDLVPK ERFGLGGYRS VRAYRQDTRL TDNGALGTVE LRLPVPWISG KNRLFQVVPF IDGGVAWNSD GKEEEGSKAL AAAGVGLQVN LWEKINMRLD YGIPLVDVDS LDKTAQEEGF YFSFSTTPFS F
|
| |