Gene Information |
Locus tag | Tery_2593 |
Symbol | |
ID | 4244661 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 4004575 |
End bp | 4008168 |
Gene Length | 3594 bp |
Protein Length | 1197 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 638107664 |
Product | periplasmic protein TonB links inner and outer membranes-like |
Protein accession | YP_722263 |
Protein GI | 113476202 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0810] Periplasmic protein TonB, links inner and outer membranes |
TIGRFAM ID | [TIGR01376] Chlamydial polymorphic outer membrane protein repeat |
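The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Tery_2593.
length_bp = gene_length(4004575, 4008168)
print(length_bp)                  # 3594
print(protein_length(length_bp))  # 1197
```

Under these assumptions the computed values agree with the Gene Length (3594 bp) and Protein Length (1197 aa) fields reported in the record.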
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0238559 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0624657 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTACAA TTGTTGTTAA TACTCTTGAT GATGAAAATG ATGGAAGTCT TGATCAAGGA ACAGGCAACT CACTCCGAGA AGCTATTAAT TTGTCTAATG CAGGAGATAC AATCATTTTT GCTGACACTA TTGCCGGAAA AACCATTAAT TTGAGCAATA GTGAACTTAT CATTAATAAA AACCTCACTA TTGATGGGGA TGAAAATAAC CGAGTCACAA TAAATGCACA AGGTAACTCT CGTGTATTTA ATATAAATGA TAACAATAGC ACACAGCAGC AAGTTACCAT TGATGGAGTT GTAATCACAG GTGGAAAAGC TTCTGGTAGT GGGCCAAATG AAAGTGATGG GGGTGGTATA TTTACTAATG AAGCACTCAC GCTCAGTAAC AGTGTAGTTA CAGGTAACAC AGCATCAGGT GGAAATGCTG ATGGTGGTGG TATATATATA AATTCCATAG CTAGCGCTGA TATTATAAAC ACTACTATTA GCAATAATAC AGCAACTGAT GACGGTGGTG GTGTCGTTGG ATTTGGCAAT ACAAATATTA CTAACTCTAC TATCAATAAC AACGCAACAC TTACTTCTAC TGGCGATGGT GGTGGTGTCT ATATAGTTCG CACTACAAAT ATTACTAATT CCACTATCAG TGGTAACACA TCATCAGGTG AAGGTGGTGG TGTCTATGTT AAAGATCAGC GACTCTCCAA CGCGACAATA ACCAACAGTA CAATTACTAA TAACCAAGCA CCTGAAGGTA AAGGTAGTGG TTTAGCTACT TTAGGAGTTC AAACAACAAC TGTTACCTCT AGCATTATTG CTGGTAATGT TAACAGCGAC GTTAACGAAT TAACTAGTGG AAGTAACGCC ATAGAAAGTG GTGGCAACAA CTTAATTGGT ACAGGAAACA CTGCTGCTAA ATTTAATGCT ACAAGCGATA TAACAGGAGT CACTAATCCT GGTCTAGATC CTCTTGCTGA TAATGGGGGA CCTACTCAAA CCCATGCTTT ACAAAATGTT AGTCGTGCTA TTAATGCTGG CAGCAATCTT TCCAATCTGG ATACTGACCA ACGTGGTGCA GGTTTCTCCC GCGTTTTTAA TGGTGTAGCC GATATTGGTG CTTTTGAGTT TGGCAACCCC ATACCTACAC CTGAACCTAC ACCTGCACCT ACACCTGAGC CAACACCTGC ACCTGAGCCA ACACCTGAGC CAACACCTGA ACCGACACCT GAACCGACAC CTGAACCGAC ACCTGAACCG ACACCTGAAC CGACACCTGA ACCGACACCT GAACCGACAC CTGAACCTAC ACCTGAACCT ACACCTGAAC CGACACCTGA ACCGACACCT GAACCGACAC CTGAACCTAA TTCTGGTAAT GACACTCTTG ACGGTAATGG TAGGAATAAC TCTATCGATG GCGGGGACGG AGATGATTTC ATAAATGGTA AAGGAGGCAA CGACGAACTA TTTGGTGAAA ATGGTGCAGA TACCCTCTTG GGTGACCTTG GCAACGACGA ACTATTTGGT GGTAACGGTG GAGACAGCCT TGTCGGTGGT CCTGGTCGAG ACCTTCTTGA TGGTGGGACT GGTAACGACA GCTTAAATGG CTCATCTGGT CCAGATACTC TTGATGGTGG TGCCGGTGAA GATAACCTCG TCGGTGGTCC TGATGGAGAC AGACTATTCG GTGGTCCTCG TAATGACACC CTAGATGGTG GGTTAGGTAG AGATAGTCTT ATTGGTGGTA ACGGCAACGA CAGTCTGATG GGTGGTAATA GTATAGATAC AATTGATGGT 
GGTATTGGTA ATGACACTAT TGACGGTGGC CGGGGTCCTG ATGTTATAAG TGGGGAAGAC AACGCTGACA CTTTTGTATT GCGTTCTACC TATGGCAACG ATCAGATTCT AGATTATGTA GATGGTACAG ATAATTTCTT TTTAGATGGT CTGGCTTTCA AAGACTTGAT AATAGAACGT GACCCAGATA ACCGTAAGAA TACTATTATA CAGACTAAGG TAACTGGCTC GACAGAAATT TTAGCAACTT TGGTTGACTT TACAAGAACT GATAAGCTAA CTGAGGCTGA TTTTGAGTCT TCAACACCAA CACCAACACC AACACCAACA CCAACACCAA CTTCAACACC AACACCAACA CCAACACCAA CACCAACAAC TATTTCTCCT ACTATTAATA ATGACACTCT TGATGGTGAT GATCAGAATA ACGAGATCAA AGGTTTAGGA GGAAATGATT TAATAAGTGG TAAAGGAGGC AATGATCAAC TATTTGGTGA TGACGGTGCA GATACCCTTT TAGGTGATGA TGGTAAAGAT ACTCTTAATG GGGGAGTAGG TAACGATAAG CTTGATGGTG GTAAAGATGA TGATCGTTTA ATAGGTGGTG CAGGAAATGG TGCAGATATC CTCTTAGGTG GTGATGGTAA AGATACTCTT ATTGGTGGAG CGGGTAACGA TAAGCTTGAT GGTGGTAAAG ATGATGATCA TTTAATAGGT GGTACAGGTA ATGATGTCTA TACAATTAAC TCCCGCAAAG ATATTATTAT CGAAAGAGCA AGACAAGGTA ACGACCATGT CATGTCTTCA GTCACCTATA ATCTCAGCAA CAACTTAGAA AGACTGACTC TGCTCGGAAA AAACAATCTT ACCGGAAGAG GTAATGATCG CGCTAACATT ATTCGAGGAA ACTCTGGAAA TAATAACCTC GAAGGTAAAG GAGGCAACGA CCAACTATTT GGTAATAACG GTGCAGATAA CCTTTTGGGT GGTGATGGTA AAGATACTCT TATTGGTGGA GCAGGTAACG ATAAACTTAA TGGTGGTAAA GATGATGATC GTTTAATAGG TGGTACAGGT GATGATGTCT ATACAATTGA TTCCACGAAA GATATTATTA TCGAAAGAGC AAACCAAGGT AACGATCATG TAAGATCTTT AGTTACCTAT AATCTCAACA ACAACTTAGA AAAACTGACT CTGCTCGGAA AAAACAATAT TATAGGAAGA GGTAACGATC GCGCAAACGT TATTCGAGGA AACTCTGGAA ATAATAAACT CGAAGGTAAA GGAGGCAACG ACCAACTATT TGGTAATAAC GGTGCAGATA ACCTCTTGGG TGGTAATGGT AACGATGCTC TTAATGGTGG TGGAGGTTTA GACATTCTCA ATGGTGGCTC TGGTCAAGAC ACCCTCATCG GTAGCTCTGG TCGAGATACC CTCATCGGTG GCTTGGATGC AGATAGCCTA CTCGGTGGTA GCGATGTGGA CACCCTCAAT GGTGGTTCTG GTCGAGACAC TCTTGACGGT GGCCGAAACC CTGATCGTCT AACTGGTGGA AGCGACGCTG ACAGTTTCGT ATTACGTTCT GGCGATGGCA ATGATAGGAT TCTAGATTAT GTAGATGGTA CAGATAACTT CTTGTTAGAT AGTCTGACTT TTGAAGACTT GACAATAGAA CTTGACTCAA CTGGTAATAA TACTGTCATA AAGACCTCGA CAGAAACTTT GGCAACTTTG GTTGGCGTTA CTGATATTAC TGATCTAGAT AATGGTGATT TTGTCATCGT CTAA
|
Protein sequence | MATIVVNTLD DENDGSLDQG TGNSLREAIN LSNAGDTIIF ADTIAGKTIN LSNSELIINK NLTIDGDENN RVTINAQGNS RVFNINDNNS TQQQVTIDGV VITGGKASGS GPNESDGGGI FTNEALTLSN SVVTGNTASG GNADGGGIYI NSIASADIIN TTISNNTATD DGGGVVGFGN TNITNSTINN NATLTSTGDG GGVYIVRTTN ITNSTISGNT SSGEGGGVYV KDQRLSNATI TNSTITNNQA PEGKGSGLAT LGVQTTTVTS SIIAGNVNSD VNELTSGSNA IESGGNNLIG TGNTAAKFNA TSDITGVTNP GLDPLADNGG PTQTHALQNV SRAINAGSNL SNLDTDQRGA GFSRVFNGVA DIGAFEFGNP IPTPEPTPAP TPEPTPAPEP TPEPTPEPTP EPTPEPTPEP TPEPTPEPTP EPTPEPTPEP TPEPTPEPTP EPTPEPNSGN DTLDGNGRNN SIDGGDGDDF INGKGGNDEL FGENGADTLL GDLGNDELFG GNGGDSLVGG PGRDLLDGGT GNDSLNGSSG PDTLDGGAGE DNLVGGPDGD RLFGGPRNDT LDGGLGRDSL IGGNGNDSLM GGNSIDTIDG GIGNDTIDGG RGPDVISGED NADTFVLRST YGNDQILDYV DGTDNFFLDG LAFKDLIIER DPDNRKNTII QTKVTGSTEI LATLVDFTRT DKLTEADFES STPTPTPTPT PTPTSTPTPT PTPTPTTISP TINNDTLDGD DQNNEIKGLG GNDLISGKGG NDQLFGDDGA DTLLGDDGKD TLNGGVGNDK LDGGKDDDRL IGGAGNGADI LLGGDGKDTL IGGAGNDKLD GGKDDDHLIG GTGNDVYTIN SRKDIIIERA RQGNDHVMSS VTYNLSNNLE RLTLLGKNNL TGRGNDRANI IRGNSGNNNL EGKGGNDQLF GNNGADNLLG GDGKDTLIGG AGNDKLNGGK DDDRLIGGTG DDVYTIDSTK DIIIERANQG NDHVRSLVTY NLNNNLEKLT LLGKNNIIGR GNDRANVIRG NSGNNKLEGK GGNDQLFGNN GADNLLGGNG NDALNGGGGL DILNGGSGQD TLIGSSGRDT LIGGLDADSL LGGSDVDTLN GGSGRDTLDG GRNPDRLTGG SDADSFVLRS GDGNDRILDY VDGTDNFLLD SLTFEDLTIE LDSTGNNTVI KTSTETLATL VGVTDITDLD NGDFVIV