Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Teth514_2201 |
Symbol | |
ID | 5876179 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | - |
Start bp | 2201890 |
End bp | 2203152 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641542556 |
Product | extracellular solute-binding protein |
Protein accession | YP_001663809 |
Protein GI | 167040824 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.455559 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAA AACTTATAAG CATCTTTGTA TTGACGATCT TTGTATTAGC TACTGTTTTA GCTGGTTGTT CATCCAGTAA AAATAATACT TCCAGTGCCA ATGAGACAAA TACACAAAAA CAAGAGACAG CAAAACCAGT TACTATAAAA TTAGGCATGT GGTCTTCATC TCCAGCAGAA AAGAAGATAG TGGATGACCA AATAGCTAAG TTTAAAGAAA AATATCCAAA TATAGATGTG CAAATTGAGA CAATTGTGGG AGATTACATG CAAAAATTAC AAACAGAACT GGCGTCAAAT ACAGCACCAG ACATATTCTA TCTTGACAGC ATGCCGGCAC CACAGCTTAT GTCTTCAGGA GTTTTAGAGC CATTAGATGA TTATATTAAG AAATACAATG TGGATGTAAA TGATTTCGAG CCAGCATTGC TTTCCGCTTT TCAGTGGGAT GGAAAAACTT ATGGTTTACC AAAGGATTTC AATACTCTAG CTTTGTTTTA CAACAAAGAC ATGTTTAAAG CGGCTGGAAT AAATGAGCCT CCAAAAACAT GGGAGGAATT AAGAGATGTA GCTAAAAAAT TGACAAAAGA CGGTGTCAAA GGTTTGGTTT TATCAGCAGA CCTTGCAAGA TTTGATGCTT TTATAAATCA AAATGGTGGT TCAGTATATC AAGATGGAAA AGTTACTTTA AATCTGCCAG AGAATGCACA AGCTCTTGAT TTTTATGTGA GCCTCATCAC AAAAGACAAA GTTGCTGACA CACCACAAAA CATGGGAGAA GGCTGGAATG GAGATGCTTT TGCTGCTAAA AAAGCTGCAA TGGCAATAGA AGGTGGCTGG ATGATACCAT TCCTCAAAGA AAAAGCTCCT GATTTAAACT ATGGTATAGC AGAGCTTCCA GCAGGAAAGC AAAAATCTAC AATGGCTTTC ACTGTTGCAT ATGTGATGAA TAAAAACAGC AAACATAAAG ATGAAGCCTT TAAACTTATT GAATTTTTAA CCGGTAAAGA AGGACAGCAA TTTGTAGTAG ATTCAGGCCT TGCACTTCCA TCGAGAAAGT CTATGCAAGA AGGATTTAAG GAGAAATATC CTGAAAGAGC TGCCTTTGTA GATGGTGCTT CTTATGCGGT ACCATGGCAA TTCGGTTTGT ATGGCACAAA GGTAGTAGAT GCGGCTAATA AAGCCTGTGA AGCATTAATA ATGAAGCAAA TAAGTAGTGC TCAGCAAGCT CTTGACAACG CACAAAAGGA AGTTGGACAA TAA
|
Protein sequence | MSKKLISIFV LTIFVLATVL AGCSSSKNNT SSANETNTQK QETAKPVTIK LGMWSSSPAE KKIVDDQIAK FKEKYPNIDV QIETIVGDYM QKLQTELASN TAPDIFYLDS MPAPQLMSSG VLEPLDDYIK KYNVDVNDFE PALLSAFQWD GKTYGLPKDF NTLALFYNKD MFKAAGINEP PKTWEELRDV AKKLTKDGVK GLVLSADLAR FDAFINQNGG SVYQDGKVTL NLPENAQALD FYVSLITKDK VADTPQNMGE GWNGDAFAAK KAAMAIEGGW MIPFLKEKAP DLNYGIAELP AGKQKSTMAF TVAYVMNKNS KHKDEAFKLI EFLTGKEGQQ FVVDSGLALP SRKSMQEGFK EKYPERAAFV DGASYAVPWQ FGLYGTKVVD AANKACEALI MKQISSAQQA LDNAQKEVGQ
|
| |