Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Teth514_2194 |
Symbol | |
ID | 5877579 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | - |
Start bp | 2195193 |
End bp | 2196437 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641542549 |
Product | extracellular solute-binding protein |
Protein accession | YP_001663802 |
Protein GI | 167040817 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00903267 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGT TTTACACTAA GTTAATAGCA GTTTTAATCA TTATCTCGCT TCTAGGTACT GTTATTGCAG GCTGTGGCAG CAAGACCCAA TCAGAAGCGA GGAAAAAAGT TTTGAAAGTT TCAATGGGAC TTGGAGAAGC AGAATGGAAA GTGATGAAAG AAGATATTTT CCCACCATTT GAGCAAAAGT ATGGTGTAAA AATTGAACCT CTTCAAATCG AAGCAGGAGA CCTTATTAAA AAATTAGATG CTATGCACAA AGCAAATGCG ATGGATATAG ACATTATCAC ACAAGATAAC ATGCAACTTG CTCCACTTGT TGCAAAAGGG CTTGTGGAAG ATTTGTCTCA GTATAGAGAC ATGATACCAA AGGAAGTAAT ACCAAGTCTT GTGCCAGTAG GAGAGTTTGA TGGAAAGTTG TACTTTATGC CGTATAGACC AAATGTAGAA ATAGCTTTCT ACAACGAAGA TAAATTCAAC GAATACGGTT TAAAACTACC TACAAATTGG GATGAGCTTT TACAGGTTGC AAAGACTTTT AAAGAAAAAG AGGGCATAGG TAGAGTCATA ATTAAAGAAA ATTTAGGGCC TGACAGCACA GTCCACATGT TTGACCTTAT AAGGTCTGCT GGTGGTGACC CAACAGTATT GAATGATGAG GGTTCAATAA AAGCATTTAC TTTCTTGAAA GAAATACAGC CATATCTCTC TCCTGACTCA AAGAAAGCTG ACTGGAATAC ACCTGTAGAA TATCTTGCAA AAGAGAGCGT ATATTTGGTT CAAAATTGGC CGTACACTGC AAACGTTCTT GTAGAGCAGT ATGGAAAAAA GAACATTTTG GCATATCACG GATGGACAGG TCCGGTTAAA GAGTCCCACG TTTTGGGAGG AGAAGTTATA GGAATACCAA CTGGTGCACC TAATAAAGAG ATGGCTATAA AGTTTATGGA ATACCTTATG AGTAAAGAAG TTCAAGAGAA ACTTGTCACT AAATTAGGAT GGCCATCCAT GAGAAGTGAC GCTTATGGGA AGGTTGCAGA GTGGCAAAAA CCATATTTTG AAGCTATAAA TGAAGCGTTA AAACATGCAG AACCAAGGCC AAACCTTGTA TACTGGGCTG ATGTGGACAA AGCTATAAAT GGAGCATTGA GAGAAATAAT ATTTGAAGGC AAAGATATCA AGACAACTCT TGACAAATAT CACAACATGA TAGAAGAAGC TAAGAAAGCT GCAGAAAGCA AGTAA
|
Protein sequence | MKKFYTKLIA VLIIISLLGT VIAGCGSKTQ SEARKKVLKV SMGLGEAEWK VMKEDIFPPF EQKYGVKIEP LQIEAGDLIK KLDAMHKANA MDIDIITQDN MQLAPLVAKG LVEDLSQYRD MIPKEVIPSL VPVGEFDGKL YFMPYRPNVE IAFYNEDKFN EYGLKLPTNW DELLQVAKTF KEKEGIGRVI IKENLGPDST VHMFDLIRSA GGDPTVLNDE GSIKAFTFLK EIQPYLSPDS KKADWNTPVE YLAKESVYLV QNWPYTANVL VEQYGKKNIL AYHGWTGPVK ESHVLGGEVI GIPTGAPNKE MAIKFMEYLM SKEVQEKLVT KLGWPSMRSD AYGKVAEWQK PYFEAINEAL KHAEPRPNLV YWADVDKAIN GALREIIFEG KDIKTTLDKY HNMIEEAKKA AESK
|
| |