Gene Information |
Locus tag | TRQ2_0340 |
Symbol | |
ID | 6091744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 322967 |
End bp | 324226 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642487517 |
Product | extracellular solute-binding protein |
Protein accession | YP_001738379 |
Protein GI | 170288141 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
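The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions the record does not state: that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the stop codon while Protein Length does not. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(322967, 324226)
print(length)                     # 1260
print(protein_length_aa(length))  # 419
```

Under those assumptions the values agree with the Gene Length (1260 bp) and Protein Length (419 aa) fields listed for TRQ2_0340.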
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGAAAT TTCTCGTTGT TCTCATGGTG GTTCTCAGTG TTTTTGCCCT CGCGAAGGTG AAGGTCACGT TCTGGCACGC CATGGGCGGG GGACACGGTG AAACTCTCCA AGAGATAGTG AACACCTTCA ACGAACTCCA CCCGGATATC GAGGTCGAAG CGGTCTACGT TGGAAACTAC AGTGCACTCT CTCAGAAGCT CCTCGCGGCA GCACAGGCAG GAGAACTTCC CACCATCTCG CAGTCCTATT CCAACTGGAC GGCGAAGCTC ATCCAGAGCG GTGTCGTGCA GCCTCTGAAC GAGTTCGTGA ACGATCCAAA GATAGGCCTC ACGAAGGAAG AGTGGGAAGA CATCTTCAAG CCTTTGAGAG ACAACTGCAT GTGGGGAGAC ACCATCTACG CGGTCCCGTT CAACAAGAGT CTCTACATAC TCTATTACAA CGCGGATGCT TTTGCGATGT ACGGCGTGGA TGTTCCCAAA ACGATCGATG AGCTTTACGA AGCGGCGCGA ATCATGACAG AAGACCTCGA CGGAGATGGA AATATCGATC AGTACGGTTT TGGCTTCAGA ACGACCGTTG ACTTCTTCCA GATACTACTT CTTCTGCGCG GTGGTTCCAT CCTGAAGCAG GTCGATGGAA AGTGGGTCTC CAACATCGAC AGTCAGGAAA CAAGGGACGT TCTCGCTTTC GTGAAGAAGA TGGTGGACGA TGGTATCGCG TACTTCCAGG GTGGATACCT CAACGATATC TTCGGTCAGC AGAAGATCAT GATGTACATC GACACGATAG CGGGAAGACC CTACGTGGAA AGCTCCACGA AGGGGAAGTT CACCTGGAGC TGGGCACCTG TTCCCACCTG GGTGACGAAC AAGGTGCCGT TCGCCGGAAC AGACATCATC ATGTTCAACA CGGCAAGCGA TGAGGAAAAA CGTGCCGCCT GGGAGTTCAT GAAGTACCTC ATCTCACCAG AGGTCACCGC TTACTGGGCG ATCAACACAG GGTACATACC TGTGAGGAGA AGCGCCCTTG AAACGTCGAT CTGGAAGGAA GCGGCTAAAT CCGATCCTCT GATCGAAATA CCGCTGAAGC AGATAGACAA CGCCATGTTC GACCCGCAGA TAGGTGTATG GTACGAGATC AGAACGGTGG TTGGAAACAT GTTCTCCGAT TTCATCAACG GAAAGGTAGA CATGGAAACT GCGATAAAGA CGGCGGATCA GAAGATAAGG GAGTATCTCA AGGAAGAGTA CGGTGAGTGA
|
Protein sequence | MKKFLVVLMV VLSVFALAKV KVTFWHAMGG GHGETLQEIV NTFNELHPDI EVEAVYVGNY SALSQKLLAA AQAGELPTIS QSYSNWTAKL IQSGVVQPLN EFVNDPKIGL TKEEWEDIFK PLRDNCMWGD TIYAVPFNKS LYILYYNADA FAMYGVDVPK TIDELYEAAR IMTEDLDGDG NIDQYGFGFR TTVDFFQILL LLRGGSILKQ VDGKWVSNID SQETRDVLAF VKKMVDDGIA YFQGGYLNDI FGQQKIMMYI DTIAGRPYVE SSTKGKFTWS WAPVPTWVTN KVPFAGTDII MFNTASDEEK RAAWEFMKYL ISPEVTAYWA INTGYIPVRR SALETSIWKE AAKSDPLIEI PLKQIDNAMF DPQIGVWYEI RTVVGNMFSD FINGKVDMET AIKTADQKIR EYLKEEYGE
|
| |
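The gene and protein sequences above are shown in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted sequence might be cleaned, summarized for GC content, and translated. It assumes that translation table 11 assigns internal codons as in the standard code and that the first codon of a CDS (here GTG) is read as methionine when it acts as the initiator. The helper names are hypothetical, and this is not necessarily how the database derived its values.

```python
from itertools import product

# Codon table in NCBI order (T, C, A, G). Internal codon assignments are
# assumed to be the same for translation table 11 and the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between ten-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the first codon is a valid initiator and is read as M.
    """
    s = clean(seq)
    codons = [s[i:i + 3] for i in range(0, len(s) - 2, 3)]
    if not codons:
        return ""
    residues = ["M"] + [CODON_TABLE[c] for c in codons[1:]]
    return "".join(residues).split("*")[0]


# First three blocks of the gene sequence from this record.
prefix = "GTGAAGAAAT TTCTCGTTGT TCTCATGGTG"
print(translate_cds(prefix))  # MKKFLVVLMV
```

The printed residues match the first block of the protein sequence in the record. Passing the full gene sequence to `gc_content` and `translate_cds` would allow the same comparison against the GC content and Protein sequence fields.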