Gene Information |
Locus tag | TRQ2_0671 |
Symbol | |
ID | 6092088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | - |
Start bp | 686428 |
End bp | 687738 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642487857 |
Product | extracellular solute-binding protein |
Protein accession | YP_001738707 |
Protein GI | 170288469 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
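The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed values but is not stated in the record) and that the CDS ends in a single stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 686428
END_BP = 687738
GENE_LENGTH_BP = 1311
PROTEIN_LENGTH_AA = 436


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS, minus the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

The strand field does not enter this check: the interval length is the same on either strand.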
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.687145 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAGAT TTTTACTGTT GATCTTTCTC ATCATCACCT CGTTGATCTT CTCGGTTAAA ATCTCCGTTC TCTGTTCTCC AGACAACGCG GACGCCCTGA AGTGGCTTGC CCAGGAGTTC ATGAAACAGA ATCCCGAAAT TCAGGTTGAG ATCGTACCTC TTTCGTGGGA AGTGTTGTAT CCAAAACTAC TGCAGGATCT CAGATCTCAG GCTGGATCGT TCGATGCTTT CACTTACGAT GTGATGACCA CTGGAGCCGT CTCTTTCGGA CTGGTTGACC TTGGAGAGTT CATGAAACAA CATCCAGAAC TTGTTCCAGA AGATTATGAT TTGAACGATT TTATCCCACA GGTTCTGGAA GAATCTGGAA AGTGGCAGGG AAAACTCGTC GGGCTTCCGT TCTACAACAA CACAATGCTC TTCTATTACA GAAAAGATCT CTTTGAAGAT CCAAAGATAA AACAAGCGTT CAAGGAAAAA TACGGTAGAG AACTCACCCT CCCGACCACC TGGGAAGAAG TTGTAGAAAT AGCGGAATTC TTCACCAAAA AATACAACAA GAGCTCTCCA ACAGACTACG GAATCGCCCT CATGTTCCCG AGAACCCACA CACTCTTCTA CATGTATCTG CTGTTTTTCG GTGAGTACAG GAACGCACCA CTCGGTATCA TGAGGCACGG AACTGCGGAT CTTGAATTCG GTGAATACTT CACAGCGGAT CACAAACCGG CCTTCAACAG TGAAGAGGGA TTGAAAGCGC TCGAAATGAT GAAAAAACTC ATGCCTTACA GTCCAGATCC GCTCGGCTCT GATTACGGTG AAACGATTGA GTACTTCAAC CAGGGACTCG TTGCTATGGT ACCTCAATGG ACGGGGCCGT ATCTGATCTT CAAGAGCACA CTCGGTGAAG ATAAAGTCGG GATCATTCCC ATGCCGGGTC GATCTGTGAG TGGTCAATGG GCACTCGGCA TCAACAAATT CATACCCGAG GACAAGAAAC TCGCTGCGTT CAAATTCATC ATTTTCGCCA CCAGCAAATG GGCTGACAAG AACAAGTTCC TGAGATTCGC CGTCGCTCCT GCCAGAATCT CAACACTCCA GGATCCCGAG GTGAGGGCCG CTGACCCGAG AGTTCCCGCC CTCGAGGTAA CATACGTTTC TCAGACCCAC AGGCCAAGGA TTCCAGAGGA ACCGAGACTC GAAGACATCA CCGTTGAGAC CTTCTCCAAG ATCCTCTCTG GAGAACTCCC GCTCTCCATG GAAACGCTGA ACGATCTTGC AAAAAAATGG GAAGAGATTC TTGGAAAATA A
Protein sequence | MKRFLLLIFL IITSLIFSVK ISVLCSPDNA DALKWLAQEF MKQNPEIQVE IVPLSWEVLY PKLLQDLRSQ AGSFDAFTYD VMTTGAVSFG LVDLGEFMKQ HPELVPEDYD LNDFIPQVLE ESGKWQGKLV GLPFYNNTML FYYRKDLFED PKIKQAFKEK YGRELTLPTT WEEVVEIAEF FTKKYNKSSP TDYGIALMFP RTHTLFYMYL LFFGEYRNAP LGIMRHGTAD LEFGEYFTAD HKPAFNSEEG LKALEMMKKL MPYSPDPLGS DYGETIEYFN QGLVAMVPQW TGPYLIFKST LGEDKVGIIP MPGRSVSGQW ALGINKFIPE DKKLAAFKFI IFATSKWADK NKFLRFAVAP ARISTLQDPE VRAADPRVPA LEVTYVSQTH RPRIPEEPRL EDITVETFSK ILSGELPLSM ETLNDLAKKW EEILGK
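The sequences above are displayed in space-separated blocks of 10 characters. As a rough sketch of how such a field might be processed, the code below removes the block spacing, computes a GC fraction, and translates the first 30 nucleotides of the gene sequence. The codon table is deliberately partial: it contains only the nine codons that occur in that 30-nucleotide prefix, for which translation table 11 gives the same amino acids as the standard code. All function and variable names are hypothetical.

```python
# First three 10-character blocks of the gene sequence, copied from the record.
GENE_PREFIX = "ATGAAGAGAT TTTTACTGTT GATCTTTCTC"

# Partial codon table: only the codons present in GENE_PREFIX.
PARTIAL_CODON_TABLE = {
    "ATG": "M", "AAG": "K", "AGA": "R", "TTT": "F", "TTA": "L",
    "CTG": "L", "TTG": "L", "ATC": "I", "CTC": "L",
}


def ungroup(seq: str) -> str:
    """Remove the whitespace that separates the display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    bases = ungroup(seq)
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


def translate_prefix(seq: str) -> str:
    """Translate complete codons; raises KeyError for codons not in the partial table."""
    bases = ungroup(seq)
    usable = len(bases) - len(bases) % 3
    return "".join(PARTIAL_CODON_TABLE[bases[i:i + 3]] for i in range(0, usable, 3))


# The translated prefix matches the first block of the protein sequence.
assert translate_prefix(GENE_PREFIX) == "MKRFLLLIFL"
```

Applying `gc_fraction` to the full gene sequence would give a value to compare with the GC content field. Translating the whole gene would need the complete table 11 rather than this partial one.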