Gene TRQ2_0671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTRQ2_0671 
Symbol 
ID6092088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga sp. RQ2 
KingdomBacteria 
Replicon accessionNC_010483 
Strand
Start bp686428 
End bp687738 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content48% 
IMG OID642487857 
Productextracellular solute-binding protein 
Protein accessionYP_001738707 
Protein GI170288469 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.687145 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAGAT TTTTACTGTT GATCTTTCTC ATCATCACCT CGTTGATCTT CTCGGTTAAA 
ATCTCCGTTC TCTGTTCTCC AGACAACGCG GACGCCCTGA AGTGGCTTGC CCAGGAGTTC
ATGAAACAGA ATCCCGAAAT TCAGGTTGAG ATCGTACCTC TTTCGTGGGA AGTGTTGTAT
CCAAAACTAC TGCAGGATCT CAGATCTCAG GCTGGATCGT TCGATGCTTT CACTTACGAT
GTGATGACCA CTGGAGCCGT CTCTTTCGGA CTGGTTGACC TTGGAGAGTT CATGAAACAA
CATCCAGAAC TTGTTCCAGA AGATTATGAT TTGAACGATT TTATCCCACA GGTTCTGGAA
GAATCTGGAA AGTGGCAGGG AAAACTCGTC GGGCTTCCGT TCTACAACAA CACAATGCTC
TTCTATTACA GAAAAGATCT CTTTGAAGAT CCAAAGATAA AACAAGCGTT CAAGGAAAAA
TACGGTAGAG AACTCACCCT CCCGACCACC TGGGAAGAAG TTGTAGAAAT AGCGGAATTC
TTCACCAAAA AATACAACAA GAGCTCTCCA ACAGACTACG GAATCGCCCT CATGTTCCCG
AGAACCCACA CACTCTTCTA CATGTATCTG CTGTTTTTCG GTGAGTACAG GAACGCACCA
CTCGGTATCA TGAGGCACGG AACTGCGGAT CTTGAATTCG GTGAATACTT CACAGCGGAT
CACAAACCGG CCTTCAACAG TGAAGAGGGA TTGAAAGCGC TCGAAATGAT GAAAAAACTC
ATGCCTTACA GTCCAGATCC GCTCGGCTCT GATTACGGTG AAACGATTGA GTACTTCAAC
CAGGGACTCG TTGCTATGGT ACCTCAATGG ACGGGGCCGT ATCTGATCTT CAAGAGCACA
CTCGGTGAAG ATAAAGTCGG GATCATTCCC ATGCCGGGTC GATCTGTGAG TGGTCAATGG
GCACTCGGCA TCAACAAATT CATACCCGAG GACAAGAAAC TCGCTGCGTT CAAATTCATC
ATTTTCGCCA CCAGCAAATG GGCTGACAAG AACAAGTTCC TGAGATTCGC CGTCGCTCCT
GCCAGAATCT CAACACTCCA GGATCCCGAG GTGAGGGCCG CTGACCCGAG AGTTCCCGCC
CTCGAGGTAA CATACGTTTC TCAGACCCAC AGGCCAAGGA TTCCAGAGGA ACCGAGACTC
GAAGACATCA CCGTTGAGAC CTTCTCCAAG ATCCTCTCTG GAGAACTCCC GCTCTCCATG
GAAACGCTGA ACGATCTTGC AAAAAAATGG GAAGAGATTC TTGGAAAATA A
 
Protein sequence
MKRFLLLIFL IITSLIFSVK ISVLCSPDNA DALKWLAQEF MKQNPEIQVE IVPLSWEVLY 
PKLLQDLRSQ AGSFDAFTYD VMTTGAVSFG LVDLGEFMKQ HPELVPEDYD LNDFIPQVLE
ESGKWQGKLV GLPFYNNTML FYYRKDLFED PKIKQAFKEK YGRELTLPTT WEEVVEIAEF
FTKKYNKSSP TDYGIALMFP RTHTLFYMYL LFFGEYRNAP LGIMRHGTAD LEFGEYFTAD
HKPAFNSEEG LKALEMMKKL MPYSPDPLGS DYGETIEYFN QGLVAMVPQW TGPYLIFKST
LGEDKVGIIP MPGRSVSGQW ALGINKFIPE DKKLAAFKFI IFATSKWADK NKFLRFAVAP
ARISTLQDPE VRAADPRVPA LEVTYVSQTH RPRIPEEPRL EDITVETFSK ILSGELPLSM
ETLNDLAKKW EEILGK