Gene TRQ2_1696 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTRQ2_1696 
Symbol 
ID6093146 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga sp. RQ2 
KingdomBacteria 
Replicon accessionNC_010483 
Strand
Start bp1717396 
End bp1718706 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content46% 
IMG OID642488896 
Productextracellular solute-binding protein 
Protein accessionYP_001739713 
Protein GI170289475 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00506774 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAGT TTTTTGTTCT TCTGATGATT CTTCTCGTAG TTATCAGTTT AGCGAAGGTC 
AAGATTCAGT TCTGGCACGC CATGGGAGGA TGGAGGATCG AACTTCTTCA AAACATGGCG
GAGGACTTCA TGAAGACCCA TCCAGACATT GAAGTAGAAG TTCAGTACAC CGGAAGCTAC
AGAGATACTC TGAACAAACT CGTAGCAGCT GTACAGGGAG GAACCCCTCC ACACGTTGTT
CAGATATACG AAATTGGCAC TCAGTTCATG ATTGACAGTG GTATAGCCGT CCCAATTGGT
GATTTGATCG AGAAAGATCC CTCCTTCGAT GTTGGAAAAT TCCTTCCACA GGTTCTGGAC
TACTACAGAG TGAAGGGGAA ACTCTACTCG ATGCCGTTCA ACTCTTCAAA TCCGATTCTC
TACTACAACA AAACCCTCTT CAAAGAAGTG GGTCTCGATC CAAACAAACC TCCCAGGACA
TTCAATGAGT TAATAGAATA CTGCAGAAAA CTCACGGTTA AAGATGAAAA AGGAAATATC
GTTCGTGCTG GCATCACTTG GCCACTCCAC AGCTGGTTCT TCGAACAGTT CGTAGCTCTT
CAGAATGCTC CTTTAGTTGA CAACGAAAAT GGAAGAGCGG GAAGAGCAAC AAAGGCAGTT
TTCAACCACA AAGCGGCGCT CAGGTTCCTC AAACTCTGGA ATACACTTAC GAAAGAGGGT
CTTATGATCA ACACAACAAA AGAAGACTGG ACAGGAGCAA GACAGCTCTT TATTTCTCAA
AAAGTTGCCA TGCTTATCAC CTCTACCTCT GACGTGAAAC TGATGATGGA CGCTGCCAAG
GAAAACGGAT TCGAGCTCGG AACAGCATTC CTTCCGAAAC CAGAGGGAGT TGAGCTTGGA
GGAACACCAA TAGGTGGTGG AAGCTTGTGG ATCATAGGAG GCCATCCAGA AGAGGAGATA
AAAGCGGCCT GGGAGTTCGT AAAATGGATG GCGGAGCCAG AACAACAGAT ACGCTGGCAC
CTTGGAACAG GCTACTTCCC GGTAAGAAAA GACGCAGTAG AGACACTTCT CTACCAGGGT
TACTACTCTG AATATCCTCA TCATCTCACT GCGCTTTTGC AGCTTCTGCT GTCGGTTCAA
ACACCGAACA CCAGAGGAGC TGTTATAGGA CCGTTCCCAG AGGTGAGAGA CATAATAGAA
ACCGCTATTG AAAAAATGAT TAATGGAGAA ATGACGCCCG AAGAAGCTCT CGCCTGGGCT
GAAAAAGAAG CTACAAGGGC CATCAGAGAA TACAATGAAC TCTATGAGTG A
 
Protein sequence
MKKFFVLLMI LLVVISLAKV KIQFWHAMGG WRIELLQNMA EDFMKTHPDI EVEVQYTGSY 
RDTLNKLVAA VQGGTPPHVV QIYEIGTQFM IDSGIAVPIG DLIEKDPSFD VGKFLPQVLD
YYRVKGKLYS MPFNSSNPIL YYNKTLFKEV GLDPNKPPRT FNELIEYCRK LTVKDEKGNI
VRAGITWPLH SWFFEQFVAL QNAPLVDNEN GRAGRATKAV FNHKAALRFL KLWNTLTKEG
LMINTTKEDW TGARQLFISQ KVAMLITSTS DVKLMMDAAK ENGFELGTAF LPKPEGVELG
GTPIGGGSLW IIGGHPEEEI KAAWEFVKWM AEPEQQIRWH LGTGYFPVRK DAVETLLYQG
YYSEYPHHLT ALLQLLLSVQ TPNTRGAVIG PFPEVRDIIE TAIEKMINGE MTPEEALAWA
EKEATRAIRE YNELYE