Gene TRQ2_1666 details


Gene Information

Locus tag: TRQ2_1666
Symbol:
ID: 6093116
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga sp. RQ2
Kingdom: Bacteria
Replicon accession: NC_010483
Strand:
Start bp: 1689305
End bp: 1690810
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 50%
IMG OID: 642488867
Product: extracellular solute-binding protein
Protein accession: YP_001739684
Protein GI: 170289446
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
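
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1689305
end_bp = 1690810

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be
# the stop codon, which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1506 501
```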


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAGGC TTGTCTGGTT GTTTCTGGTT CTGGCGGTGA CTCTTTCTTT TGCAGCAAAG 
GACATCATCG TGGTGGGTAC CACGGACAAG ATCAGAACTC TCGATCCTGC AAACTGCTAT
GACTACTTCT CCTCGAACAT ACTCCAGAAC GTCCTGGTCG GTCTGGTTGA CTACGAGATA
GGAACCAGCA ACCTGAAACC GGTGCTCGCA GAAAGATGGG AAGTCGATGA AACGGGAACG
GTCTACACCT TCTATCTGAG AAAAGACGCA AAGTTCGAGG ATGGAACACC GATCGATGCA
CACGTGTTCA AGTATTCCTT CGACAGGGTT ATGAGACTCA ACGGAGATCC CGCATTTTTG
CTTTCGGACA TAGTCGAAAA AACGGAAGTG GTGGACGATT ACACGTTCCG TGTAACACTG
AAGTACCCGT TCTCCGCGTT CGTCTCCGTT CTTGGCTACA CCGTGGCCTA TCCGGTGAAT
CCAAAGGTTT ATCCAGCCGA TTCCTTCTAC GAAGGGATAC CTTCGGCCTC TGGGCCTTAC
AGGATCAAAG AGTGGATCAG AGACGTGAGG ATCGTTCTTG AGGCCAATCC GAACTACTTC
GGTGAAAAGC CAAAGACAAA GACCATCGTG ATCAACTTCT ACGAGAGTGC CTCCACCCTC
AGACTGGCAC TCGAAACAGG AGAGATCGAC GTTGCATACA GGCATCTCGA TCCAAGGGAT
ATCATCGATC TTGAGGGAAG AGAGGACATT GTTGTCTACA AAGGAAACAG CCCGCAAATA
AGATATCTCG TGATAAACGT GACACAGCCT CCGTTCGACA ACGTGAAAGT GAGACAGGCA
CTCGCCTATG CGGTCAACAG GTCTGTCATC GTCGAGGACG TGTTTGCAGG GCTTGCAAAA
CCGCTGTACT CGATGATTCC AGAAGGCATG TGGGGACACA AGAGTGTCTT CCCTGAGAGA
GATCTGGAAA AAGCAAAAGC ATTACTCAAA GAGGCTGGCT ACGACGAAAA CAACCCGCTC
GTGATCGATC TCTGGTACAC ACCCACACAC TACGGAACAA CGGAGGCGGA CGTTGCACAG
GTGTTGAAGG AATCGTTCGA AGAAACGGGT GTCATAAAAG TGAACCTGAA GTACGCCGAA
TGGTCCACCT ACGTGGAATA TTTCCTGAAC GGTACCATGG GACTGTTCCT GCTTGGTTGG
TATCCGGATT ATCTCGACCC AGATGACTAC GTGTGGCCCT TCCTGAGCGA AAGTGGTGCA
AAATCTCTGG GAAGTTTCTA TTCGAATCCC GAAGTGGAAA ACCTCATGAT AGAAGCCAGA
AAGCTCACCG ATCAGGAAAA GAGAGCCGAG ATCTACTACA AGGTCCAGGA GATCCTCGCC
AGGGACGTTC CCTACATACC GCTCTGGCAG GGTGTTGCCA CCTGTGCAGC GAAAAAGCAG
GTGAAGGGGA TCCTGCTTGA GCCCACACAG ATATTCAGAT ACTACATACT CTACTGGGAA
GAGTGA
 
Protein sequence
MKRLVWLFLV LAVTLSFAAK DIIVVGTTDK IRTLDPANCY DYFSSNILQN VLVGLVDYEI 
GTSNLKPVLA ERWEVDETGT VYTFYLRKDA KFEDGTPIDA HVFKYSFDRV MRLNGDPAFL
LSDIVEKTEV VDDYTFRVTL KYPFSAFVSV LGYTVAYPVN PKVYPADSFY EGIPSASGPY
RIKEWIRDVR IVLEANPNYF GEKPKTKTIV INFYESASTL RLALETGEID VAYRHLDPRD
IIDLEGREDI VVYKGNSPQI RYLVINVTQP PFDNVKVRQA LAYAVNRSVI VEDVFAGLAK
PLYSMIPEGM WGHKSVFPER DLEKAKALLK EAGYDENNPL VIDLWYTPTH YGTTEADVAQ
VLKESFEETG VIKVNLKYAE WSTYVEYFLN GTMGLFLLGW YPDYLDPDDY VWPFLSESGA
KSLGSFYSNP EVENLMIEAR KLTDQEKRAE IYYKVQEILA RDVPYIPLWQ GVATCAAKKQ
VKGILLEPTQ IFRYYILYWE E
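
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the nucleotide sequence into protein. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code, and it stops at the first stop codon. The function names (`clean`, `gc_fraction`, `translate`) are hypothetical helpers written for this illustration, not part of any database tool.

```python
from itertools import product

# Codon table built from the conventional TCAG ordering. Translation
# table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as displayed in the record.
first_block = "ATGAAAAGGC TTGTCTGGTT GTTTCTGGTT CTGGCGGTGA CTCTTTCTTT TGCAGCAAAG"
print(translate(first_block))  # MKRLVWLFLVLAVTLSFAAK
```

Passing the full gene sequence to `translate` in the same way should reproduce the protein sequence listed above, and `gc_fraction` can be compared with the GC content field of the record.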