Gene TRQ2_0914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTRQ2_0914 
Symbol 
ID6092344 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga sp. RQ2 
KingdomBacteria 
Replicon accessionNC_010483 
Strand
Start bp946290 
End bp948104 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content50% 
IMG OID642488111 
Productextracellular solute-binding protein 
Protein accessionYP_001738948 
Protein GI170288710 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAGT CACTTGTACT GTTACTGGCT CTTTTGGTTC TTTCCAGTTT GATGGCGCAG 
GTGTCTCTGC CACGTGAAGA CACAGTCTAC ATCGGAGGAG CCCTCTGGGG TCCTGCAACC
ACCTGGAACC TCTATGCACC GCAGTCCACG TGGGGTACTG ATCAGTTCAT GTACCTTCCG
GCGTTCCAGT ACGACCTTGG AAGAGACGCT TGGATTCCTG TCATCGCAGA AAGATACGAA
TTCGTGGACG ACAAAACTCT GAGGATCTAC ATCAGACCTG AAGCAAGATG GAGCGATGGG
GTGTCTATCA CCGCAGAGGA TTTTGTCTAC GCTCTGGAGC TCACCAAAGA ACTCGGAATA
GGCCCCGGCG GTGGATGGGA TACCTACATC GAATACGTGA AAGCTGTTGA CACAAAAGTG
GTTGAATTCA AGGCGAAAGA AGAGAATCTC AATTACTTCC AGTTCCTTTC CTACTCCCTC
GGTGCACAAC CGATGCCCAA ACACGTCTAC GAAAGGATCA GAGCACAGAT GAACATAAAA
GACTGGATCA ACGACAAACC TGAAGAACAG GTTGTTTCTG GTCCTTACAA ACTCTACTAC
TACGACCCGA ACATCGTTGT GTACCAGAGA GTTGACGACT GGTGGGGTAA AGACATTTTC
GGACTTCCAA GACCCAAGTA TCTGGCTCAC GTCATTTACA AGGACAACCC GAGTGCCAGT
CTCGCGTTCG AAAGAGGCGA CATTGACTGG AACGGACTCT TCATTCCGAG TGTCTGGGAA
CTGTGGGAGA AGAAAGGCCT TCCGGTTGGA ACGTGGTACA AAAAGGAACC TTACTTCATT
CCCGACGGTG TGGGATTCGT GTACGTAAAC AATACCAAAC CTGGTTTGAG CGACCCAGCT
GTGAGAAAAG CGATCGCTTA CGCTATTCCG TACAACGAAA TGCTCAAAAA GGCTTACTTC
GGTTATGGAA GCCAGGCTCA CCCGTCCATG GTGATCGATC TCTTCGAACC GTACAAGCAG
TACATCGATT ACGACCTTGC AAAGAAAACC TTTGGAACTG AAGATGGAAG AATCCCGTTC
GATCTCGATA TGGCAAACAA GATCTTGGAC GAGGCAGGGT ACAAAAAAGG ACCTGATGGT
GTAAGGGTTG GCCCCGATGG CACGAAACTT GGTCCGTACA CGATATCTGT TCCGTACGGC
TGGACTGACT GGATGATGAT GTGTGAGATG ATCGCAAAGA ATCTGAGAAG CATAGGTATC
GATGTAAGAA CTGAATTTCC AGATTACTCT GTATGGGCAG ACAGAATGAC GAAAGGAACG
TTCGACCTCA TCATATCCTG GAGTGTTGGT CCGAGCTTCG ATCATCCGTT CAACATATAC
AGGTTTGTGC TCGATAAGAG GCTGTCTGCT CCTGTAGGTG AAGTCACGTG GGCTGGAGAC
TGGGAAAGGT ACGATAATGA TGAGGTAGTC GAACTCCTCG ACAAAGCAGT TTCTACACTC
GATCCTGAGG TGAGAAAACA GGCGTACTTC AGAATCCAGC AGATCATCTA CAGAGATATG
CCGAGCATAC CCGCGTTCTA CACGGCTCAC TGGTACGAAT ACTCGACGAA GTACTGGATC
AACTGGCCGA GCGAGGACAA TCCAGCCTGG TTCAGACCTT CTCCATGGCA CGCGGACACC
TGGCCGACTC TCTTCATCAT CTCCAAGAAG AGCGATCCAC AGCCCGTACC GTCCTGGCTT
GGAACGGTTG ATGAAGGAGG AATCGAGATA CCCACCGCGA AGATCTTCGA AGATCTCCAG
AAAGCGGCCA TGTGA
 
Protein sequence
MRKSLVLLLA LLVLSSLMAQ VSLPREDTVY IGGALWGPAT TWNLYAPQST WGTDQFMYLP 
AFQYDLGRDA WIPVIAERYE FVDDKTLRIY IRPEARWSDG VSITAEDFVY ALELTKELGI
GPGGGWDTYI EYVKAVDTKV VEFKAKEENL NYFQFLSYSL GAQPMPKHVY ERIRAQMNIK
DWINDKPEEQ VVSGPYKLYY YDPNIVVYQR VDDWWGKDIF GLPRPKYLAH VIYKDNPSAS
LAFERGDIDW NGLFIPSVWE LWEKKGLPVG TWYKKEPYFI PDGVGFVYVN NTKPGLSDPA
VRKAIAYAIP YNEMLKKAYF GYGSQAHPSM VIDLFEPYKQ YIDYDLAKKT FGTEDGRIPF
DLDMANKILD EAGYKKGPDG VRVGPDGTKL GPYTISVPYG WTDWMMMCEM IAKNLRSIGI
DVRTEFPDYS VWADRMTKGT FDLIISWSVG PSFDHPFNIY RFVLDKRLSA PVGEVTWAGD
WERYDNDEVV ELLDKAVSTL DPEVRKQAYF RIQQIIYRDM PSIPAFYTAH WYEYSTKYWI
NWPSEDNPAW FRPSPWHADT WPTLFIISKK SDPQPVPSWL GTVDEGGIEI PTAKIFEDLQ
KAAM