Gene TRQ2_0890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTRQ2_0890 
Symbol 
ID6092320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga sp. RQ2 
KingdomBacteria 
Replicon accessionNC_010483 
Strand
Start bp921899 
End bp923872 
Gene Length1974 bp 
Protein Length657 aa 
Translation table11 
GC content50% 
IMG OID642488088 
Productextracellular solute-binding protein 
Protein accessionYP_001738925 
Protein GI170288687 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGAAGGT TGCTTGTTTT GCTGTCGCTT GTGTTCATGG TTGTTTTAGC TCTTGCTGCC 
AACGACACAT GGGTCTTCTA CGCAACACCG GAAGAGTACT ACAAGGCTAC AGGAAAGAAG
ATTACCGAGT ACCATGAATC ACCGATGCTG ACCAAACTCG TCGAAGAAGG AAAGCTTCCA
CCCGTCGAAC AGAGACTTCC GGAGGAACCG CTCGTGGTTC AGCCTGTTGA AAAAGTTGGA
CAGTTCGGTG GTACCTGGAG AAGGGTCTGG AAAGGGCCTT CTGACAGGTG GGGTATTTCC
AAACTCATCG AAGTGAAACT CGCGTTCTGG GACAAAGAGG GTGGAAAACT CGTTCCGGGG
CTTGCGAAGA GCTGGGAAGT TCTGGAGAAC GGAAGGGTAT ACATCTTCCA TCTGAGAAAG
GGTGTGAAGT GGTCCGATGG AGTACCGTAC ACGGCCCACG ATATCGTGTT CTGGGTTAAC
GACATCGTAG GAAACGACGA TATCACACCT TCGAAACCTG ACTGGTACAA CATTGGTGTG
AAAGTCGAGG CACTCGATGA TTACACGGTG AAGTTCGAAT TCAGCAAGCC TTATGGATTG
TTCCTTCTGA AAGTTCCATA CGGTGGATTT ACCGGAGCAC CAGCACACTA TCTGAAACAG
TTCCATCCAA AGTACACACC GATGGAAGAA ATAGAGAAGA AGATGGTGGA AGGTGTGCAC
AACACCTGGG TGGACCTCTT CAACGATAAA AACGACTTCC TTGAAAACAC CGAGCTTCCA
ACACTATCAC CGTGGAAGCC TATCACCGAT CCAACAGAAC AGTTCTACAT ACTCGAGAGA
AACCCGTACT TCTGGGCGGT TGATATCGAA GGGAATCAGC TTCCATACAT CGATTACGTG
AGGCACGAAT ACGTCAAGAA CGACGAAGTC ATACTCCTGA AAGCGATCTC CGGTGAAATC
GATATGCAGT GGAGACATAT CGGAGGACTG GGAGCGGGAG CAGGAAACTT CACACTGCTC
ATGGAGAACG CCCAGAGTGG AGGATACAGG GTGCTGAAAT GGATCGCTGC GAACGGTTCT
GCCAGCAGAA TCTCATTGAA CTACGCTCAC TCCGACGAGG TGCTGAGGAA GGTCTTCAAC
GATGTGAGGT TCAGGCAGGC TCTCTCACTC GCTATCAACA GGGAAGAAAT CAACGAGATT
CTCTTCAACG GTCTCGCTGA GCCAAGGCAG GCATCTCTCG TGAGTGGATC CCCATACTTC
GATCCCGAGT GGGAAAAAGC TTACGCAGAG TACGATCCAG ACAGAGCGAA CAAGCTTCTC
GATGAGATGG GGCTGAAGTG GGATGACAAG CACGAATACA GACTCTTACC AGATGGCAGA
CCACTCCGAT TCACCATCAC TGTGACTGGA CAGTTTCATG TTGACGTCTG GACGATGGTG
AAGGAATACT GGAGACAGAT AGGGGTCTGG GTGGAGATCG AGAACGTTGA AAGGTCTCTC
TTCTACGAAA GAGCCGATGC CGGTGACTTC GATGCGATGG TGTGGAACAT GGATAGGGCT
GCTCAACCAC TCTCTTCACC GATGGTCATC TTCCCGGGTT CCGAGGACAT AGCAGACTTC
TGGTACATAG GATGGAGTGA CTGGATCTCG TACTACATCG ACAAGAACAT AAGAGGCGTG
GAACCCGAAG AAGTACCCGA AGGGCCTGAA CCACCAGAGG TCGTCTACAG ACTTGTCGAT
CTGTACTACC AGATAGCCTC CACGCCGGAT CCTGATAAAA TCAAAGAGCT CATGGCAGAA
GCAACGAAGA TCCATAGAGA AAATCTCTGG ATGATAGGAA CCGTCGGAGA AGACCTTTCG
CCTGCCATAG CGAAGAACAA CTTCAGAAAC GTACCAGAAT TTCTCGTAAC GGACGATGTG
TTGAGAACTC CTCTGAATGC CATGCCGATG CAGTTCTTCA TCGAACAGAA ATGA
 
Protein sequence
MRRLLVLLSL VFMVVLALAA NDTWVFYATP EEYYKATGKK ITEYHESPML TKLVEEGKLP 
PVEQRLPEEP LVVQPVEKVG QFGGTWRRVW KGPSDRWGIS KLIEVKLAFW DKEGGKLVPG
LAKSWEVLEN GRVYIFHLRK GVKWSDGVPY TAHDIVFWVN DIVGNDDITP SKPDWYNIGV
KVEALDDYTV KFEFSKPYGL FLLKVPYGGF TGAPAHYLKQ FHPKYTPMEE IEKKMVEGVH
NTWVDLFNDK NDFLENTELP TLSPWKPITD PTEQFYILER NPYFWAVDIE GNQLPYIDYV
RHEYVKNDEV ILLKAISGEI DMQWRHIGGL GAGAGNFTLL MENAQSGGYR VLKWIAANGS
ASRISLNYAH SDEVLRKVFN DVRFRQALSL AINREEINEI LFNGLAEPRQ ASLVSGSPYF
DPEWEKAYAE YDPDRANKLL DEMGLKWDDK HEYRLLPDGR PLRFTITVTG QFHVDVWTMV
KEYWRQIGVW VEIENVERSL FYERADAGDF DAMVWNMDRA AQPLSSPMVI FPGSEDIADF
WYIGWSDWIS YYIDKNIRGV EPEEVPEGPE PPEVVYRLVD LYYQIASTPD PDKIKELMAE
ATKIHRENLW MIGTVGEDLS PAIAKNNFRN VPEFLVTDDV LRTPLNAMPM QFFIEQK