Gene TRQ2_0622 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTRQ2_0622 
Symbol 
ID6092039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga sp. RQ2 
KingdomBacteria 
Replicon accessionNC_010483 
Strand
Start bp626023 
End bp627999 
Gene Length1977 bp 
Protein Length658 aa 
Translation table11 
GC content48% 
IMG OID642487808 
Productextracellular solute-binding protein 
Protein accessionYP_001738658 
Protein GI170288420 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.16709 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAAAG TCTTTGTTTT TCTGCTGGTC TCATTGTTCA TTGTGATTGG TCTTTCCTGG 
AGTGTCTACG CAACACCTGA AGACTACTAC AAAGCCACGG GTAAAAAGAT TGAAAAGTTC
AACGAGGCAC CCATGCTTGC CGAACTTGTG AAACAAGGAA AGCTCCCACC GGTCGAGCAG
AGGCTTCCAA AGGAACCTCT TGTTGTAGTT CCTGAGGAAA GTGTAGGACA GTACGGTGGC
ACCTGGAGGA GAGTCTGGAA AGGGCCTTCT GACAGGTGGG GTATTCCCAG GATCAACCAA
GCGAGTCTTG TGTTCTGGGA CAAGAACGGA GAGAAGTTTG TACCGGGAGT AGCGAAAAGC
TGGGACATAC TTGAAAATGG AAAAGTATAT GTCTTTCATC TCAGAGAAGG CATGAAGTGG
TCCGATGGTC ATCCCTACAC TTCCGAAGAT ATTCTTTTCT GGGTCGACGA TATACTGGGG
AACGATGAAC TCACCCCTGC GAAACCCGCC TGGTACAGAC TTCTCGACAG GGTGGAAGCT
CCTGATCCCT ACACGGTGAA ATTCGTGTTC AAACAACCGT ACGCTCTGTT TCTGCTTCAG
GTGGCGAACA GAGGATTCAC AGGCTCTCCC AAACATTTCT TGAAGCAATT CCACCCCAAT
TACACCCCTA TGGAAGAGAT CGAAAAGAAA ATGGTTGAGG GTGTTCACAA CACATGGGTA
GATCTTTTCA ATGACAAAAG TGATTTTCTC GAAAGTCTTG ATCTGCCCGT TTTGACACCC
TGGAAACCCA TCACAGATCC AACAGAACAG TTCTACATAC TCGAGAGAAA TCCATACTTC
TGGGCGGTTG ACATCGAAGG GAATCAACTG CCTTACATCG ATAGGATCAG ACACGAATAC
GTTCAAAGTA GTGAGGTTAT CATGTTGAAA GCAATCTCCG GAGAAATCGA TATGCAGTGG
AGGCACATTG GTCTTCTGGG ACCCGGCCCG GGTGTTTTGC CGCTTCTTCT CGAAAACGCC
AAGAGCGGTG GTTACAAGGT TTTGAGATGG AAAACCGACA ATGGATCCGT GAGCATGGTG
ATGCTGAACA TCTCCGATCC ACCAGATCCT GTACTCGGAG AAGTTTTCAG GGATGTGAGA
TTCAGACAGG CGCTTTCACT TGCTATCAAC AGAGAGGAGA TCAACGAGAT TCTCTTCAAC
GGACTCGCAG AACCAAGGCA AGCCTCGTTC GTGAGTGGAT CTCCGTACTA CGATCCCGAA
TGGGAGAAGG CGTATGTGGA GTATGATCCA GACAGAGCAA ACAAACTCCT CGACGAAATG
GGACTGAAGT GGGACAGCAA ACGCGAGTAC AGACTTCTTC CCAATGGAAA ACCTCTGAGA
TTCACCGTAC AGGTCACTGG TCAGACCCAC GTTGATGTCT GGACGATGGT GAAAGAATAC
TGGAAACAGA TAGGTGTGTG GGTCGAAATC GAAAACCTCG AAAGGTCACT CTATGATTCG
AGGCTCAGTG CACACGATTT CGATGCACAG GCTTGGGTGA TGGACAGGGC AAGTCAGCCC
CTTGTGGATC CCCTGTGGAT CATTCCTGGA AGCACGGAGT ACGCTTCCGC GTGGTACATT
GGCTGGGCTG ATTGGGCTGG TTCTTACCTT GAAGGAGAAG AATCTCTGAA GGAATATCTA
CAGCAGGAAG ATGCGATCGT TCCACCCGAG GGTATAAAGG AGACTCTCGA AAAGCTACTC
GACGTATGGA AGGAGATTCA AAACACTTCC GATCCTGAAA AGATCAAGGA ACTCATGAAA
GAAGTAACGA AGATCCACAG GGAAAATCTC TGGATGATAG GAACAGTGGG CGAAGATATA
TCTCCTGCCA TCGTTAAGAA CAACTTCAAG AACGTACCGG AGGAACTGGT GACGGCAACG
CCGTTCTTCA GTCCATGGAA CGCCATGCCG ATACAATTCT ACATAGAACA GAAATGA
 
Protein sequence
MRKVFVFLLV SLFIVIGLSW SVYATPEDYY KATGKKIEKF NEAPMLAELV KQGKLPPVEQ 
RLPKEPLVVV PEESVGQYGG TWRRVWKGPS DRWGIPRINQ ASLVFWDKNG EKFVPGVAKS
WDILENGKVY VFHLREGMKW SDGHPYTSED ILFWVDDILG NDELTPAKPA WYRLLDRVEA
PDPYTVKFVF KQPYALFLLQ VANRGFTGSP KHFLKQFHPN YTPMEEIEKK MVEGVHNTWV
DLFNDKSDFL ESLDLPVLTP WKPITDPTEQ FYILERNPYF WAVDIEGNQL PYIDRIRHEY
VQSSEVIMLK AISGEIDMQW RHIGLLGPGP GVLPLLLENA KSGGYKVLRW KTDNGSVSMV
MLNISDPPDP VLGEVFRDVR FRQALSLAIN REEINEILFN GLAEPRQASF VSGSPYYDPE
WEKAYVEYDP DRANKLLDEM GLKWDSKREY RLLPNGKPLR FTVQVTGQTH VDVWTMVKEY
WKQIGVWVEI ENLERSLYDS RLSAHDFDAQ AWVMDRASQP LVDPLWIIPG STEYASAWYI
GWADWAGSYL EGEESLKEYL QQEDAIVPPE GIKETLEKLL DVWKEIQNTS DPEKIKELMK
EVTKIHRENL WMIGTVGEDI SPAIVKNNFK NVPEELVTAT PFFSPWNAMP IQFYIEQK