Gene TRQ2_0512 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTRQ2_0512 
Symbol 
ID6091927 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga sp. RQ2 
KingdomBacteria 
Replicon accessionNC_010483 
Strand
Start bp513529 
End bp514800 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content44% 
IMG OID642487699 
Productextracellular solute-binding protein 
Protein accessionYP_001738551 
Protein GI170288313 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00024476 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGAAAC TCGCAGTTGT TCTTTTGATC TCTTTGATAC TTCTACCGGT GCTGGTCAGT 
GCTGTGAAAC TCACCATATG GTGTGGAGGA GGTACTGAAA GAAAGGGTCT GGAAGCGGTG
GTTGCTGAAT ACAAGAAATT GAATCCAGAT GTCGAGATCG AACTTGTGGA TGTTCCTTAC
AGTTCTTATG AGCAGAAGAT AAGATTGGGA ATACTGAGTG GTGATCTCCC AGATCTTGTA
ACAATTACGT ACCCATATGC ACCTGGATAC ATGCAGTACA TGCTCGATCT GAGACCTTAC
ATTCAGAAGT ATCTTGGAAT CACACCAGAT GATTTCCTAA AATCTCTCTA CGATGTTGTA
AGAATTCGTA TAACCACAAA CGAAGGAGAA ATCAAATATG TTCCTTTGCA CTTCACGGCA
CAGTGTCTCT GGGTAAACAA GGATTATTTC GAAAAAGCAG GCGTCCCCTA TCCACCTTTT
GGAGGAAGAG AAGAACCCTG GACATGGGAA GAATTCATTT CGGCTTTGAA AAAAGTGAAA
GAAGCAAATG GCATTCCTTA CGCTCTCTCA ATGCAGAGAA CTGCTGAAAG ATTGTTCGCA
TACATGGCAA TCAGAGGAGT CAAAATAATT GATGAAAATC TCGATTTTAC ACTGGATAAG
GATCCAAGAG CAAAACAGCT GCTTCAGGAT TTTGCAAACA TGTTTAAAGA AGGTCTCATG
GTTCCCGCCG AATGGATCTC TGCTCAGGAT CCAAACATGG CATTTGGAGG AGGATTGACA
GCAGTTCTGT GGGCTGGAAG CTGGAGCACA GCCGATCTTC TTTCGATTGA AGGTAAAAAC
TTTGTGCCTG CTTATCTTCC AAAGGATATG TACTGGCTGA GTCTCGAAGG TGGAAGGTTC
TTTGGTACCT TCAAAACGGG CGATAAAGCC AGAGAAGAAG CAGCCGCTAA ATTCGCACTG
TGGGCTGGTT GGAAAGGTCT CGGCTATGAC ATCTATTTGA AGACTACATT TCATATGTCT
GCCTACAAGA ATCACCATGT GGACTATGGA AATCCAATCA TGGATCAGGT TCAGAAAGTC
TGTGGCGATA TGATAGCTAG TACACCGGAA TGGGTTGTCA CCATAAGGAA TTCTGTTGCC
TGGTCCAGAT TGCAGTCTCC AATTGTCAGT CAGATGTCCG CTCTGGTGGC GGGTCAAACA
ACAGTGGACA ATGTTATAAA GGCGCTTCGA AATGAATACG ACAAAATAGT AGCCGAAGTT
GGAAAGAAAT AA
 
Protein sequence
MRKLAVVLLI SLILLPVLVS AVKLTIWCGG GTERKGLEAV VAEYKKLNPD VEIELVDVPY 
SSYEQKIRLG ILSGDLPDLV TITYPYAPGY MQYMLDLRPY IQKYLGITPD DFLKSLYDVV
RIRITTNEGE IKYVPLHFTA QCLWVNKDYF EKAGVPYPPF GGREEPWTWE EFISALKKVK
EANGIPYALS MQRTAERLFA YMAIRGVKII DENLDFTLDK DPRAKQLLQD FANMFKEGLM
VPAEWISAQD PNMAFGGGLT AVLWAGSWST ADLLSIEGKN FVPAYLPKDM YWLSLEGGRF
FGTFKTGDKA REEAAAKFAL WAGWKGLGYD IYLKTTFHMS AYKNHHVDYG NPIMDQVQKV
CGDMIASTPE WVVTIRNSVA WSRLQSPIVS QMSALVAGQT TVDNVIKALR NEYDKIVAEV
GKK