Gene TRQ2_0871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTRQ2_0871 
Symbol 
ID6092301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga sp. RQ2 
KingdomBacteria 
Replicon accessionNC_010483 
Strand
Start bp897714 
End bp900050 
Gene Length2337 bp 
Protein Length778 aa 
Translation table11 
GC content50% 
IMG OID642488069 
Productglycoside hydrolase family 3 protein 
Protein accessionYP_001738906 
Protein GI170288668 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACCGT ACAGGGATCC TTCGCAACCC ATCGAAGTGA GAGTGAGAGA TCTTCTTTCC 
AGAATGACGC TGGAAGAGAA AGTAGCCCAG CTTGGGTCTG TCTGGGGTTA CGAACTGATA
GACGAGAGGG GAAAGTTCAG TAGAGAAAAA GCAAAAGAAC TCCTCAAAAA TGGTATTGGT
CAGGTCACGA GGCCCGGTGG ATCAACGAAC CTTGAACCTC AAGAAGCCGC GGAACTTGTG
AACGAAATAC AGAGATTTCT CGTGGAAGAA ACACGTCTTG GAATTCCTGC GATGATACAC
GAGGAATGTC TCACCGGTTA CATGGGACTT GGAGGAACCA ACTTCCCTCA GGCAATAGCA
ATGGCGAGTA CATGGGATCC AGATCTCATA GAAAAAATGA CCACCGCCAT CAGAGAGGAT
ATGAGAAAGA TAGGAGCACA TCAGGGTCTC GCACCTGTTC TGGATGTCGC AAGAGACCCG
AGGTGGGGGA GAACAGAAGA GACGTTCGGA GAATCTCCCT ATCTGGTGGC GAGGATGGGA
GTCTCTTACG TGAAAGGCCT CCAGGGGGAG GATATCAAAA AAGGTGTCGT TGCCACAGTG
AAACACTTCG CCGGATACAG CGCTTCTGAA GGCGGAAAGA ACTGGGCACC AACGAACATT
CCAGAGAGGG AATTCAAAGA GGTCTTTCTC TTTCCGTTCG AAGCGGCCGT TAAAGAAGCG
AATGTGCTTT CTGTGATGAA CTCCTACAGC GAAATAGACG GTGTCCCATG TGCAGCGAAC
AGGAAACTCC TCACAGACAT TCTCAGAAAA GACTGGGGAT TCGAAGGAAT CGTCGTTTCT
GACTATTTCG CTGTGAAAGT TCTGGAAGAT TATCACAGAA TAGCAAGGGA TAAGTCAGAA
GCCGCAAGAC TCGCACTTGA AGCGGGGATA GATGTTGAAC TTCCGAAGAC AGAATGTTAT
CAATATTTGA AAGACCTTGT TGAAAAAGGC ATCATCTCCG AAGCTTTGAT CGACGAGGCA
GTCGCCAGGG TGCTGAGGCT GAAGTTCATG CTTGGGCTCT TCGAAAATCC CTACGTTGAG
GTGGAAAAAG CAAAGATAGA AAGCCACAGA GACATCGCGC TCGAGATAGC AAGGAAATCT
ATCATACTCC TCAAAAACGA TGGGATACTG CCGCTTTCGA AAGAAAAGAA AGTGGCGTTG
ATCGGACCGA ACGCGGGTGA GGTGAGAAAT CTCCTCGGAG ATTACATGTA TCTTGCACAC
ATAAGGGCTC TCCTCGACAA CATAGACGAT GTCTTTGGAA ATCCTCAGAT CCCGAGAGAA
AACTACGAAA GACTGAAGAA GAGCATAGAA GAACATATGA AGAGTATTCC GAGTGTTCTC
GACGCCTTCA AAGAAGAAGG GATCGAATTC GAATACGCAA AGGGCTGTGA AGTGACAGGG
GAAGACAGAA GCGGCTTCGA AGAGGCGATA GAAATTGCAA AGAAATCCGA CGTTGCCATC
GTTGTCGTAG GAGACAAATC TGGACTCACC CTTGATTGCA CAACCGGTGA GTCCAGAGAC
ATGGCAAACC TCAAGCTTCC AGGAGTCCAG GAAGAACTCG TCCTCGAAGT CGCAAAGACA
GGAAAACCCG TCGTTCTTGT CCTCATCACG GGAAGACCCT ACTCACTCAA AAACGTCGTC
GACAAGGTGA ACGCCATCCT TCAGGTGTGG CTTCCGGGGG AGGCGGGAGG AAGATCGATC
GTTGACATCA TCTACGGAAA GGTGAATCCC TCTGGAAAAC TCCCGATCAG CTTTCCAAGA
AGCGCTGGTC AGATTCCTGT CTTCCACTAC GTCAAACCCT CCGGAGGAAG GTCTCACTGG
CACGGAGACT ACGTAGACGA GAGCACAAAG CCTCTTTTCC CGTTTGGGCA CGGTTTGTCT
TACACGAAGT TCGAGTACAG CAACCTCCGG ATCGAGCCGA AGGAAGTGCC ACCGGCCGGT
GAGGTGGTGA TAAAGGTGGA CGTGGAAAAC ACTGGAGACA GAGACGGAGA CGAGGTGGTT
CAGCTTTACA TCGGTCGTGA GTTTGCGAGC GTCACAAGGC CTGTGAAAGA GCTGAAGGGC
TTCAAGAGGG TTTCTTTGAA GGCGAAAGAG AAGAAGACTG TTGTGTTCAG GCTTCACATG
GACGTGCTCG CCTACTACGA CAGAGACATG AAACTTGTAG TTGAACCCGG TGAGTTCAGG
GTGATGGTGG GAAGCTCTTC TGAAGACATA AGACTCACAG GTTCTTTCTC CGTCGTCGGT
GAAAAAAGAG AAGTGGTGGG AATGAGGAAA TTCTTCACGG AAGCCTGCGA GGAGTGA
 
Protein sequence
MEPYRDPSQP IEVRVRDLLS RMTLEEKVAQ LGSVWGYELI DERGKFSREK AKELLKNGIG 
QVTRPGGSTN LEPQEAAELV NEIQRFLVEE TRLGIPAMIH EECLTGYMGL GGTNFPQAIA
MASTWDPDLI EKMTTAIRED MRKIGAHQGL APVLDVARDP RWGRTEETFG ESPYLVARMG
VSYVKGLQGE DIKKGVVATV KHFAGYSASE GGKNWAPTNI PEREFKEVFL FPFEAAVKEA
NVLSVMNSYS EIDGVPCAAN RKLLTDILRK DWGFEGIVVS DYFAVKVLED YHRIARDKSE
AARLALEAGI DVELPKTECY QYLKDLVEKG IISEALIDEA VARVLRLKFM LGLFENPYVE
VEKAKIESHR DIALEIARKS IILLKNDGIL PLSKEKKVAL IGPNAGEVRN LLGDYMYLAH
IRALLDNIDD VFGNPQIPRE NYERLKKSIE EHMKSIPSVL DAFKEEGIEF EYAKGCEVTG
EDRSGFEEAI EIAKKSDVAI VVVGDKSGLT LDCTTGESRD MANLKLPGVQ EELVLEVAKT
GKPVVLVLIT GRPYSLKNVV DKVNAILQVW LPGEAGGRSI VDIIYGKVNP SGKLPISFPR
SAGQIPVFHY VKPSGGRSHW HGDYVDESTK PLFPFGHGLS YTKFEYSNLR IEPKEVPPAG
EVVIKVDVEN TGDRDGDEVV QLYIGREFAS VTRPVKELKG FKRVSLKAKE KKTVVFRLHM
DVLAYYDRDM KLVVEPGEFR VMVGSSSEDI RLTGSFSVVG EKREVVGMRK FFTEACEE