Gene Tpet_0848 details

Gene Information

Locus tag: Tpet_0848
Symbol:
ID: 5171732
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga petrophila RKU-1
Kingdom: Bacteria
Replicon accession: NC_009486
Strand:
Start bp: 862674
End bp: 865010
Gene Length: 2337 bp
Protein Length: 778 aa
Translation table: 11
GC content: 50%
IMG OID: 640563367
Product: glycoside hydrolase family 3 protein
Protein accession: YP_001244443
Protein GI: 148269983
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
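
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive 1-based coordinates and that the CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming it ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(862674, 865010)
print(length, protein_length(length))  # 2337 778
```

Both results agree with the Gene Length and Protein Length fields listed in the record.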


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAACTGT ACAGGGATCC TTCGCAACCC ATTGAAGTGA GAGTGAGAGA TCTTCTTTCC 
AGAATGACGC TGGAAGAGAA AGCAGCCCAG CTTGGGTCTG TCTGGGGTTA CGAACTGATA
GACGAGAGAG GAAAGTTCAG TAGAGAAAAA GCAAAAGAAC TCCTCAAAAA TGGTATTGGT
CAGGTCACGA GGCCCGGTGG ATCGACGAAC CTTGAACCTC AAGAAGCCGC GGAACTTGTG
AACGAAATAC AGAGATTTCT CGTGGAAGAA ACACGCCTTG GAATTCCTGC GATGATACAC
GAGGAATGTC TCACCGGTTA CATGGGACTT GGAGGAACCA ACTTCCCTCA GGCAATAGCA
ATGGCGAGTA CATGGGATCC GGATCTCATA GAAAAAATGA CCACCGCCAT CAGAGAGGAT
ATGAGAAAGA TAGGAGCACA TCAGGGTCTC GCACCCGTTC TGGATGTCGC AAGAGACCCG
AGGTGGGGAA GAACGGAAGA AACGTTCGGA GAATCTCCCT ACCTGGTGGC GAGGATGGGA
GTCTCTTACG TAAAAGGCCT CCAAGGTGAG GATATCAAAA AAGGTGTCGT TGCCACAGTG
AAACACTTCG CCGGATACAG CGCTTCTGAA GGCGGAAAGA ACTGGGCCCC AACGAACATT
CCGGAGAGGG AATTCAAAGA GGTCTTTCTC TTTCCGTTCG AAGCGGCTGT TAAAGAAGCG
AATGTGCTTT CTGTGATGAA CTCCTACAGC GAGATAGACG GTGTCCCATG TGCAGCGAAC
AGGAAACTCC TCACAGACAT TCTCAGAAAA GACTGGGGGT TCAAAGGAAT CGTCGTTTCT
GACTATTTCG CTGTGAAAGT TCTGGAAGAT TATCACAGAA TAGCAAGGGA TAAGTCAGAA
GCCGCAAGAC TCGCGCTCGA AGCGGGGATA GATGTTGAAC TTCCGAAGAC AGAATGTTAT
CAATATTTGA AAGACCTTGT TGAAAAAGGC ATCATCTCCG AAGCTTTGAT CGACGAGGCA
GTCGCCAGGG TGCTGAGGCT GAAGTTCATG CTCGGACTCT TCGAGAATCC CTACGTTGAG
GTGGAAAAAG CGAAAATAGA AAGCCACAAA GACATCGCAC TCGATATAGC AAGGAAATCC
ATTATCCTTC TCAAGAATGA TGGAATTTTG CCTCTTCAGA AAAACAAAAA AGTTGCCCTG
ATCGGACCGA ACGCGGGTGA GGTGAGAAAT CTCCTCGGAG ATTACATGTA TCTTGCACAC
ATAAGGGCTC TCCTCGACAA CATAGACGAT GTCTTTGGAA ATCCTCAGAT CCCGAGAGAA
AACTACGAAA GACTGAAGAA GAGCATAGAA GAACATATGA AGAGTATTCC GAGTGTTCTC
GACGCCTTCA AAGAAGAAGG GATCGAATTC GAATACGCAA AGGGCTGTGA AGTGACAGGG
GAAGACAGAA GCGGCTTCGA AGAGGCGATA GAAATTGCAA AGAAATCCGA CGTTGCCATC
GTTGTCGTAG GAGACAAATC TGGACTCACC CTTGATTGCA CAACCGGTGA GTCCAGAGAC
ATGGCAAACC TCAAGCTTCC AGGAGTCCAG GAAGAACTCG TCCTCGAAGT CGCAAAGACA
GGAAAACCCG TCGTTCTTGT CCTCATCACG GGAAGACCCT ACTCACTCAA AAACGTCGTC
GACAAGGTGA ACGCCATCCT TCAGGTGTGG CTTCCGGGGG AGGCGGGAGG AAGAGCGATC
GTTGACATCA TCTACGGAAA GGTGAATCCC TCTGGAAAAC TCCCGATCAG TTTCCCAAGA
AGCGCTGGTC AGATTCCTGT CTTCCACTAC GTCAAACCCT CCGGAGGAAG GTCTCACTGG
CACGGAGACT ACGTAGACGA GAGCACAAAG CCTCTTTTCC CGTTTGGGCA CGGTTTGTCT
TACACGAAGT TCGAGTACAG CAACCTCCGG ATCGAGCCGA AGGAAGTGCC ACCGGCCGGT
GAGGTGGTGA TAAAGGTGGA CGTGGAAAAC ACTGGAGACA GAGACGGAGA CGAGGTGGTT
CAGCTTTACA TCGGTCGTGA GTTTGCGAGC GTCACAAGGC CTGTGAAAGA GCTGAAGGGC
TTCAAGAGGG TTTCTTTGAA GGCGAAAGAG AAGAAGACTG TCGTGTTCAG GCTTCACATG
GACGTGCTCG CCTACTACGA CAGAGACATG AAACTCGTGG TCGAACCGGG TGAATTCAAA
GTGATGGTAG GAAGCTCTTC TGAAGACATC AGACTCACTG GTTCCTTCAC TGTGGTCGGT
GAAAAAAGAG AAGTGGTGGG AATGAGGAAA TTCTTCACGG AAGCCTGCGA GGAGTGA
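
The gene sequence is displayed in space-separated blocks of ten bases, so it needs to be cleaned before its length or GC content can be compared with the fields in the record. The sketch below shows one way to do that; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first displayed line is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and uppercase the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used here as sample input.
first_line = "ATGGAACTGT ACAGGGATCC TTCGCAACCC ATTGAAGTGA GAGTGAGAGA TCTTCTTTCC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `len` and `gc_percent` gives values that can be compared with the Gene Length and GC content fields.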
 
Protein sequence
MELYRDPSQP IEVRVRDLLS RMTLEEKAAQ LGSVWGYELI DERGKFSREK AKELLKNGIG 
QVTRPGGSTN LEPQEAAELV NEIQRFLVEE TRLGIPAMIH EECLTGYMGL GGTNFPQAIA
MASTWDPDLI EKMTTAIRED MRKIGAHQGL APVLDVARDP RWGRTEETFG ESPYLVARMG
VSYVKGLQGE DIKKGVVATV KHFAGYSASE GGKNWAPTNI PEREFKEVFL FPFEAAVKEA
NVLSVMNSYS EIDGVPCAAN RKLLTDILRK DWGFKGIVVS DYFAVKVLED YHRIARDKSE
AARLALEAGI DVELPKTECY QYLKDLVEKG IISEALIDEA VARVLRLKFM LGLFENPYVE
VEKAKIESHK DIALDIARKS IILLKNDGIL PLQKNKKVAL IGPNAGEVRN LLGDYMYLAH
IRALLDNIDD VFGNPQIPRE NYERLKKSIE EHMKSIPSVL DAFKEEGIEF EYAKGCEVTG
EDRSGFEEAI EIAKKSDVAI VVVGDKSGLT LDCTTGESRD MANLKLPGVQ EELVLEVAKT
GKPVVLVLIT GRPYSLKNVV DKVNAILQVW LPGEAGGRAI VDIIYGKVNP SGKLPISFPR
SAGQIPVFHY VKPSGGRSHW HGDYVDESTK PLFPFGHGLS YTKFEYSNLR IEPKEVPPAG
EVVIKVDVEN TGDRDGDEVV QLYIGREFAS VTRPVKELKG FKRVSLKAKE KKTVVFRLHM
DVLAYYDRDM KLVVEPGEFK VMVGSSSEDI RLTGSFTVVG EKREVVGMRK FFTEACEE
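
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch, assuming the standard codon assignments, which translation table 11 shares for internal codons. It does not handle the alternative start codons that table 11 permits (this gene starts with ATG) and raises `KeyError` on ambiguous bases. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGAACTGT ACAGGGATCC TTCGCAACCC"))  # MELYRDPSQP
```

The output matches the first ten residues of the protein sequence listed above.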