Gene Tpet_0119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpet_0119 
Symbol 
ID5171269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga petrophila RKU-1 
KingdomBacteria 
Replicon accessionNC_009486 
Strand
Start bp113168 
End bp114571 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content46% 
IMG OID640562620 
Productglycoside hydrolase family 3 protein 
Protein accessionYP_001243724 
Protein GI148269264 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGACGTGG ATCTTGGAAA GCTGTTCTTC TGCGGTTTCA ACGACTTCAA CGAAGAAGTC 
AAGGAAATAA TCAGAAAATA CAGACCGGCC GGTATTTTGA TCTATCCAGG AGTTCTTTCG
AAAGAGTACC TTCTAATGGA TTTCATGAGT TTTCTATCGA AGGAAGGAGA TTTTCTCGTC
AGTTCCGATC ACGAGGGTGG CCAGCTTGAG GTGTTGAAAT ACGTTCCCTC GTCTCCGGGA
AATCTTGCCT TTGGGAAAAA CTCACCAGAT GTGACTTACA GATATTCCAA GATCGCAGGA
AAGATCATGG AGATTGTGGG GTTCAACATG GTTTTTGCTC CTGTTCTGGA TCTTCTTTCT
GAGGAAAGCT CTTCGGTGAT CGACATGAGA AGCTACGGCT CAGATCCCAA AATCGTAGCC
GAGCACGGGG CAAAAGCCTG TGAAGGTTAT CTGGAAGGTG GAGTTATGCC CTGCATCAAG
CACTTTCCAG GTCACGGAAG AGCAAGAGAA GACTCTCACC TCACCCTTCC TGTGGTCGAT
GCACCCTTTG AAAAACTCTG GGAAGAGGAT CTTCTGCCGT TCAGAAAGGT GCTGGAAAGG
GAGAAAAAGG TCACGGTCAT GACGGCCCAC GTCAGATACT CTTCGATAGA CAGTCTCCCG
GCTACTCTTT CGGAGAAGAT CATAACGGAC GTTCTCAGAA AAAAGATCGG TTTCGACGGT
CTTGTGATCA GTGACGCTAT GGAAATGAGC GCTGTGTCGA ACAATTTCTC TGTTGAAGAG
ATTGTGAGTC TCTTTCTGAA CGCGGGAGGA AACATGATCC TTCTCGGTGA TTACAGAAAT
CTTCCGGTTT ACTATGAAAC GCTGGTGAAA CTCCTCGAGG ATGGAAAGGT CCAGAAGGAC
AAAGTGGAGC GCTCCATAAG GATGGTGGAA AAATATCTTG CTTTTGCGAA GAAAAACAGC
GGTGTTGGTT TCCTCGCCGA TTCTTCGGCA AAGGCTGTGG AATTCCTCGG TTTTGAAAAG
ATAGATCATA CCAGTGAAGT GACTCTTCTC GTTCCTTCCA GTGAGAATCT GAGTCAGGCA
GACACCACGG GGGGCGATTA CGATCAGATT CCAGAGATCG TTTCCAGATT TTTCGAAGTC
GAGAATGTTG TTCGATACAC CGTAGAAGAC GGTCCCGAGT TCGTTGAAGG TGATTTGATC
TTCGATTTTG TAGCCGACAT ACCGAACGAA AAGGCTTTGA AAGCCCATCT GAGCCTTCCG
GCAGAAAAGA CCGTTTACTT CGTTCTGAGA AATCCGTTCG ATGTCAGGTA TTTCGAGGGA
AGAAAGATAG TCGTCACAAG ATCGACGAAA CCCATTTCTA TCTATAAATC CTTAGAACAT
TTTTTAGGGA GGTGTGATTC ATGA
 
Protein sequence
MDVDLGKLFF CGFNDFNEEV KEIIRKYRPA GILIYPGVLS KEYLLMDFMS FLSKEGDFLV 
SSDHEGGQLE VLKYVPSSPG NLAFGKNSPD VTYRYSKIAG KIMEIVGFNM VFAPVLDLLS
EESSSVIDMR SYGSDPKIVA EHGAKACEGY LEGGVMPCIK HFPGHGRARE DSHLTLPVVD
APFEKLWEED LLPFRKVLER EKKVTVMTAH VRYSSIDSLP ATLSEKIITD VLRKKIGFDG
LVISDAMEMS AVSNNFSVEE IVSLFLNAGG NMILLGDYRN LPVYYETLVK LLEDGKVQKD
KVERSIRMVE KYLAFAKKNS GVGFLADSSA KAVEFLGFEK IDHTSEVTLL VPSSENLSQA
DTTGGDYDQI PEIVSRFFEV ENVVRYTVED GPEFVEGDLI FDFVADIPNE KALKAHLSLP
AEKTVYFVLR NPFDVRYFEG RKIVVTRSTK PISIYKSLEH FLGRCDS