Gene Tpet_0639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpet_0639 
Symbol 
ID5171357 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga petrophila RKU-1 
KingdomBacteria 
Replicon accessionNC_009486 
Strand
Start bp643236 
End bp644663 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content47% 
IMG OID640563146 
Productglycoside hydrolase family protein 
Protein accessionYP_001244235 
Protein GI148269775 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3507] Beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000370471 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATCC TTTTGATGGC TACTTTGTTT CTTATCTTGC CGAGTGGTTG TCTTGTTTTA 
GGAATTGAGG ATAATATTCC AAGTTTCAGA TGGGCTACAG TCCACGATCC ATCTGTTATA
AAAGCTGGCG ATACTTTCTA CGTTTTTGGC TCACACCTTC AAATTGCCAA ATCGAACGAT
CTGATGCACT GGACACAGGT GAACGTAGGG GTTTATAACA ACAACCCCAT AATCCCAAAT
ATATTCACCG AGCTGAAAGA AACTTTCGAA TGGGCTGAAA CAAACACTCT TTGGGCACCC
CATGTGATTC AGCTTCAAGA TGGTAGGTAC TACTTCTACT ACTGTGCGTG CAGAGGAGAT
TCGCCACGAT CAGCAATGGG GATCGCAGTC TCCGATAACA TCGAGGGCCC TTACAGAAAC
CTCGGGATAA TTCTGCGATC TGGGTATCGC CCCGGAGAAG GAATGTGTGA AGAAGGAGTA
CCATACGATG CGAGAATCCA TCCAAACGTT GTGGATCCAC ATGTTTTTTA CGACAAAGAA
GGTAATCTTT GGATGGTTTA CGGGTCCTAC TCCGGTGGCA TTTACATACT AAAGCTCGAC
CCAGAGACGG GTTTTCCTCT CCCAGGACAG GGGTACGGAA AGAAACTCAC AGGAGGAAAT
CACAGCAGGA TCGAAGGTCC CTTTATCCTC TACAGTCCTG ATACAGATTA TTACTATCTC
TTTCTGAGCT TTGGAGGGCT CGACTACAGG GGAGGATACA ACATCAGGGT TGCAAGGTCC
AAAAACCCCG ACGGTCCTTA TTATGACGCA GAAGGTCGAA ACATGATAGA TTGTTACGGC
CCGTCGTTCC TGGAAGGCAA CGATCCTTAC ATAGCACCTT TCGGTGTGAA ACTGGTGGGT
AACTTCACCC TGAGCGAAGG AAACACCATA GACTTCCGAG TGTTCGGATA CGTATCTCCG
GGGCACAACT CTGCCTATTA CGATCCAAAA ACTGGGAAGT ACTTCATCTT CTTCCACACG
AGGTTCCCCG GCAGAGGAGA GACGTACCAG CTCAGGGTCC ATCAGCTCTT CCTCAACGAA
GACGGATGGT TCGTCATGGC TCCTTTCCCA TATGCCGGTG AGACCATTGA AGATCTATCT
TTTCAAGAGA TAGCAGGGGA ATATCAACTA TTAATACATG ATAAGGAAAT GACGAACGAG
ATAAGGAAAC CCGTGAGAAT CGCTCTGAAT CCGGACGGAA CTGTCACTGG AGCTCAGACT
GGTGAATGGG AGAAGAAGGG ACATTATATA ACTCTGAAAC TCGAAGGAGA GATCTACAAA
GGAGTGACCT TGAAACAGTG GCACTATTCC GAGAAAAAGT GGGTGACAGT GTTTTCCGCT
CTATCACAGA AGGGAGTTTC AGTGTGGGGT ATAAAAACTT CTGAGTAG
 
Protein sequence
MKILLMATLF LILPSGCLVL GIEDNIPSFR WATVHDPSVI KAGDTFYVFG SHLQIAKSND 
LMHWTQVNVG VYNNNPIIPN IFTELKETFE WAETNTLWAP HVIQLQDGRY YFYYCACRGD
SPRSAMGIAV SDNIEGPYRN LGIILRSGYR PGEGMCEEGV PYDARIHPNV VDPHVFYDKE
GNLWMVYGSY SGGIYILKLD PETGFPLPGQ GYGKKLTGGN HSRIEGPFIL YSPDTDYYYL
FLSFGGLDYR GGYNIRVARS KNPDGPYYDA EGRNMIDCYG PSFLEGNDPY IAPFGVKLVG
NFTLSEGNTI DFRVFGYVSP GHNSAYYDPK TGKYFIFFHT RFPGRGETYQ LRVHQLFLNE
DGWFVMAPFP YAGETIEDLS FQEIAGEYQL LIHDKEMTNE IRKPVRIALN PDGTVTGAQT
GEWEKKGHYI TLKLEGEIYK GVTLKQWHYS EKKWVTVFSA LSQKGVSVWG IKTSE