Gene Cthe_0212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0212 
Symbol 
ID4808630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp258469 
End bp259884 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content40% 
IMG OID640105625 
ProductBeta-glucosidase 
Protein accessionYP_001036646 
Protein GI125972736 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID[TIGR03356] beta-galactosidase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.292852 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTCCTC TAGGTTATAA TTATATTATT ACACTGTTTG CAAATAATAT CTTAAAGGGT 
GTGGTAAACA TGTCAAAGAT AACTTTCCCA AAAGATTTCA TATGGGGTTC TGCAACAGCA
GCATATCAGA TTGAAGGTGC ATACAACGAA GACGGCAAAG GTGAATCTAT ATGGGACCGT
TTTTCCCACA CGCCAGGAAA TATAGCAGAC GGACATACCG GCGATGTTGC ATGCGACCAC
TATCATCGTT ATGAAGAAGA TATCAAAATA ATGAAAGAAA TCGGTATTAA ATCATACAGG
TTTTCCATCT CATGGCCCAG AATCTTTCCT GAAGGAACAG GTAAATTAAA TCAAAAGGGA
CTGGATTTTT ACAAAAGGCT CACAAATCTG CTTCTGGAAA ACGGAATTAT GCCTGCAATC
ACTCTTTATC ACTGGGACCT TCCCCAAAAG CTTCAGGATA AAGGCGGATG GAAAAACCGG
GACACCACCG ATTATTTTAC AGAATACTCT GAAGTAATAT TTAAAAATCT CGGAGATATC
GTTCCAATAT GGTTTACTCA CAATGAACCC GGTGTTGTTT CTTTGCTTGG CCACTTTTTA
GGAATTCATG CCCCTGGGAT AAAAGACCTC CGCACTTCAT TGGAAGTCTC GCACAATCTT
CTTTTGTCCC ACGGCAAGGC CGTGAAACTG TTTAGAGAAA TGAATATTGA CGCCCAAATT
GGAATAGCTC TCAATTTATC TTACCATTAT CCCGCATCCG AAAAAGCTGA GGATATTGAA
GCAGCGGAAT TGTCATTTTC TCTGGCGGGA AGGTGGTATC TGGATCCTGT GCTAAAAGGC
CGGTATCCTG AAAACGCATT GAAACTTTAT AAAAAGAAGG GTATTGAGCT TTCTTTCCCT
GAAGATGACC TGAAACTTAT CAGTCAGCCA ATAGACTTCA TAGCATTCAA CAATTATTCT
TCGGAATTTA TAAAATATGA TCCGTCCAGT GAGTCAGGTT TTTCACCTGC AAACTCCATA
TTAGAAAAGT TCGAAAAAAC AGATATGGGC TGGATCATAT ATCCTGAAGG CTTGTATGAT
CTGCTTATGC TCCTTGACAG GGATTATGGA AAGCCAAACA TTGTTATCAG CGAAAACGGA
GCCGCCTTCA AAGATGAAAT AGGTAGCAAC GGAAAGATAG AAGACACAAA GAGAATCCAA
TATCTTAAAG ATTATCTGAC CCAGGCTCAC AGGGCAATTC AGGACGGTGT AAACTTAAAA
GCATACTACT TGTGGTCGCT TTTGGACAAC TTTGAATGGG CTTACGGGTA CAACAAGAGA
TTCGGAATCG TTCACGTAAA TTTTGATACG TTGGAAAGAA AAATAAAGGA TAGCGGCTAC
TGGTACAAAG AAGTAATCAA AAACAACGGT TTTTAA
 
Protein sequence
MFPLGYNYII TLFANNILKG VVNMSKITFP KDFIWGSATA AYQIEGAYNE DGKGESIWDR 
FSHTPGNIAD GHTGDVACDH YHRYEEDIKI MKEIGIKSYR FSISWPRIFP EGTGKLNQKG
LDFYKRLTNL LLENGIMPAI TLYHWDLPQK LQDKGGWKNR DTTDYFTEYS EVIFKNLGDI
VPIWFTHNEP GVVSLLGHFL GIHAPGIKDL RTSLEVSHNL LLSHGKAVKL FREMNIDAQI
GIALNLSYHY PASEKAEDIE AAELSFSLAG RWYLDPVLKG RYPENALKLY KKKGIELSFP
EDDLKLISQP IDFIAFNNYS SEFIKYDPSS ESGFSPANSI LEKFEKTDMG WIIYPEGLYD
LLMLLDRDYG KPNIVISENG AAFKDEIGSN GKIEDTKRIQ YLKDYLTQAH RAIQDGVNLK
AYYLWSLLDN FEWAYGYNKR FGIVHVNFDT LERKIKDSGY WYKEVIKNNG F