Gene Information |
Locus tag | Cthe_0212 |
Symbol | |
ID | 4808630 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 258469 |
End bp | 259884 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640105625 |
Product | Beta-glucosidase |
Protein accession | YP_001036646 |
Protein GI | 125972736 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | [TIGR03356] beta-galactosidase |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.292852 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTCCTC TAGGTTATAA TTATATTATT ACACTGTTTG CAAATAATAT CTTAAAGGGT GTGGTAAACA TGTCAAAGAT AACTTTCCCA AAAGATTTCA TATGGGGTTC TGCAACAGCA GCATATCAGA TTGAAGGTGC ATACAACGAA GACGGCAAAG GTGAATCTAT ATGGGACCGT TTTTCCCACA CGCCAGGAAA TATAGCAGAC GGACATACCG GCGATGTTGC ATGCGACCAC TATCATCGTT ATGAAGAAGA TATCAAAATA ATGAAAGAAA TCGGTATTAA ATCATACAGG TTTTCCATCT CATGGCCCAG AATCTTTCCT GAAGGAACAG GTAAATTAAA TCAAAAGGGA CTGGATTTTT ACAAAAGGCT CACAAATCTG CTTCTGGAAA ACGGAATTAT GCCTGCAATC ACTCTTTATC ACTGGGACCT TCCCCAAAAG CTTCAGGATA AAGGCGGATG GAAAAACCGG GACACCACCG ATTATTTTAC AGAATACTCT GAAGTAATAT TTAAAAATCT CGGAGATATC GTTCCAATAT GGTTTACTCA CAATGAACCC GGTGTTGTTT CTTTGCTTGG CCACTTTTTA GGAATTCATG CCCCTGGGAT AAAAGACCTC CGCACTTCAT TGGAAGTCTC GCACAATCTT CTTTTGTCCC ACGGCAAGGC CGTGAAACTG TTTAGAGAAA TGAATATTGA CGCCCAAATT GGAATAGCTC TCAATTTATC TTACCATTAT CCCGCATCCG AAAAAGCTGA GGATATTGAA GCAGCGGAAT TGTCATTTTC TCTGGCGGGA AGGTGGTATC TGGATCCTGT GCTAAAAGGC CGGTATCCTG AAAACGCATT GAAACTTTAT AAAAAGAAGG GTATTGAGCT TTCTTTCCCT GAAGATGACC TGAAACTTAT CAGTCAGCCA ATAGACTTCA TAGCATTCAA CAATTATTCT TCGGAATTTA TAAAATATGA TCCGTCCAGT GAGTCAGGTT TTTCACCTGC AAACTCCATA TTAGAAAAGT TCGAAAAAAC AGATATGGGC TGGATCATAT ATCCTGAAGG CTTGTATGAT CTGCTTATGC TCCTTGACAG GGATTATGGA AAGCCAAACA TTGTTATCAG CGAAAACGGA GCCGCCTTCA AAGATGAAAT AGGTAGCAAC GGAAAGATAG AAGACACAAA GAGAATCCAA TATCTTAAAG ATTATCTGAC CCAGGCTCAC AGGGCAATTC AGGACGGTGT AAACTTAAAA GCATACTACT TGTGGTCGCT TTTGGACAAC TTTGAATGGG CTTACGGGTA CAACAAGAGA TTCGGAATCG TTCACGTAAA TTTTGATACG TTGGAAAGAA AAATAAAGGA TAGCGGCTAC TGGTACAAAG AAGTAATCAA AAACAACGGT TTTTAA
|
Protein sequence | MFPLGYNYII TLFANNILKG VVNMSKITFP KDFIWGSATA AYQIEGAYNE DGKGESIWDR FSHTPGNIAD GHTGDVACDH YHRYEEDIKI MKEIGIKSYR FSISWPRIFP EGTGKLNQKG LDFYKRLTNL LLENGIMPAI TLYHWDLPQK LQDKGGWKNR DTTDYFTEYS EVIFKNLGDI VPIWFTHNEP GVVSLLGHFL GIHAPGIKDL RTSLEVSHNL LLSHGKAVKL FREMNIDAQI GIALNLSYHY PASEKAEDIE AAELSFSLAG RWYLDPVLKG RYPENALKLY KKKGIELSFP EDDLKLISQP IDFIAFNNYS SEFIKYDPSS ESGFSPANSI LEKFEKTDMG WIIYPEGLYD LLMLLDRDYG KPNIVISENG AAFKDEIGSN GKIEDTKRIQ YLKDYLTQAH RAIQDGVNLK AYYLWSLLDN FEWAYGYNKR FGIVHVNFDT LERKIKDSGY WYKEVIKNNG F
|
| |
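The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the record: it assumes that Start bp and End bp are 1-based inclusive positions and that the gene sequence includes its stop codon (the sequence shown above ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Cthe_0212.
START_BP, END_BP = 258469, 259884

length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1416 471
```

Under these assumptions the computed values agree with the Gene Length (1416 bp) and Protein Length (471 aa) fields listed in the record.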