Gene Information |
Locus tag | GWCH70_1365 |
Symbol | |
ID | 7978165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1436998 |
End bp | 1439094 |
Gene Length | 2097 bp |
Protein Length | 698 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644798296 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_002949469 |
Protein GI | 239826845 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGTCAT TTTACAAAAA AATAACGGCA GCAGTGATAG CGGTGATGAT GCTTTTTGTC CCGTGGGCTT CGCCATCAGC GAACGCGGAG GAAACAACGG CCGATTTAAT CATTTACCAA AACATTCCTG AAGTAGACGT GAAGATAAAC GGAAATCAGC TTTCATTGCG GGCCCTTCAT GTTCATCAGG AAGGATATTT CACCGAAGCA ACAGAAGGGA TAAGATGGAA ATCATCCAAT AAAACGGTAG CTGCCGTTGA TGATCATGGT GTTGTTACTT TCTTCGGAAA AAACGGAAGA ACGTTTATTA CGGTGACAGA CGGAAAACGG AAAGACCGCA TCGCTTTTAA GGTCAAAACC GCTCCTGCAT CGCAAAAAAA CAGCAAACAG AACATGAAAG TGACCGTTGT CAAAGAAAAG GGAAGCCGCT ATAACATCAT TCAAAAAGCG ATAGATGGCT TAACGTTGGA AGAAAAAATC GGCCAAATGC TGATGCCGGA TTTTCGCAAT TTCAATGGGA AGCCTGTCAC TGATATGCTT CCGGAAATTA AGGAACTCAT ACAGAAGTAT CATCTTGGCG GCATCATTTT ATTCCGTGAA AATGTGGTGA CGACGGAGCA GACGGCAAAG CTCGTCGCGG ACTATCAGCA GGCAGCTGAA AAGTTCGGTT TGCTTATATC GATAGACCAG GAAGGCGGCA TTGTGACGCG GCTGCAGTCA GGGACAAATA TGCCGGGAAA TATGGCATTA GGGGCGACAC GATCGAAAGA GCTTGCCTGG AAAGTAGGGC ATGCGATTGG AGAAGAATTA CATGCGCTGG GAATTAACAT GAACTTGGCA CCTGTATTGG ACGTAAACAA TAATCCGGAT AATCCGGTGA TCGGCGTTCG TTCGTTTGGA GAAAATCCAG AGCTTGTCGC GGAACTAGGG GTATCTTATA TCAAAGGATT GCAAGACAGC GGAGTTGCCG CTACAGCCAA ACATTTCCCA GGACATGGAG ATACCGCTGT CGATTCCCAT CTTGGGTTGC CGGAAGTATC GCACGATCGC GACCGGCTTC AAAAAGTCGA ACTGTATCCG TTTCAAAAAG CAATGGAAGC GGGAATTGAC GCGGTTATGA CCGCTCATGT CACATTCCCG AAAGTCGATG ATACAAAGGT GATTTCGCAA AAAGACGGCA CGGCAATTTC GCTTCCTGCT ACTTTGTCGT ATAAAGTGCT AACCGAGTTA ATGCGTGAAG AAATGGGCTT TGACGGTGTC ATTATTACCG ATGCGATGAA TATGAAAGCG ATTAGCGACC ACTTTGGACC GGTGGATGCG GCGGTAAGAG CGGTTCAAGC TGGTGCTGAT ATTGTACTCA TGCCGGTGGG GCTGGAAGAA GTGGCAAACG GATTAAAGAA AGCCGTGCAA AACGGTGGCA TTTCGCAAGA ACGCATCAAC GCCTCTGTAA AGCGAATTTT AACGTTAAAA GTGAAACGCG GCATTTTTAA ACAAGAAACA CCGCCTGATA TGCAGCAAAT CATCGACCAA GCATTGCGTG TTGTCGGTTC GCAAGAGCAT AAACAAGTGG AAGCCGAGGC AGCGGCTAAA TCGATTACAC TAGTGAAAAA CGACCATGTA TTGCCGCTTC AAACAAGCAA AGTGAACAAT CTCGTCGTTG TAGGAAATAC GATGATTGCA TCATTAGAGA AAGCAGTGCG AACCTATGTT CCAAACGCGG CTGTGATCCA AGCGTCTGCT CCTTTAAATG AGGCGCAGCT GCAAAAAGTG AAAGAAGCAG ATGCGGTCAT TGTCGGCACG 
TATACATTGA ATGTATCAGG AAGGCTGCCT TCTAGCCCGC AAATGAAAAT GGTGCAGCAA ATTTTCGCAA ATACGGATGC TCCTGTCATC GCAATCGGGA TTCGCAATCC GTATGATGTG ATGGCCTATC CGCATGTGGA TGCCTATCTT GCGCAGTATG GTTTTCAGCA GGCAAGTTTC CAAGCGGCAG TGGATACGAT TTTCGGAGTA AATACACCGA CTGGCAAACT TCCAGTAACC ATTCCGAGTT ACGATGGCGG CGTGCTGTAC GAATACGGCC ATGGTCTCAC TTACTAA
|
Protein sequence | MLSFYKKITA AVIAVMMLFV PWASPSANAE ETTADLIIYQ NIPEVDVKIN GNQLSLRALH VHQEGYFTEA TEGIRWKSSN KTVAAVDDHG VVTFFGKNGR TFITVTDGKR KDRIAFKVKT APASQKNSKQ NMKVTVVKEK GSRYNIIQKA IDGLTLEEKI GQMLMPDFRN FNGKPVTDML PEIKELIQKY HLGGIILFRE NVVTTEQTAK LVADYQQAAE KFGLLISIDQ EGGIVTRLQS GTNMPGNMAL GATRSKELAW KVGHAIGEEL HALGINMNLA PVLDVNNNPD NPVIGVRSFG ENPELVAELG VSYIKGLQDS GVAATAKHFP GHGDTAVDSH LGLPEVSHDR DRLQKVELYP FQKAMEAGID AVMTAHVTFP KVDDTKVISQ KDGTAISLPA TLSYKVLTEL MREEMGFDGV IITDAMNMKA ISDHFGPVDA AVRAVQAGAD IVLMPVGLEE VANGLKKAVQ NGGISQERIN ASVKRILTLK VKRGIFKQET PPDMQQIIDQ ALRVVGSQEH KQVEAEAAAK SITLVKNDHV LPLQTSKVNN LVVVGNTMIA SLEKAVRTYV PNAAVIQASA PLNEAQLQKV KEADAVIVGT YTLNVSGRLP SSPQMKMVQQ IFANTDAPVI AIGIRNPYDV MAYPHVDAYL AQYGFQQASF QAAVDTIFGV NTPTGKLPVT IPSYDGGVLY EYGHGLTY
|
| |
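The coordinate and length fields above are related by simple arithmetic: with 1-based inclusive coordinates, the gene length is End bp minus Start bp plus one, and the protein length is the number of codons minus the untranslated stop codon. The following is a minimal Python sketch of that consistency check, plus a GC-percentage helper that ignores the spaces used to group the sequence into blocks of ten. The function names are hypothetical and are not part of the database; the sketch assumes an unspliced CDS on a single replicon, as this record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp for 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string, skipping spaces and other non-ACGT characters."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Values taken from the record above.
    length = gene_length(1436998, 1439094)
    print(length)                  # 2097, matching "Gene Length"
    print(protein_length(length))  # 698, matching "Protein Length"
```

To check the reported GC content, the full gene sequence string can be passed to `gc_percent` as it appears in the record, spaces included.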