Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2655 |
Symbol | |
ID | 7978316 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2687755 |
End bp | 2688840 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 644799456 |
Product | Cellulase |
Protein accession | YP_002950615 |
Protein GI | 239827991 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000026647 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAAAT TCGATGAAAC GTTGACGATG CTGAAGGATT TAACCGATGC GAGAGGCGTC CCTGGAAATG AACGGGAAGC GCGCGAAGTG ATGAAAAAGT ACATAGCTCC TTACGCCGAT GAAGTAACGA CAGACGGTCT TGGCAGCTTG ATTGCGAAAA AGAAAGGAAC AGACGAAGGT CCTAAAATTA TGATTGCCGG CCATTTGGAT GAAGTCGGCT TTATGGTGAC GCAAATCGAT GACAAAGGAT TTATCCGCTT CCAAACGCTA GGCGGCTGGT GGAGCCAAGT CATGCTAGCG CAGCGTGTCA CCATTTTAAC GCGTAAAGGA GAAATTACCG GCGTCATCGG TTCGAAACCG CCCCACATTT TGCCGCCGGA AGCGCGCAAA AAGCCAGTCG AAATCAAAGA TATGTTCATC GACATCGGCG CGACAAGCCG GGAAGAAGCA ATGGAATGGG GCGTGCGTCC GGGCGATTCG ATCGTTCCGT ATTTTGAATT TACCGTGTTG AACAATGAAA AAATGCTGCT TGCGAAAGCC TGGGACAACC GGATCGGCTG CGCGATTGCG ATTGAGGTAT TAAAGCAATT AAAAGATGTC GATCACCCGA ACGTTGTCTA TGGCGTCGGC ACGGTGCAGG AAGAAGTCGG TTTGCGCGGA GCGAGAACGG CGGCGCATTT CATCCAGCCG GATATCGCGT TTGCCGTGGA TGTCGGCATT GCCGGCGATA CGCCGGGAGT TTCCGAAAAA GAAGCGATGG GCAAGCTTGG CGCTGGGCCG CATATCGTCT TATACGACGC AACAATGGTG TCGCATCGCG GTTTGCGCGA ATTTGTCATC GATGTTGCCG AAGAACTCAA CATTCCGTAT CATTTTGATG CAATGCCAGG CGGCGGCACT GACGCCGGCG CGATTCACTT AACGGCAAGC GGCGTGCCGT CGCTGACGAT CGCGATTCCA ACGCGCTACA TCCATTCCCA TGCGGCGATT TTGCACCGTG ACGATTACGA AAATACGGTA AAATTGCTTG TCGAAGTCAT CAAGCGCTTA GACGCGGAAA AAGTAAAACA AATTACATTC GAATAA
|
Protein sequence | MAKFDETLTM LKDLTDARGV PGNEREAREV MKKYIAPYAD EVTTDGLGSL IAKKKGTDEG PKIMIAGHLD EVGFMVTQID DKGFIRFQTL GGWWSQVMLA QRVTILTRKG EITGVIGSKP PHILPPEARK KPVEIKDMFI DIGATSREEA MEWGVRPGDS IVPYFEFTVL NNEKMLLAKA WDNRIGCAIA IEVLKQLKDV DHPNVVYGVG TVQEEVGLRG ARTAAHFIQP DIAFAVDVGI AGDTPGVSEK EAMGKLGAGP HIVLYDATMV SHRGLREFVI DVAEELNIPY HFDAMPGGGT DAGAIHLTAS GVPSLTIAIP TRYIHSHAAI LHRDDYENTV KLLVEVIKRL DAEKVKQITF E
|
| |