Gene GWCH70_2655 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2655 
Symbol 
ID7978316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2687755 
End bp2688840 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content51% 
IMG OID644799456 
ProductCellulase 
Protein accessionYP_002950615 
Protein GI239827991 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000026647 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAAAT TCGATGAAAC GTTGACGATG CTGAAGGATT TAACCGATGC GAGAGGCGTC 
CCTGGAAATG AACGGGAAGC GCGCGAAGTG ATGAAAAAGT ACATAGCTCC TTACGCCGAT
GAAGTAACGA CAGACGGTCT TGGCAGCTTG ATTGCGAAAA AGAAAGGAAC AGACGAAGGT
CCTAAAATTA TGATTGCCGG CCATTTGGAT GAAGTCGGCT TTATGGTGAC GCAAATCGAT
GACAAAGGAT TTATCCGCTT CCAAACGCTA GGCGGCTGGT GGAGCCAAGT CATGCTAGCG
CAGCGTGTCA CCATTTTAAC GCGTAAAGGA GAAATTACCG GCGTCATCGG TTCGAAACCG
CCCCACATTT TGCCGCCGGA AGCGCGCAAA AAGCCAGTCG AAATCAAAGA TATGTTCATC
GACATCGGCG CGACAAGCCG GGAAGAAGCA ATGGAATGGG GCGTGCGTCC GGGCGATTCG
ATCGTTCCGT ATTTTGAATT TACCGTGTTG AACAATGAAA AAATGCTGCT TGCGAAAGCC
TGGGACAACC GGATCGGCTG CGCGATTGCG ATTGAGGTAT TAAAGCAATT AAAAGATGTC
GATCACCCGA ACGTTGTCTA TGGCGTCGGC ACGGTGCAGG AAGAAGTCGG TTTGCGCGGA
GCGAGAACGG CGGCGCATTT CATCCAGCCG GATATCGCGT TTGCCGTGGA TGTCGGCATT
GCCGGCGATA CGCCGGGAGT TTCCGAAAAA GAAGCGATGG GCAAGCTTGG CGCTGGGCCG
CATATCGTCT TATACGACGC AACAATGGTG TCGCATCGCG GTTTGCGCGA ATTTGTCATC
GATGTTGCCG AAGAACTCAA CATTCCGTAT CATTTTGATG CAATGCCAGG CGGCGGCACT
GACGCCGGCG CGATTCACTT AACGGCAAGC GGCGTGCCGT CGCTGACGAT CGCGATTCCA
ACGCGCTACA TCCATTCCCA TGCGGCGATT TTGCACCGTG ACGATTACGA AAATACGGTA
AAATTGCTTG TCGAAGTCAT CAAGCGCTTA GACGCGGAAA AAGTAAAACA AATTACATTC
GAATAA
 
Protein sequence
MAKFDETLTM LKDLTDARGV PGNEREAREV MKKYIAPYAD EVTTDGLGSL IAKKKGTDEG 
PKIMIAGHLD EVGFMVTQID DKGFIRFQTL GGWWSQVMLA QRVTILTRKG EITGVIGSKP
PHILPPEARK KPVEIKDMFI DIGATSREEA MEWGVRPGDS IVPYFEFTVL NNEKMLLAKA
WDNRIGCAIA IEVLKQLKDV DHPNVVYGVG TVQEEVGLRG ARTAAHFIQP DIAFAVDVGI
AGDTPGVSEK EAMGKLGAGP HIVLYDATMV SHRGLREFVI DVAEELNIPY HFDAMPGGGT
DAGAIHLTAS GVPSLTIAIP TRYIHSHAAI LHRDDYENTV KLLVEVIKRL DAEKVKQITF
E