Gene GWCH70_2030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2030 
Symbol 
ID7978983 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2088120 
End bp2089439 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content50% 
IMG OID644798852 
Productpeptidase M42 family protein 
Protein accessionYP_002950022 
Protein GI239827398 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAAAT GGACAAAACT CATGATTCGG CACGGATTTC ATCCGGAAGA TGAGGAAACA 
GGAATTACAT CACAATGGCT TGCGCAAACA TTCGCAGCGC TTATCCAACG GCACCAGCAC
CTTGACACCA TTTCCGAAAC GGACTGGATG GAAGCGCTCC AACAAACCGC CAAGCAAATT
GTTTTTTACA ACGATGACAT TCCAGGAAGA GAAACGCTGG TCGACCCGAC GAAAAACGAA
CTTCCTCTCA TTCAAATCGA TCCGTACGTC CGCGGCATCG TCCGCTGGCT GAACATGATG
CATATTTACA CCGTTTACAG CTGCGACGGA GAGGGAGTTC GCCCGGCAAC GATTTATTTT
CTCGAAGACT TATCCGCCCA GCAGCTCGCC ATCATCCGGG CTTGCACTCC GCCACACGTT
CGAATTAGAG CGAAAAAAAG AAAAGTAACA TTATTTTATC AGCGCGGACA CATCGATGAC
CTTTTAACGA TGGCCGAACG GTTATACAAC GTCTGGCGAA ATCCAGAGCT GCTAACGACG
TACCGTTTAG AAACATTCAA ACACCGCCTA TATTCCCTTC TTTCCATCAA CGGAAGAAGC
GGCAGGGAAA CGATGATTCG GCAAATGCTC TATCGAAAGC TCCAACAAAA AACCGATTGG
TGCCAAATCG ATGCTTACGG AAACTTGCTT GCCGCGGTTT ATTGCGGAAA CGGCCCGACG
ATTCTGCTTT CCGCCCATAT GGATACGGTT CGCCCGTTTT CACCGAAACG TACGATTATC
GAAAGCGGAA CCGTACTAAG CAGCTCGCGC GGCATCTTAG GCGCCGACGA CCGCGCGGGA
ATCGCGGTCA TCTTAGAAAT ACTTGATTTC ATTCGCCATT CCCGCTTCCA AGGAACGCTG
AAAATCGCCT TTACCGTCGA GGAAGAAATC GGCTGCCTCG GCTCGCGTAA CATCGACCCA
ACATTTTTGC AAGACGTCGA CGCCGCGATT GTTGTAGACC GCCGCGGAAC GCGCGATATC
GTCACTTCTT ACGCCGGCAT CGTGCCGTTT TGCACCGATG AATACGGCCG CATTTTCGAA
ACAGCCGGAG CGCTCGCCGG CATGCCCGAC TGGAAAATGA CCCATGGCGG ACTAAGCGAC
GCCAAAGTCT TCGCCGAATT CGGCATTCCA TCTGTCAACT TATCCGTCGG CTACGAGCAC
GAACATACCG AATTCGAAAC GCTCGACTAC AAAGCAACTC TTGAAACGGT GATGTTACTT
GAAACGGCAT TTGAAAACAA TATGATTACA GAAGAACTAG TCGTCACGTA TAAGTGTTAG
 
Protein sequence
MEKWTKLMIR HGFHPEDEET GITSQWLAQT FAALIQRHQH LDTISETDWM EALQQTAKQI 
VFYNDDIPGR ETLVDPTKNE LPLIQIDPYV RGIVRWLNMM HIYTVYSCDG EGVRPATIYF
LEDLSAQQLA IIRACTPPHV RIRAKKRKVT LFYQRGHIDD LLTMAERLYN VWRNPELLTT
YRLETFKHRL YSLLSINGRS GRETMIRQML YRKLQQKTDW CQIDAYGNLL AAVYCGNGPT
ILLSAHMDTV RPFSPKRTII ESGTVLSSSR GILGADDRAG IAVILEILDF IRHSRFQGTL
KIAFTVEEEI GCLGSRNIDP TFLQDVDAAI VVDRRGTRDI VTSYAGIVPF CTDEYGRIFE
TAGALAGMPD WKMTHGGLSD AKVFAEFGIP SVNLSVGYEH EHTEFETLDY KATLETVMLL
ETAFENNMIT EELVVTYKC