Gene BCG9842_B0558 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B0558 
Symbol 
ID7183374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp4502354 
End bp4503439 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content41% 
IMG OID643552468 
Productpeptidase, M42 family 
Protein accessionYP_002448135 
Protein GI218899724 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000121832 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value5.5021500000000004e-18 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACAAAAT TAGACGCGAC ATTGACAATG CTAAAAGAAT TAACAGATGC ACGTGGTATT 
GCCGGTAACG AGCGTGAACC ACGCGAAGTA ATGAAGAAAT ATATCGAGCC ATTTGCAGAC
GAGCTTTCTA CTGATAATTT AGGAAGTTTA GTTGCGAAAA AAGTAGGGGA AGAAAACGGC
CCGAAAATTA TGGTTGCAGG TCATTTAGAT GAAGTTGGCT TTATGATTAC GCAAATTGAT
GACAAAGGTT TCCTACGCTT CCAAACGGTG GGTGGCTGGT GGTCACAAGT TATGCTTGCA
CAGCGCGTGA CGATTGTAAC GCGTAAAGGA GATGTAACAG GTGTAATTGG TTCAAAACCA
CCGCACATCT TACCTCCAGA AGCACGTAAA AAGCCAGTTG AAATTAAAGA CATGTTCATC
GATATCGGTG CTTCTAGCCA AGAAGAAGCA ATGGAGTGGG GCGTACGACC AGGAGATCAA
GTTGTACCTT ACTTTGAATT CCAAGTGATG AAGAATGAAA AAATGTTACT TGCAAAAGCA
TGGGATAACC GAATCGGTTG TGCGATTGCA ATTGACGTAT TAAAACAATT AAAAGATGAA
AAGCATCCAA ACGTTGTATA CGGCGTTGGA ACTGTACAAG AAGAAGTTGG TCTTCGTGGT
GCAAAAACAT CTGCGAATTA TATTAAACCA GATATCGCGT TCGCAGTAGA TGTTGGTATC
GCTGGAGATA CACCGGGTGT AACGTCAAAA GAAGCGCAAA GTAAAATGGG CGATGGACCA
CAAATCATTT TGTATGATGC TTCTGTTATT GGACATACAG GTTTACGTGA TTTCGTAGTT
GATGTTGCTG ACGAATTACA AATCCCATAC CAATATGATT CAGTAGCGGG CGGGGGAACT
GATGCGGGTG CGATTCATAT TGCTGTAAAC GGTATTCCTT CTATGGCAAT TACAATTGCA
ACGCGCTATA TTCATTCTCA TGCAGCAATG TTACACCGTG ATGATTATGA AAATGCAGTA
AAGTTAATTG TAGAAGTTAT TAAACGTCTT GATAAGGATG CTGTACATAA CATTACATTT
AATTAA
 
Protein sequence
MTKLDATLTM LKELTDARGI AGNEREPREV MKKYIEPFAD ELSTDNLGSL VAKKVGEENG 
PKIMVAGHLD EVGFMITQID DKGFLRFQTV GGWWSQVMLA QRVTIVTRKG DVTGVIGSKP
PHILPPEARK KPVEIKDMFI DIGASSQEEA MEWGVRPGDQ VVPYFEFQVM KNEKMLLAKA
WDNRIGCAIA IDVLKQLKDE KHPNVVYGVG TVQEEVGLRG AKTSANYIKP DIAFAVDVGI
AGDTPGVTSK EAQSKMGDGP QIILYDASVI GHTGLRDFVV DVADELQIPY QYDSVAGGGT
DAGAIHIAVN GIPSMAITIA TRYIHSHAAM LHRDDYENAV KLIVEVIKRL DKDAVHNITF
N