Gene BCG9842_B3004 details


Gene Information

Locus tag: BCG9842_B3004
Symbol: mmsA1
ID: 7182116
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus G9842
Kingdom: Bacteria
Replicon accession: NC_011772
Strand:
Start bp: 2178664
End bp: 2180124
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 38%
IMG OID: 643550047
Product: methylmalonic acid semialdehyde dehydrogenase
Protein accession: YP_002445717
Protein GI: 218897306
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01722] methylmalonic acid semialdehyde dehydrogenase
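
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
# Sketch: consistency check on the coordinates in this record.
# Assumption: Start bp and End bp are 1-based, inclusive positions.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive coordinate range."""
    return end_bp - start_bp + 1

def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue

length = gene_length(2178664, 2180124)
print(length)                  # 1461
print(protein_length(length))  # 486
```

Both values agree with the Gene Length and Protein Length fields of the record.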


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.18708
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTACAA CTGAAATTAA ACGAGTAAAA AATCACATTA ACGGTGAGTG GGTTGAATCG 
ACAGGTACGG AAGTAGAAGC AGTTCCAAAT CCAGCGACTG GAAAAATTAT CGCTTACGTT
CCACTATCTC CAAAAGAAGA TGTTGAAAAA GCTGTTGAAG CGGCAAAAGC AGCATTTGAA
ACGTGGTCTA AAGTACCAGT TCCGAATCGT TCAAGAAATT TATATAAATA TTTACAGCTA
TTACAAGAAA ACAAAGATGA GCTTGCGAAA ATCATTACGC TAGAGAATGG TAAGACGCTA
ACGGATGCAA CAGGTGAAGT ACAGCGTGGT ATTGAAGCAG TGGAACTTGC AACATCGGCA
CCTAATTTAA TGATGGGTCA AGCGCTGCCG AATATTGCTA GTGGAATTGA TGGATCAATT
TGGCGCTACC CAATCGGAGT TGTTGCTGGT ATTACGCCGT TTAACTTCCC AATGATGATT
CCGTTATGGA TGTTCCCACT TGCAATTGCT TGCGGTAATA CATTCGTATT AAAAACATCG
GAAAGAACGC CACTTTTAGC GGAGCGACTT GTAGAATTAT TCTATGAAGC AGGTTTCCCA
AAAGGCGTAT TAAATTTAGT ACAAGGCGGA AAAGATGTTG TAAATAGCAT TTTAGAAAAT
AAAGATATTC AAGCTGTTTC GTTCGTCGGT TCTGAGCCAG TAGCTCGTTA CGTATATGAA
ACAGGTACGA AACACGGAAA ACGTGTACAA GCGTTAGCGG GTGCAAAAAA CCATGCGATT
GTAATGCCAG ATTGCAATCT TGAGAAAACA GTACAAGGTG TAATTGGATC TGCATTTGCA
AGTAGTGGAG AGCGCTGCAT GGCATGCTCA GTAGTAGCAG TAGTGGATGA AATTGCTGAT
GAATTCATTG ATGTATTAGT AGCAGAAACG AAAAAATTAA AAGTAGGCGA TGGCTTTAAC
GAAGATAACT ATGTTGGACC ATTAATTCGT GAATCTCATA AAGAGCGTGT TTTAGGCTAT
ATTAGTAGTG GTGTAGCAGA TGGGGCAACT TTATTAGTAG ATGGCCGTAA AATTAATGAA
GAAGTTGGAG AAGGTTATTT TGTAGGTGCG ACAATCTTTG ATGGCGTGAA TCAAGAAATG
AAAATTTGGC AAGATGAAAT TTTTGCTCCA GTATTAAGCA TTGTACGTGT TAAAGATTTA
GAAGAAGGTA TTAAACTAAC AAATCAATCT AAATTTGCAA ATGGTGCGGT TATTTATACG
TCAAATGGTA AACATGCACA AACATTCCGT GATAACATCG ATGCTGGTAT GATTGGTGTA
AATGTAAATG TTCCAGCACC AATGGCATTC TTCGCATTTG CAGGAAATAA AGCTTCATTC
TTTGGTGATT TAGGTACAAA TGGTACAGAT GGCGTTCAAT TCTATACACG TAAAAAAGTT
GTAACTGAGC GCTGGTTTTA A
 
Protein sequence
MITTEIKRVK NHINGEWVES TGTEVEAVPN PATGKIIAYV PLSPKEDVEK AVEAAKAAFE 
TWSKVPVPNR SRNLYKYLQL LQENKDELAK IITLENGKTL TDATGEVQRG IEAVELATSA
PNLMMGQALP NIASGIDGSI WRYPIGVVAG ITPFNFPMMI PLWMFPLAIA CGNTFVLKTS
ERTPLLAERL VELFYEAGFP KGVLNLVQGG KDVVNSILEN KDIQAVSFVG SEPVARYVYE
TGTKHGKRVQ ALAGAKNHAI VMPDCNLEKT VQGVIGSAFA SSGERCMACS VVAVVDEIAD
EFIDVLVAET KKLKVGDGFN EDNYVGPLIR ESHKERVLGY ISSGVADGAT LLVDGRKINE
EVGEGYFVGA TIFDGVNQEM KIWQDEIFAP VLSIVRVKDL EEGIKLTNQS KFANGAVIYT
SNGKHAQTFR DNIDAGMIGV NVNVPAPMAF FAFAGNKASF FGDLGTNGTD GVQFYTRKKV
VTERWF
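
The gene and protein sequences above are related by translation table 11, which assigns amino acids to codons the same way as the standard genetic code and differs only in its permitted start codons. The following is a minimal sketch of how the translation and the GC fraction could be recomputed from the 10-base grouped layout used here. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the code assumes an unambiguous A/C/G/T sequence read in frame from its first base.

```python
# Sketch: translate a CDS and compute its GC fraction.
# Assumption: the input contains only A, C, G, T plus whitespace.

BASES = "TCAG"
# Amino acids for codons in TCAG order (first, second, third base).
# Table 11 shares these assignments with the standard code.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def clean(dna: str) -> str:
    """Remove the spaces and line breaks used to group bases, and uppercase."""
    return "".join(dna.split()).upper()

def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)

# First seven codons and last four codons of the gene sequence above.
print(translate("ATGATTACAA CTGAAATTAA A"))  # MITTEIK
print(translate("CGCTGGTTTTA A"))            # RWF
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and the GC content field.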