Gene pE33L466_0301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagpE33L466_0301 
SymboliolA 
ID3399735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_007103 
Strand
Start bp299708 
End bp301171 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content40% 
IMG OID637660125 
Productmethylmalonate-semialdehyde dehydrogenase (acylating) 
Protein accessionYP_245789 
Protein GI67078169 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGTAC AAACAGCACA AATTGTAAAA AACTACATTG GCGGCGAATG GGTAGAATCC 
ATTTCAACTA AGATGGAAGC TGTATATAAT CCTGCAACAG GAGAAGTAAT CGCTCAAGTA
CCACTTTCAA CAAAAGTAGA TGTTGAACAA GCTGTGCTAG CAGCAAATGA AGCATTCAAA
TCTTGGTCTA AAACGGCTGT ACCAAAACGC GCTCGTATTC TATTTAAATA TCAACAATTA
CTAGTAGATA ACTGGGAAGA TTTAGCGAAA CTGATTACGA TTGAAAACGG AAAAAGCTAT
AACGAGGCTT ACGGTGAAGT TCTTCGTGGT ATTGAGTGTG TGGAGTTTGC TGCAGGTGCT
CCTACATTAA TGATGGGAAA ACAACTTCCT GATATTGCAA CAGGTATTGA GTCTGGTATG
TATCGTTACC CAATTGGTGT TATAGGCGGG ATTACACCTT TTAACTTCCC AATGATGGTT
CCGTGCTGGA TGTTCCCACT TGCGATTGCT TGTGGTAATA CATTTGTGTT AAAACCTTCA
GAGCGTACAC CACTTCTAGC AGCAAAATTA GTGGAACTAG CTGAAGAAGC TGGTTTACCG
AAAGGCGTTT TAAATATCGT AAATGGAGCT CATGATGTAG TAAACGGTCT TCTTGAACAC
AAATTAGTGA AGGCAATTTC ATTCGTAGGT TCTCAGCCAG TTGCAGAATA TGTATACAAA
AAAGGAACAG AAAACTTAAA ACGCGTTCAA GCATTAGCGG GTGCGAAAAA CCATTCCATT
GTATTAAGTG ATGCGAATCT TGAACTAGCA ACAAAGCAAA TTATTAGTGC TGCATTCGGC
TCAGCTGGTG AGCGTTGTAT GGCTGCTTCT GTTGTAACAG TACAAGAAGA AATTGCAGAT
CAATTAGTTG GAAGACTAGT AGAAGAAGCA AACAAAATTG TAATTGGCAA TGGTCTTGAT
GAAGATGTAT TTTTAGGACC AGTTATTCGC GATAACCATA AAGAGCGCAC AATTGGTTAC
ATCGATTCAG GTGTAGAACA GGGCGCTACA TTAGTTCGTG ATGGACGCGA AGATACAGCT
GTAAAAGGAG CTGGTTACTT CGTTGGCCCA ACAATTTTTG ACCATGTTAC ACAAGAAATG
AAAATCTGGC AAGATGAGAT TTTTGCTCCT GTTTTATCTA TTGTTCGTGT GAAATCATTG
GATGAAGCAA TTGAAATTGC GAATGAGTCT CGATTTGCAA ATGGGGCTTG CATTTATACA
GATAGCGGAG CAAGTGTACG TCAATTCCGT GAAACAATTG AATCCGGTAT GTTAGGTGTG
AATGTTGGGG TTCCGGCCCC AATGGCATTC TTCCCGTTCT CTGGATGGAA AGATTCGTTC
TATGGTGACC TTCATGCGAA TGGTACAGAT GGCGTTGAGT TTTATACAAG AAAGAAAATG
CTTACATCTC GTTGGGAGAA GTAA
 
Protein sequence
MTVQTAQIVK NYIGGEWVES ISTKMEAVYN PATGEVIAQV PLSTKVDVEQ AVLAANEAFK 
SWSKTAVPKR ARILFKYQQL LVDNWEDLAK LITIENGKSY NEAYGEVLRG IECVEFAAGA
PTLMMGKQLP DIATGIESGM YRYPIGVIGG ITPFNFPMMV PCWMFPLAIA CGNTFVLKPS
ERTPLLAAKL VELAEEAGLP KGVLNIVNGA HDVVNGLLEH KLVKAISFVG SQPVAEYVYK
KGTENLKRVQ ALAGAKNHSI VLSDANLELA TKQIISAAFG SAGERCMAAS VVTVQEEIAD
QLVGRLVEEA NKIVIGNGLD EDVFLGPVIR DNHKERTIGY IDSGVEQGAT LVRDGREDTA
VKGAGYFVGP TIFDHVTQEM KIWQDEIFAP VLSIVRVKSL DEAIEIANES RFANGACIYT
DSGASVRQFR ETIESGMLGV NVGVPAPMAF FPFSGWKDSF YGDLHANGTD GVEFYTRKKM
LTSRWEK