Gene Mpe_A3524 details

Gene Information

Locus tag: Mpe_A3524
Symbol:
ID: 4786227
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3734536
End bp: 3736056
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 69%
IMG OID: 640092105
Product: methylmalonate-semialdehyde dehydrogenase [acylating]
Protein accession: YP_001022712
Protein GI: 124268708
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01722] methylmalonic acid semialdehyde dehydrogenase
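
The start, end, gene length, and protein length fields can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are inclusive and that the reported protein length excludes the terminal stop codon. Both are assumptions about how the record was built, not something the page states, and the variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 3734536
end_bp = 3736056

# Assumption: coordinates are inclusive, so the span is end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1521 506
```

Under these assumptions the computed values agree with the reported 1521 bp and 506 aa.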


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.132108
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.367269
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGCCC CCGAAGCTTT CACCGCCACC GCCGAGATCG GCCATTTCAT CGGCGGACGT 
GCCGTGCCCA GCACGAGCGG GCGCCGTCAG GCGGTCTACA ACCCGGCCAC CGGCGCCGTC
GCGCGCCAGG TCGCGCTGGC CTCGGCCGAC GAGGTGAACG CTGCGGTCGC CGCCGCCCAG
GCCGCGTTCC CGGCCTGGGC CGACACGCCG CCGCTGCGGC GCGCTCGGGT GCTGAACAAG
TTCCTGCAGC TGCTCAACGA GCAGCGCGAC ACGCTGGCCG CGATGATCAC CGCCGAGCAC
GGCAAGGTGT TCACCGACGC GCAGGGCGAG GTCACGCGCG GCATCGAGAT CGTCGAGTTC
GCCTGCGGCG CGCCGCAGCT GCTGAAGACC GACTTCACCG ACCAGGTCGG CACGGGCATC
GACAATTGGG TGCTGCGCCA GCCGCTGGGC GTGGTGGCCG GCATCACGCC GTTCAACTTC
CCGGTCATGG TGCCGATGTG GATGTTCCCG ATGGCCATCG CCACCGGCAA CAGCTTCGTG
CTCAAGCCCA GCGAGCGCGA CCCCAGCCCC AGCCTCTTCA TCGCCGAGCT GCTGAAGCAG
GCCGGCCTGC CCGACGGCGT GTTCAACGTC GTGCAGGGCG ACAAGCTGGC GGTCGACACG
CTGCTGACCC ACCCCGACGT GAAGGCGGTG AGCTTCGTCG GCTCGACCCC GATCGCGCAG
TACATCTACG AGACCGGCGC GAAGCACGGC AAGCGCGTGC AGGCGCTTGG CGGTGCGAAG
AACCACATGG TGGTGATGCC CGACGCCGAC CTGGAGCAGA GCGTCGACGC GCTGATCGGC
GCGGCCTACG GCTCGGCCGG CGAGCGCTGC ATGGCGATCT CGGTGGCGGT GCTGGTGGGC
GACGTGGCCG ACCAGATCGT GCCGAAGCTC GCCGAGCGCG CCAAGGCGCT GAAGGTCAAG
AACGGCATGG AGCTCGACGC CGAGATGGGC CCGATCGTCA CGCCGCAGGC GCTCGAGCGC
ATCGAAGGCT ACATCGCGCA CGGCGTCGAC GAGGGCGCGA AGCTGGTGGT CGACGGCCGC
GGCCTGAAGG TGCCCGGTCA CGAGCAGGGC TTCTTCACCG GCGGCACGCT GTTCGATCAC
GTGACGCCCG AGATGAAGAT CTACAAGGAA GAGATCTTCG GCCCGGTGCT GGCCTGCGTG
CGCGTGCCCG ACTTCGCCAG CGCAGTGGCC CTGGTCAATG CGCACGAGTT CGGCAACGGC
GTCGCCTGCT TCACGCGCGA CGGTCACGTG GCGCGCGAGT TCTCGCGCCG CATCCAGGTC
GGCATGGTCG GCATCAACGT GCCGATCCCG GTGCCGATGG CCTGGCACGG CTTCGGCGGC
TGGAAGAAGA GCCTGTTCGG CGACATGCAC GCCTACGGCG AGGAGGGCGT GCGCTTCTAC
ACGAAGCAGA AGTCGGTGAT GCAGCGCTGG CCCGAGAGCA CGCCCAAGGG CGCCGAGTTC
GTGATGCCGA CGTCGAAGTA G
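
The gene sequence is laid out in space-separated blocks of ten bases, so it has to be cleaned before its length or GC content can be recomputed. The following is a minimal sketch of one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demonstration uses only the first line of the sequence, so its GC fraction is not expected to match the whole-gene figure in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, used only as a small demonstration.
first_line = "ATGGGCGCCC CCGAAGCTTT CACCGCCACC GCCGAGATCG GCCATTTCAT CGGCGGACGT"
seq = clean_sequence(first_line)
print(len(seq))               # prints: 60
print(gc_fraction(seq[:10]))  # prints: 0.8
```

Passing the full sequence text through the same two helpers would give the whole-gene length and GC fraction for comparison with the Gene Length and GC content fields.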
 
Protein sequence
MGAPEAFTAT AEIGHFIGGR AVPSTSGRRQ AVYNPATGAV ARQVALASAD EVNAAVAAAQ 
AAFPAWADTP PLRRARVLNK FLQLLNEQRD TLAAMITAEH GKVFTDAQGE VTRGIEIVEF
ACGAPQLLKT DFTDQVGTGI DNWVLRQPLG VVAGITPFNF PVMVPMWMFP MAIATGNSFV
LKPSERDPSP SLFIAELLKQ AGLPDGVFNV VQGDKLAVDT LLTHPDVKAV SFVGSTPIAQ
YIYETGAKHG KRVQALGGAK NHMVVMPDAD LEQSVDALIG AAYGSAGERC MAISVAVLVG
DVADQIVPKL AERAKALKVK NGMELDAEMG PIVTPQALER IEGYIAHGVD EGAKLVVDGR
GLKVPGHEQG FFTGGTLFDH VTPEMKIYKE EIFGPVLACV RVPDFASAVA LVNAHEFGNG
VACFTRDGHV AREFSRRIQV GMVGINVPIP VPMAWHGFGG WKKSLFGDMH AYGEEGVRFY
TKQKSVMQRW PESTPKGAEF VMPTSK