Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3524 |
Symbol | |
ID | 4786227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 3734536 |
End bp | 3736056 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640092105 |
Product | methylmalonate-semialdehyde dehydrogenase [acylating] |
Protein accession | YP_001022712 |
Protein GI | 124268708 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01722] methylmalonic acid semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.132108 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.367269 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGCCC CCGAAGCTTT CACCGCCACC GCCGAGATCG GCCATTTCAT CGGCGGACGT GCCGTGCCCA GCACGAGCGG GCGCCGTCAG GCGGTCTACA ACCCGGCCAC CGGCGCCGTC GCGCGCCAGG TCGCGCTGGC CTCGGCCGAC GAGGTGAACG CTGCGGTCGC CGCCGCCCAG GCCGCGTTCC CGGCCTGGGC CGACACGCCG CCGCTGCGGC GCGCTCGGGT GCTGAACAAG TTCCTGCAGC TGCTCAACGA GCAGCGCGAC ACGCTGGCCG CGATGATCAC CGCCGAGCAC GGCAAGGTGT TCACCGACGC GCAGGGCGAG GTCACGCGCG GCATCGAGAT CGTCGAGTTC GCCTGCGGCG CGCCGCAGCT GCTGAAGACC GACTTCACCG ACCAGGTCGG CACGGGCATC GACAATTGGG TGCTGCGCCA GCCGCTGGGC GTGGTGGCCG GCATCACGCC GTTCAACTTC CCGGTCATGG TGCCGATGTG GATGTTCCCG ATGGCCATCG CCACCGGCAA CAGCTTCGTG CTCAAGCCCA GCGAGCGCGA CCCCAGCCCC AGCCTCTTCA TCGCCGAGCT GCTGAAGCAG GCCGGCCTGC CCGACGGCGT GTTCAACGTC GTGCAGGGCG ACAAGCTGGC GGTCGACACG CTGCTGACCC ACCCCGACGT GAAGGCGGTG AGCTTCGTCG GCTCGACCCC GATCGCGCAG TACATCTACG AGACCGGCGC GAAGCACGGC AAGCGCGTGC AGGCGCTTGG CGGTGCGAAG AACCACATGG TGGTGATGCC CGACGCCGAC CTGGAGCAGA GCGTCGACGC GCTGATCGGC GCGGCCTACG GCTCGGCCGG CGAGCGCTGC ATGGCGATCT CGGTGGCGGT GCTGGTGGGC GACGTGGCCG ACCAGATCGT GCCGAAGCTC GCCGAGCGCG CCAAGGCGCT GAAGGTCAAG AACGGCATGG AGCTCGACGC CGAGATGGGC CCGATCGTCA CGCCGCAGGC GCTCGAGCGC ATCGAAGGCT ACATCGCGCA CGGCGTCGAC GAGGGCGCGA AGCTGGTGGT CGACGGCCGC GGCCTGAAGG TGCCCGGTCA CGAGCAGGGC TTCTTCACCG GCGGCACGCT GTTCGATCAC GTGACGCCCG AGATGAAGAT CTACAAGGAA GAGATCTTCG GCCCGGTGCT GGCCTGCGTG CGCGTGCCCG ACTTCGCCAG CGCAGTGGCC CTGGTCAATG CGCACGAGTT CGGCAACGGC GTCGCCTGCT TCACGCGCGA CGGTCACGTG GCGCGCGAGT TCTCGCGCCG CATCCAGGTC GGCATGGTCG GCATCAACGT GCCGATCCCG GTGCCGATGG CCTGGCACGG CTTCGGCGGC TGGAAGAAGA GCCTGTTCGG CGACATGCAC GCCTACGGCG AGGAGGGCGT GCGCTTCTAC ACGAAGCAGA AGTCGGTGAT GCAGCGCTGG CCCGAGAGCA CGCCCAAGGG CGCCGAGTTC GTGATGCCGA CGTCGAAGTA G
|
Protein sequence | MGAPEAFTAT AEIGHFIGGR AVPSTSGRRQ AVYNPATGAV ARQVALASAD EVNAAVAAAQ AAFPAWADTP PLRRARVLNK FLQLLNEQRD TLAAMITAEH GKVFTDAQGE VTRGIEIVEF ACGAPQLLKT DFTDQVGTGI DNWVLRQPLG VVAGITPFNF PVMVPMWMFP MAIATGNSFV LKPSERDPSP SLFIAELLKQ AGLPDGVFNV VQGDKLAVDT LLTHPDVKAV SFVGSTPIAQ YIYETGAKHG KRVQALGGAK NHMVVMPDAD LEQSVDALIG AAYGSAGERC MAISVAVLVG DVADQIVPKL AERAKALKVK NGMELDAEMG PIVTPQALER IEGYIAHGVD EGAKLVVDGR GLKVPGHEQG FFTGGTLFDH VTPEMKIYKE EIFGPVLACV RVPDFASAVA LVNAHEFGNG VACFTRDGHV AREFSRRIQV GMVGINVPIP VPMAWHGFGG WKKSLFGDMH AYGEEGVRFY TKQKSVMQRW PESTPKGAEF VMPTSK
|
| |