Gene Mflv_5010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_5010 
Symbol 
ID4976321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp5335188 
End bp5336708 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content68% 
IMG OID640459237 
Productmethylmalonate-semialdehyde dehydrogenase 
Protein accessionYP_001136264 
Protein GI145225586 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.352611 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACAC GCATCCCCCA CTTCATCGAT GGAAAGCGCA GCGAGCTGGC CTCCACGCGC 
ACCGCCGGCG TGCTGAACCC GAGCACCGGC GAAGTTCAGT CCGAGGTGCT GCTCGCCAGC
GCCGCCGACG TCGACACCGC GGTGGCCTCC GCGGTCGAGG CGCAGAAGGA GTGGGCGGCG
TGGAACCCGC AGCGCCGCGC CCGCGTGTTC ATGAAGTTCA TCCAGCTTGT CAATGATCAC
GTCGACGAGC TCGCCGAGCT GCTGTCCATC GAGCACGGCA AGACCGTCGC CGACTCCAAG
GGCGACATCC AGCGCGGCAT CGAGGTCATC GAGTTCGCCA TCGGCATCCC GCACCTGCTC
AAGGGCGAGT TCACCGAGAA CGCCGGCACC GGCATCGACG TCTACTCGAT CCGCCAGCCC
CTCGGCGTCG TCGCCGGCAT CACCCCGTTC AACTTCCCCG CGATGATCCC GCTGTGGAAG
GCCGGCCCCG CGCTGGCATG CGGAAACGCG TTCATCCTCA AGCCTTCCGA GCGCGACCCG
TCGGTGCCGC TGCGGCTGGC CGAGCTGTTC CTCGAAGCCG GCCTGCCCGC GGGCGTCTTC
CAGGTCGTCC AGGGTGACAA GGAAGCCGTC GACGCGATCC TGACCCACCC CGACATCCAG
GCCGTCGGCT TCGTCGGGTC CTCCGACATC GCGCAGTACA TCTACTCGAC CGCGGCCGCC
CACGGCAAGC GCTCACAGTG CTTCGGCGGC GCGAAGAACC ACATGATCAT CATGCCCGAC
GCCGACCTCG ACCAGGCCGT CGACGCACTC ATCGGCGCCG GCTACGGCAG CGCCGGCGAG
CGCTGCATGG CCATCAGCGT CGCCGTCCCC GTCGGCGAGG AAACCGCCAA CCGCCTCCGT
AATCGCCTGG TGGAGCGCGT CAACCAGCTC CGCGTGGGCC ACAGCCTCGA CCCGAAGGCC
GACTACGGCC CGCTGGTGAC CGGCGCCGCA CTCGAGCGGG TCCGCGACTA CATCGGCCAG
GGCGTCGAGG CCGGCGCCGA ACTCGTCGTC GACGGCCGCG AGCGCGCCAC CGACGAACTG
AGCTTCGACG ACCAGGACCT GTCGAAGGGC TACTTCATCG GCCCCACCCT GTTCGACCAC
GTCACCACCG ACATGTCGAT CTACACCGAC GAGATCTTCG GCCCCGTGCT GTGCATCGTG
CGCGCCGCCG ACTACGACGA AGCACTGAGC CTGCCCACCA AGCACGAATA CGGCAACGGT
GTCGCGATCT TCACCCGCGA CGGCGACGCC GCCCGCGACT TCGTGTCCAA GGTCCAGGTC
GGCATGGTCG GCGTCAACGT CCCGATCCCG GTGCCCGTCG CCTACCACAC CTTCGGCGGC
TGGAAGCGCT CCGGCTTCGG TGACCTCAAC CAGCACGGCC CGGCCTCGAT CCAGTTCTAC
ACCAAGGTCA AGACCGTCAC CGAGCGCTGG CCCTCGGGCA TCAAGGATGG CGCCGAGTTC
GTCATCCCGA CGATGAAATA G
 
Protein sequence
MTTRIPHFID GKRSELASTR TAGVLNPSTG EVQSEVLLAS AADVDTAVAS AVEAQKEWAA 
WNPQRRARVF MKFIQLVNDH VDELAELLSI EHGKTVADSK GDIQRGIEVI EFAIGIPHLL
KGEFTENAGT GIDVYSIRQP LGVVAGITPF NFPAMIPLWK AGPALACGNA FILKPSERDP
SVPLRLAELF LEAGLPAGVF QVVQGDKEAV DAILTHPDIQ AVGFVGSSDI AQYIYSTAAA
HGKRSQCFGG AKNHMIIMPD ADLDQAVDAL IGAGYGSAGE RCMAISVAVP VGEETANRLR
NRLVERVNQL RVGHSLDPKA DYGPLVTGAA LERVRDYIGQ GVEAGAELVV DGRERATDEL
SFDDQDLSKG YFIGPTLFDH VTTDMSIYTD EIFGPVLCIV RAADYDEALS LPTKHEYGNG
VAIFTRDGDA ARDFVSKVQV GMVGVNVPIP VPVAYHTFGG WKRSGFGDLN QHGPASIQFY
TKVKTVTERW PSGIKDGAEF VIPTMK