Gene Mmcs_1064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_1064 
Symbol 
ID4109903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp1159889 
End bp1161409 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content68% 
IMG OID638030187 
Productmethylmalonate-semialdehyde dehydrogenase [acylating] 
Protein accessionYP_638234 
Protein GI108798037 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAACC AGATCCAGCA CTTCATCAAC GGCAAGCGCA CCGCAGGTGA GTCCACCCGC 
ACGGCCGACG TGATGAACCC GAGCACGGGG GCGGTGCAGG CCCAGGTCCT GCTCGGCTCG
CGCGCCGACG TCGACGCCGC GGTGGCCGGC GCCGCCGAGG CCCAGAAGGA GTGGGCGGCG
TGGAACCCGC AGCGCCGCGC CCGCGTCATG ATGCGCTTCA TCGAACTGGT CAACCAGCAC
ATGGACGAGT TGGCCGAACT GCTGTCGATC GAACACGGCA AGACCGTCCC CGACGCCAAG
GGCGACATCC AGCGCGGTAT CGAGGTCATC GAGTTCGCGA TCGGCATCCC CCACCTGATC
AAGGGTGAGT ACACCGAGGG CGCGGGCACC GGCATCGACG TCTACTCGAT GCGCCAGCCG
CTCGGCGTCG TCGCCGGCAT CACACCCTTC AACTTCCCGG CGATGATCCC GCTGTGGAAG
GCCGGCCCCG CGCTGGCCTG CGGTAACGCC TTCATCCTCA AGCCGTCCGA GCGCGACCCC
TCGGTCCCGG TCCGGCTGGC CGAATTGTTC ATCGAGGCCG GGCTACCCGC GGGGGTGTTC
CAGGTCGTGC ACGGCGACAA AGAGGCCGTC GACGCGATCC TCGAGCATCC GGTGATCCAG
GCCGTCGGCT TCGTCGGCAG CTCCGACATC GCCCAGTACA TCTACGCCGG CGCGACGGCC
AACGGGAAGC GCGCGCAGTG CTTCGGCGGC GCGAAGAACC ACATGATCGT GATGCCCGAC
GCCGATCTCG ACCAGGCCGT CGACGCGCTG ATCGGCGCCG GCTACGGCAG CGCCGGCGAG
CGCTGCATGG CGATCAGCGT GGCGGTGCCC GTCGGCAAGG AGACCGCGGA CCGGCTGCGC
AACCGGCTGG TCGAGCGGGT CAACAACCTG CGCGTCGGCC ACAGCCTCGA CCCGAAGGCC
GATTACGGCC CACTGGTCAC CGAGGCCGCG CTCAACCGGG TCCGCGACTA CATCAACCAG
GGCGTCGAGG CGGGCGCCGA GGCCGTCGTC GACGGGCGCG AGCGTTCCAG CGACGAGATG
CAGTTCGGCG ACGACAGCCT CGAGGGCGGC TACTTCATCG GCCCCACGCT GTTCGACCAC
GTCACCCCGG ACATGTCGAT CTACACCGAC GAGATCTTCG GCCCGGTGCT GTGCATCGTG
CGCGCCGACA ACTACGAAGA GGCACTGCGC CTGCCCACCG AGCACGAATA CGGCAACGGC
GTGGCGATCT TCACCCGCGA CGGCGACACC GCACGCGACT TCGTCGCCAA GGTCCAGGTC
GGCATGGTCG GGGTCAACGT CCCGATCCCG GTTCCGGTGT CGTACCACAC CTTCGGCGGC
TGGAAGCGTT CCGGCTTCGG CGACCTCAAC CAGCACGGGC CGCACTCGAT CCTGTTCTAC
ACCAAGACCA AGACCGTCAC GCAGCGCTGG CCGTCGGGCA TCAAGGATGG CGCCGAATTC
GTCATCCCCA CGATGAAGTA G
 
Protein sequence
MTNQIQHFIN GKRTAGESTR TADVMNPSTG AVQAQVLLGS RADVDAAVAG AAEAQKEWAA 
WNPQRRARVM MRFIELVNQH MDELAELLSI EHGKTVPDAK GDIQRGIEVI EFAIGIPHLI
KGEYTEGAGT GIDVYSMRQP LGVVAGITPF NFPAMIPLWK AGPALACGNA FILKPSERDP
SVPVRLAELF IEAGLPAGVF QVVHGDKEAV DAILEHPVIQ AVGFVGSSDI AQYIYAGATA
NGKRAQCFGG AKNHMIVMPD ADLDQAVDAL IGAGYGSAGE RCMAISVAVP VGKETADRLR
NRLVERVNNL RVGHSLDPKA DYGPLVTEAA LNRVRDYINQ GVEAGAEAVV DGRERSSDEM
QFGDDSLEGG YFIGPTLFDH VTPDMSIYTD EIFGPVLCIV RADNYEEALR LPTEHEYGNG
VAIFTRDGDT ARDFVAKVQV GMVGVNVPIP VPVSYHTFGG WKRSGFGDLN QHGPHSILFY
TKTKTVTQRW PSGIKDGAEF VIPTMK