Gene Mjls_1091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_1091 
Symbol 
ID4876831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp1167789 
End bp1169309 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content68% 
IMG OID640138404 
Productmethylmalonate-semialdehyde dehydrogenase [acylating] 
Protein accessionYP_001069389 
Protein GI126433698 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAACC AGATCCAGCA CTTCATCAAC GGCAAGCGCA CCGCAGGCGA GTCCACCCGC 
ACCGCGGACG TGATGAACCC GAGCACGGGG GCGGTGCAGG CCCAGGTCCT GCTCGGCTCG
CGCGCCGACG TCGACGCCGC GGTGGCCGGC GCCGCCGAGG CTCAGAAGGA GTGGGCGGCG
TGGAACCCGC AGCGCCGCGC CCGCGTCATG ATGCGCTTCA TCGAACTGGT CAACCAGCAC
ATGGACGAGT TGGCCGAACT GCTGTCGATC GAACACGGCA AGACCGTCCC CGACGCCAAG
GGCGATATCC AGCGCGGTAT CGAGGTCATC GAGTTCGCGA TCGGCATCCC CCACCTGATC
AAGGGTGAGT ACACCGAGGG CGCGGGCACC GGCATCGACG TCTACTCGAT GCGCCAGCCG
CTCGGCGTCG TCGCCGGCAT CACACCCTTC AACTTCCCGG CGATGATCCC GCTGTGGAAG
GCCGGTCCCG CGCTGGCCTG CGGTAACGCC TTCATCCTCA AGCCGTCCGA GCGCGACCCC
TCGGTGCCGG TGCGCCTGGC CGAATTGTTC ATCGAGGCCG GCCTGCCCGC GGGAGTGTTC
CAGGTCGTGC ACGGCGACAA GGAAGCCGTC GACGCGATCC TCGAGCATCC GGTGATCCAG
GCTGTCGGCT TCGTCGGCAG CTCCGACATC GCCCAGTACA TCTACGCCGG CGCGACAGCC
AACGGTAAGC GCGCGCAGTG CTTCGGCGGC GCGAAGAACC ACATGATCGT GATGCCCGAC
GCCGATCTCG ACCAGGCCGT CGACGCGCTG ATCGGCGCCG GCTACGGCAG CGCCGGCGAG
CGCTGCATGG CGATCAGCGT GGCGGTACCC GTCGGCAAGG AGACCGCGGA CCGGCTGCGC
AACCGGCTGG TCGAGCGGGT CAACAACCTG CGCGTCGGCC ACAGCCTCGA CCCGAAGGCC
GATTACGGCC CACTGGTCAC CGAGGCCGCG CTCAACCGGG TCCGCGACTA CATCAACCAG
GGCGTCGAGG CGGGCGCCGA GGCCGTCGTC GACGGGCGCG AGCGTTCCAG CGACGAGATG
CAGTTCGGCG ACGACAGCCT CGAGGGCGGC TACTTCATCG GCCCCACGCT GTTCGACCAC
GTCACCCCGG ACATGTCGAT CTACACCGAC GAGATCTTCG GCCCGGTGCT GTGCATCGTG
CGCGCCGACA ACTACGAAGA GGCACTGCGC CTGCCCACCG AGCACGAGTA CGGCAACGGC
GTGGCGATCT TCACCCGCGA CGGCGACACC GCACGCGACT TCGTCGCCAA GGTCCAGGTC
GGCATGGTCG GGGTCAACGT CCCGATCCCG GTTCCGGTGT CGTACCACAC CTTCGGCGGC
TGGAAGCGTT CCGGCTTCGG CGACCTCAAC CAGCACGGGC CGCACTCGAT CCTGTTCTAC
ACCAAGACCA AGACCGTCAC GCAGCGCTGG CCGTCGGGCA TCAAGGATGG CGCCGAATTC
GTCATCCCCA CGATGAAGTA G
 
Protein sequence
MTNQIQHFIN GKRTAGESTR TADVMNPSTG AVQAQVLLGS RADVDAAVAG AAEAQKEWAA 
WNPQRRARVM MRFIELVNQH MDELAELLSI EHGKTVPDAK GDIQRGIEVI EFAIGIPHLI
KGEYTEGAGT GIDVYSMRQP LGVVAGITPF NFPAMIPLWK AGPALACGNA FILKPSERDP
SVPVRLAELF IEAGLPAGVF QVVHGDKEAV DAILEHPVIQ AVGFVGSSDI AQYIYAGATA
NGKRAQCFGG AKNHMIVMPD ADLDQAVDAL IGAGYGSAGE RCMAISVAVP VGKETADRLR
NRLVERVNNL RVGHSLDPKA DYGPLVTEAA LNRVRDYINQ GVEAGAEAVV DGRERSSDEM
QFGDDSLEGG YFIGPTLFDH VTPDMSIYTD EIFGPVLCIV RADNYEEALR LPTEHEYGNG
VAIFTRDGDT ARDFVAKVQV GMVGVNVPIP VPVSYHTFGG WKRSGFGDLN QHGPHSILFY
TKTKTVTQRW PSGIKDGAEF VIPTMK