Gene Mmcs_5437 details


Gene Information

Locus tag: Mmcs_5437
Symbol:
ID: 4114522
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008147
Strand:
Start bp: 18249
End bp: 19196
Gene Length: 948 bp
Protein Length: 315 aa
Translation table: 11
GC content: 61%
IMG OID: 638034592
Product: acetaldehyde dehydrogenase
Protein accession: YP_642593
Protein GI: 108802397
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG4569] Acetaldehyde dehydrogenase (acetylating)
TIGRFAM ID: [TIGR03215] acetaldehyde dehydrogenase (acetylating)


Plasmid Coverage information

Num covering plasmid clones: 62
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.747247
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCACT CCAAGGTCGC AGTCATCGGT TCGGGCAACA TCGGTACCGA CCTAGTCGTC 
AAATTGAAGA AGTTGGCGAC CAACGTCGAG ATCGCTGTGT TGGTCGGCAT CGACCCGTCG
TCGGATGGTC TGGCTCGTGC CCGCCGGATG GGTATCGGCA CAGTCGACAC CGGTGTGCAG
GGTTTGATCG AGCACGCCGA ATTCGATGAG ATCGACATCA TCTTCGATTC CACGTCGGCG
AAAGCGCATC TCGTCAACGA GGAAGCGTTG CGTACCTTTG GCAAGCGGCT GATCGACCTG
ACTCCCGCTG CAGTCGGTCC CTACGTCGTG CCTGCCGTGA ATCTCGACGA CCACTTGGGT
GCGCCGAACG TCAACATGGT CACCTGCGGC GGTCAGGCGA CGATCCCTAT CGTCGCGGCG
ATCTCATCGG TCACGGCGGT GCACTACGCC GAGATCGTCG CCTCGATCGC GTCGAAATCG
GCGGGTCCGG GAACACGGTC GAATATCGAT GAATTCACCC AAACCACCTC AGCGGCAATC
GAAAAGGTAG GCGGAGCAGC ACACGGCAAG GCGATCATCG TTCTCAATCC CGCGGAGCCA
CCGTTGATCA TGCGCGATAC CGTCTTGGCT CTCGTGACGG ATCCCGATCA GAACCGCATC
AGGCAGTCGG TTATAGACAT GGTGGAGAAG GTGTCGGCCT ACGTGCCGGG CTACCGACTC
AAACAGGAAG TGCAGTTCAC CCAGCTCGAC GACGCCGAGT CCGTCGCGAC CCTGACCGGA
GGAGTCGACA AGGGGCCCGG GCTGTGGAAG GTGGCGGTCT TCCTAGAAGT CGAGGGTGCC
GCGCACTACT TGCCGGCCTA CGCCGGCAAT CTCGACATCA TGACCTCGGC GGCACTACAG
GTGGCCGAGC GGATCGCGGC GAACACTGTG CAGGAGGCCA CGCGATGA
 
Protein sequence
MSHSKVAVIG SGNIGTDLVV KLKKLATNVE IAVLVGIDPS SDGLARARRM GIGTVDTGVQ 
GLIEHAEFDE IDIIFDSTSA KAHLVNEEAL RTFGKRLIDL TPAAVGPYVV PAVNLDDHLG
APNVNMVTCG GQATIPIVAA ISSVTAVHYA EIVASIASKS AGPGTRSNID EFTQTTSAAI
EKVGGAAHGK AIIVLNPAEP PLIMRDTVLA LVTDPDQNRI RQSVIDMVEK VSAYVPGYRL
KQEVQFTQLD DAESVATLTG GVDKGPGLWK VAVFLEVEGA AHYLPAYAGN LDIMTSAALQ
VAERIAANTV QEATR
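The sequences above are printed in blocks of ten characters, so the spacing has to be removed before any analysis. The following is a minimal sketch of that clean-up, together with a GC-fraction calculation and a translation. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); amino acid assignments
# are identical for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGAGTCACT CCAAGGTCGC AGTCATCGGT"
print(translate(first_blocks))  # MSHSKVAVIG
```

The translated residues match the first block of the protein sequence listed in the record. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field.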