Gene Mkms_5834 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_5834 
Symbol 
ID4610542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008704 
Strand
Start bp42883 
End bp43830 
Gene Length948 bp 
Protein Length315 aa 
Translation table11 
GC content61% 
IMG OID639789488 
Productacetaldehyde dehydrogenase 
Protein accessionYP_935823 
Protein GI119855220 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4569] Acetaldehyde dehydrogenase (acetylating) 
TIGRFAM ID[TIGR03215] acetaldehyde dehydrogenase (acetylating) 


Plasmid Coverage information

Num covering plasmid clones84 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCACT CCAAGGTCGC AGTCATCGGT TCGGGCAACA TCGGTACCGA CCTAGTCGTC 
AAATTGAAGA AGTTGGCGAC CAACGTCGAG ATCGCTGTGT TGGTCGGCAT CGACCCGTCG
TCGGATGGTC TGGCTCGTGC CCGCCGGATG GGTATCGGCA CAGTCGACAC CGGTGTGCAG
GGTTTGATCG AGCACGCCGA ATTCGATGAG ATCGACATCA TCTTCGATTC CACGTCGGCG
AAAGCGCATC TCGTCAACGA GGAAGCGTTG CGTACCTTTG GCAAGCGGCT GATCGACCTG
ACTCCCGCTG CAGTCGGTCC CTACGTCGTG CCTGCCGTGA ATCTCGACGA CCACTTGGGT
GCGCCGAACG TCAACATGGT CACCTGCGGC GGTCAGGCGA CGATCCCTAT CGTCGCGGCG
ATCTCATCGG TCACGGCGGT GCACTACGCC GAGATCGTCG CCTCGATCGC GTCGAAATCG
GCGGGTCCGG GAACACGGTC GAATATCGAT GAATTCACCC AAACCACCTC AGCGGCAATC
GAAAAGGTAG GCGGAGCAGC ACACGGCAAG GCGATCATCG TTCTCAATCC CGCGGAGCCA
CCGTTGATCA TGCGCGATAC CGTCTTGGCT CTCGTGACGG ATCCCGATCA GAACCGCATC
AGGCAGTCGG TTATAGACAT GGTGGAGAAG GTGTCGGCCT ACGTGCCGGG CTACCGACTC
AAACAGGAAG TGCAGTTCAC CCAGCTCGAC GACGCCGAGT CCGTCGCGAC CCTGACCGGA
GGAGTCGACA AGGGGCCCGG GCTGTGGAAG GTGGCGGTCT TCCTAGAAGT CGAGGGTGCC
GCGCACTACT TGCCGGCCTA CGCCGGCAAT CTCGACATCA TGACCTCGGC GGCACTACAG
GTGGCCGAGC GGATCGCGGC GAACACTGTG CAGGAGGCCA CGCGATGA
 
Protein sequence
MSHSKVAVIG SGNIGTDLVV KLKKLATNVE IAVLVGIDPS SDGLARARRM GIGTVDTGVQ 
GLIEHAEFDE IDIIFDSTSA KAHLVNEEAL RTFGKRLIDL TPAAVGPYVV PAVNLDDHLG
APNVNMVTCG GQATIPIVAA ISSVTAVHYA EIVASIASKS AGPGTRSNID EFTQTTSAAI
EKVGGAAHGK AIIVLNPAEP PLIMRDTVLA LVTDPDQNRI RQSVIDMVEK VSAYVPGYRL
KQEVQFTQLD DAESVATLTG GVDKGPGLWK VAVFLEVEGA AHYLPAYAGN LDIMTSAALQ
VAERIAANTV QEATR