Gene Mkms_4477 details

Gene Information

Locus tag: Mkms_4477
Symbol:
ID: 4612421
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 4705970
End bp: 4708003
Gene Length: 2034 bp
Protein Length: 677 aa
Translation table: 11
GC content: 69%
IMG OID: 639794164
Product: short chain dehydrogenase
Protein accession: YP_940458
Protein GI: 119870506
COG category: [I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only
[S] Function unknown
COG ID: [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
[COG3347] Uncharacterized conserved protein
TIGRFAM ID: [TIGR02632] rhamnulose-1-phosphate aldolase/alcohol dehydrogenase
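
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a short sketch. It assumes that the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(4705970, 4708003)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "2034 677"
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields.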


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGACA CCGTGAGAGA ACTGATCGAC CGCTCCAACC GTCTGGGCGC CGACCCGAAG 
AACACCAACT ACGCCGGCGG CAACACCTCG GCCAAGGGCA CCGAGACCGA CCCGGTCACC
GGTGCGCCCG TGGAGTTGTT GTGGGTCAAG GGTTCCGGCG GTGACCTCGG CACGTTGACC
GAGAAGGGGC TGGCCGTGCT GCGGCTGGAT CGGATGCGCG CGCTCGTCGA CGTCTACCCG
GGTGTCGAGC GTGAGGACGA GATGGTCGCC GCGTTCGACT ACTGCCTGCA CGGCCGCGGC
GGCGCGGCAC CGTCGATCGA CACCGCCATG CACGGTCTGG TCGACGCCGC CCACGTCGAC
CACCTGCACC CCGACTCGGG TATCGCGATC GCGACGGCGG CCGACGGGGA AGCCTTGACC
AAGCAGATCT TCGGCGACCG GGTGGTGTGG GTGCCGTGGC GGCGGCCCGG CTTCCAGCTG
GGTCTCGACA TCGCGGAGAT CAAACGGGCC AACCCGCAGG CCATCGGCAC CATCCTCGGC
GGCCACGGCA TCACCGCGTG GGGTGACACG TCGGCCGAGG CCGAGGCGCA TTCGCTGGAG
ATCATCGAAA CCGCGGAGCG CTACATCGCC GAACACGGCC GCCGCTATCC GTTCGGCGCG
CCTCTGGCGG GCTACGGGGC CCTCCCCGAG GGTCAGCGCC GGGACAAGGC CGCCGCACTC
GCACCGTTCC TGCGTGGGCT GGTCTCCACC GACAAACCCC AGGTCGGCCA TTTCACCGAC
GATCCGCGCG TGCTCGAATT CCTCAGCTGC GGTGAGCATC CGCGACTGGC CGCCCTCGGC
ACCAGCTGCC CCGACCACTT CCTGCGCACC AAGGTCAAAC CGCTGGTGCT CGACCTGCCC
GCCGACACCT CGGTCGAGGA CTGCAAGGCG CGACTGGCCG AACTGCACGA GGCGTACCGC
GCCGACTACC GCGCCTACTA CGAGCGGCAC GCCGGACCCG ACAGCCCCGC GATGCGCGGC
GCCGACCCGG CCATCGTGCT GATCCCCGGT GTCGGGATGT TCAGCTACGG CAAGGACAAG
CAGACCGCGC GGGTCACGGG CGAGTTCTAC CTCAACGCGA TCAACGTCAT GCGGGGTGCC
GAGGCGATCT CCACGTACGC GCCGATCGAC GAGGCGGAGA AGTTCCGCAT CGAGTACTGG
GCGCTGGAGG AAGCCAAACT GCAGCGTATG CCGAAGCCCA AACCGCTGGC CACCCGCATC
GCGTTGGTGA CGGGCGCAGC CTCGGGTATC GGCAAGGCCA TCGCCACCCG GCTGGCCGCC
GAGGGCGCCT GCGTGGTGAT CGCGGATCTG GACGCCGGAA AGGCCGCGGC CGCAGCGGAA
GAGCTCGGGA ACACCGACAT GGCGGTGGGG ATCGCCGCCG ACGTCACCGA CGAAGCGGCG
GTGCAGGCCG CCGTCGACGC CACCGTCCTG GCGTTCGGCG GCATCGACAT CGTCGTCAAC
AACGCCGGTC TCTCCCTGTC GAAGTCGCTG CTCGAGACCA CCGCCGCGGA CTGGGACCTC
CAGCACGATG TGATGTCGCG CGGGTCCTTC CTGGTGTCCA AGGCCGCGGC CAGGGCGCTG
ATCGACCAGG GGCTCGGCGG CGACATCATC TACATCTCGT CGAAGAACTC GGTGTTCGCC
GGGCCGAACA ACATCGCGTA CTCGGCGACC AAGGCTGATC AGGCCCACCA GGTGCGGCTG
CTCGCGGCCG AACTCGGTGA GCACGGCGTC AAGGTCAACG GCATCAACCC CGACGGCGTG
GTGCGCGGTT CGGGCATCTT CGCCGGCGGC TGGGGCGCCA AACGGGCCGC GGTCTACGGG
GTTCCAGAGG AGGAGCTCGG CGCCTACTAC GCGCAACGCA CCCTGCTCAA GCGTGAGGTG
CTGCCCGAGA ACGTCGCCAA CGCCGCGTTC GCGCTGTGCA CGTCGGACTT CTCGCACACC
ACCGGGTTGC ATGTGCCGGT CGACGCGGGC GTCGCCGCGG CTTTCTTGCG ATGA
 
Protein sequence
MTDTVRELID RSNRLGADPK NTNYAGGNTS AKGTETDPVT GAPVELLWVK GSGGDLGTLT 
EKGLAVLRLD RMRALVDVYP GVEREDEMVA AFDYCLHGRG GAAPSIDTAM HGLVDAAHVD
HLHPDSGIAI ATAADGEALT KQIFGDRVVW VPWRRPGFQL GLDIAEIKRA NPQAIGTILG
GHGITAWGDT SAEAEAHSLE IIETAERYIA EHGRRYPFGA PLAGYGALPE GQRRDKAAAL
APFLRGLVST DKPQVGHFTD DPRVLEFLSC GEHPRLAALG TSCPDHFLRT KVKPLVLDLP
ADTSVEDCKA RLAELHEAYR ADYRAYYERH AGPDSPAMRG ADPAIVLIPG VGMFSYGKDK
QTARVTGEFY LNAINVMRGA EAISTYAPID EAEKFRIEYW ALEEAKLQRM PKPKPLATRI
ALVTGAASGI GKAIATRLAA EGACVVIADL DAGKAAAAAE ELGNTDMAVG IAADVTDEAA
VQAAVDATVL AFGGIDIVVN NAGLSLSKSL LETTAADWDL QHDVMSRGSF LVSKAAARAL
IDQGLGGDII YISSKNSVFA GPNNIAYSAT KADQAHQVRL LAAELGEHGV KVNGINPDGV
VRGSGIFAGG WGAKRAAVYG VPEEELGAYY AQRTLLKREV LPENVANAAF ALCTSDFSHT
TGLHVPVDAG VAAAFLR
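
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before further processing. The following is a minimal sketch of that step together with a simple GC-fraction calculation. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first row of the gene sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGACTGACA CCGTGAGAGA ACTGATCGAC CGCTCCAACC GTCTGGGCGC CGACCCGAAG"
seq = join_blocks(first_row)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the complete gene sequence through the same two helpers gives a value that can be compared with the GC content field in the Gene Information section.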