Gene Mkms_4338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4338 
Symbol 
ID4612280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4556929 
End bp4557798 
Gene Length870 bp 
Protein Length289 aa 
Translation table11 
GC content70% 
IMG OID639794023 
Productshort-chain dehydrogenase/reductase SDR 
Protein accessionYP_940319 
Protein GI119870367 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.582042 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.314505 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGACT GGACCGCCGC CGACCTCCCC TCCTTCGCCG GCCGCTCGGT GATCGTCACC 
GGAGCCAACA GCGGCCTCGG CCTCGTCACC GCCCGCGAAC TGGCCCGCGT CGGCGCCGAC
GTCGTCCTCG CGGTGCGCAA CACCGCCAAG GGCGACGAGG CCGCGGCCAC CATGACCGGC
AACGTCACCG TCCGCAAACT CGATCTGCAG GACCTCGCGT CGGTGCGCGA GTTCGCCGCG
GGCACCGACA GGGTGGACGT GCTGGTCAAC AACGCCGGGA TCATGGCGGT GCCCTACGCA
CGGACCGTCG ACGGGTTCGA GAGCCAGATC GGCACCAACC ACCTCGGCCA CTTCGCGCTG
ACCAATCTGC TGCTGCCCAA GATCACCGAC CGCGTGGTCA CGGTGTCGTC GTTCATGCAC
GTCCTCGGGC GCATCAACCT CGACGACCTC AACTGGAAGG CGCGGCCGTA CTCGGCGTGG
CTGGCCTACG GCCAGTCGAA ACTGGCCAAC CTCCTGTTCA CCAGCGAACT GCAGAACCGG
TTGCGTCGCG CCGGTTCCCC GGTGCGTGCT CTGGCGGCCC ACCCCGGGTA TTCGCACACC
AACCTGCAGG GTCAGTCCGG CCGCAAGCTC GGCGACGCGT TGATGGCCTT CGGCGGTGAG
TACTTCGCGA CCGACGCCGA CTTCGGCGCC CGCCAGACGC TGTACGCGGT GGCCCAGGAC
CTGCCCGGGG ACACCTTCGT CGGCCCGAAG TTCGCGATGC GCGGGCCGAC CGGTCCGGTG
TGGCGCACGC CGCTGGCCCG CGACATGAAG ACCGCCGCGG CGCTGTGGGA GCTGTCCGAG
CAGCTCACCG GCACCCGGTT CCCGCTCTAG
 
Protein sequence
MSDWTAADLP SFAGRSVIVT GANSGLGLVT ARELARVGAD VVLAVRNTAK GDEAAATMTG 
NVTVRKLDLQ DLASVREFAA GTDRVDVLVN NAGIMAVPYA RTVDGFESQI GTNHLGHFAL
TNLLLPKITD RVVTVSSFMH VLGRINLDDL NWKARPYSAW LAYGQSKLAN LLFTSELQNR
LRRAGSPVRA LAAHPGYSHT NLQGQSGRKL GDALMAFGGE YFATDADFGA RQTLYAVAQD
LPGDTFVGPK FAMRGPTGPV WRTPLARDMK TAAALWELSE QLTGTRFPL