Gene Mkms_0100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_0100 
Symbol 
ID4615506 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp113523 
End bp114386 
Gene Length864 bp 
Protein Length287 aa 
Translation table11 
GC content67% 
IMG OID639789777 
Productshort-chain dehydrogenase/reductase SDR 
Protein accessionYP_936109 
Protein GI119866157 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0938298 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGGAG TGCAGGATCG CGTCATCGTC GTCACCGGAG CCGGCGGAGG GCTGGGCCGT 
GAGTACGCGC TGACGCTGGC CCGCGAGGGC GCCGCGGTCG TCGTCAACGA CCTCGGCGGT
GCGCGCGACG GCACCGGCGC CGGATCGGCG ATGGCCGATC AGGTGGTCGA CGAGATCAAG
GCCGCGGGTG GCCGGGCCGC GGCCAACTAC GACTCCGTCG CCGAACCCGA GGGCGCCGAG
AACATCATCA AGACCGCGAT CGACGAGTTC GGCAAGGTCG ACGGTGTGGT GAGCAACGCG
GGCATCCTGC GCGACGGCAC GTTCCACAAG ATGACGTTCG AGAACTGGGA CGCGGTGCTC
AAGGTGCACC TGTACGGCGG GTACAACGTG ATCCGTGCGG CGTGGCCGCA CTTCCGTGAG
CAGAGCTTCG GCCGCGTCGT CGTCGCCACC TCGACCAGCG GGCTGTTCGG CAACTTCGGC
CAGGCCAACT ACGGCGCAGC GAAACTCGGC CTCGTCGGCC TGATCAACAC GCTCGCCCAG
GAAGGCGCGA AGTACAACAT CAAGACCAAC GCCGTCGCAC CGATCGCCGC GACCCGGATG
ACGCAGGACA TCCTGCCGCC GGAGGTCTTC GAGAAGCTCA CGCCGGAGTA CGTCGCGCCG
GTCGTCGCGC ACCTGATGAC CGAGGAACTG ACCGACACCG ACTCGATCTT CATCGTCGGC
GGCGGCAAAA TACAGCGGGC AGCGCTCTTT CAGAACGAAG GTACTACCTT CACCAAGGTG
CCCACCCTCG ACGACGTCGC GTCCCGGTGG GGTGAGATCA CCGATCTGTC CGCAGCGCAG
CAGGCCAGCT TCAAGCTCGG CTGA
 
Protein sequence
MPGVQDRVIV VTGAGGGLGR EYALTLAREG AAVVVNDLGG ARDGTGAGSA MADQVVDEIK 
AAGGRAAANY DSVAEPEGAE NIIKTAIDEF GKVDGVVSNA GILRDGTFHK MTFENWDAVL
KVHLYGGYNV IRAAWPHFRE QSFGRVVVAT STSGLFGNFG QANYGAAKLG LVGLINTLAQ
EGAKYNIKTN AVAPIAATRM TQDILPPEVF EKLTPEYVAP VVAHLMTEEL TDTDSIFIVG
GGKIQRAALF QNEGTTFTKV PTLDDVASRW GEITDLSAAQ QASFKLG