Gene Information |
Locus tag | Mkms_4338 |
Symbol | |
ID | 4612280 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | + |
Start bp | 4556929 |
End bp | 4557798 |
Gene Length | 870 bp |
Protein Length | 289 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639794023 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_940319 |
Protein GI | 119870367 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
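The coordinates and lengths listed above are mutually consistent, and this can be checked with simple arithmetic. The sketch below is a minimal check, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length counts the stop codon while the protein length does not. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4556929
end_bp = 4557798

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 870
print(protein_length)   # 289
```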
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.582042 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.314505 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGTGACT GGACCGCCGC CGACCTCCCC TCCTTCGCCG GCCGCTCGGT GATCGTCACC GGAGCCAACA GCGGCCTCGG CCTCGTCACC GCCCGCGAAC TGGCCCGCGT CGGCGCCGAC GTCGTCCTCG CGGTGCGCAA CACCGCCAAG GGCGACGAGG CCGCGGCCAC CATGACCGGC AACGTCACCG TCCGCAAACT CGATCTGCAG GACCTCGCGT CGGTGCGCGA GTTCGCCGCG GGCACCGACA GGGTGGACGT GCTGGTCAAC AACGCCGGGA TCATGGCGGT GCCCTACGCA CGGACCGTCG ACGGGTTCGA GAGCCAGATC GGCACCAACC ACCTCGGCCA CTTCGCGCTG ACCAATCTGC TGCTGCCCAA GATCACCGAC CGCGTGGTCA CGGTGTCGTC GTTCATGCAC GTCCTCGGGC GCATCAACCT CGACGACCTC AACTGGAAGG CGCGGCCGTA CTCGGCGTGG CTGGCCTACG GCCAGTCGAA ACTGGCCAAC CTCCTGTTCA CCAGCGAACT GCAGAACCGG TTGCGTCGCG CCGGTTCCCC GGTGCGTGCT CTGGCGGCCC ACCCCGGGTA TTCGCACACC AACCTGCAGG GTCAGTCCGG CCGCAAGCTC GGCGACGCGT TGATGGCCTT CGGCGGTGAG TACTTCGCGA CCGACGCCGA CTTCGGCGCC CGCCAGACGC TGTACGCGGT GGCCCAGGAC CTGCCCGGGG ACACCTTCGT CGGCCCGAAG TTCGCGATGC GCGGGCCGAC CGGTCCGGTG TGGCGCACGC CGCTGGCCCG CGACATGAAG ACCGCCGCGG CGCTGTGGGA GCTGTCCGAG CAGCTCACCG GCACCCGGTT CCCGCTCTAG
Protein sequence | MSDWTAADLP SFAGRSVIVT GANSGLGLVT ARELARVGAD VVLAVRNTAK GDEAAATMTG NVTVRKLDLQ DLASVREFAA GTDRVDVLVN NAGIMAVPYA RTVDGFESQI GTNHLGHFAL TNLLLPKITD RVVTVSSFMH VLGRINLDDL NWKARPYSAW LAYGQSKLAN LLFTSELQNR LRRAGSPVRA LAAHPGYSHT NLQGQSGRKL GDALMAFGGE YFATDADFGA RQTLYAVAQD LPGDTFVGPK FAMRGPTGPV WRTPLARDMK TAAALWELSE QLTGTRFPL
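The relationship between the gene sequence, the protein sequence, and the reported GC content can be reproduced with a short script. The sketch below is a minimal illustration, not the database's own pipeline. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are allowed) and that the gene starts with ATG, as this one does, so alternative start codons need no special handling. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table laid out in the conventional T, C, A, G order for each position.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-letter groups."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above give the first 10 residues.
print(translate("ATGAGTGACT GGACCGCCGC CGACCTCCCC"))  # MSDWTAADLP
```

Passing the full gene sequence to `translate` and `gc_percent` allows the result to be compared against the protein sequence and the GC content given in the record.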