Gene Mkms_5042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_5042 
Symbol 
ID4612721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp5283085 
End bp5285304 
Gene Length2220 bp 
Protein Length739 aa 
Translation table11 
GC content69% 
IMG OID639794735 
Productglycerol dehydratase 
Protein accessionYP_941021 
Protein GI119871069 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4909] Propanediol dehydratase, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.359522 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAATCC TCGATGCCAA ACCGGTCAAC CTCGACGGAT TCAGTGTCAC CGACCCCGCG 
CTGGGTCTGG TCGCGATGCA CAGCCCGCAC GACCCGCAGC CGTCGCTGGT CGTGCGCGAC
GGACGGGTCG TCGAACTCGA CGGCAGGCCG GCCGCCGACT TCGACGTGAT CGACGAGTTC
ATCGCCCGCT ACGGCATCGA CCTCACGGTC GCCGAGGAGG CGATGGCGCT CGACGATGCG
GTGCTGGCCC GGATGGCGGT CGACGTCAAC GTGCCGCGCG CGGAGGTGGT GCGTCTGATC
GGCGGCACCA CGCCGGCCAA GCTGGCCCGG GTCATGGCGG TGATGACGCC GGTCGAGATG
CAGATGGCGA TGCACAAGAT GCGGGCCAGG CGCACCCCGA GCAACCAGGC CCACGTCACC
AACCAGCTCG ACGATCCGCT GCTGATCGCG GCCGACGCGG CCAGTGCGGT GGCCTACGGC
TTCCGCGAGG TCGAGACGAC GGTGCCGGTG TTCGGCGACG CGCCGTCGAA TGCGATCGCA
CTGCTGATCG GCAGCCAGGT CGGCGTCCCG GGCGCCATGG CGCAGTGCTC GATCGAGGAG
GCGATGGAAC TGCGGCTCGG GCTGCGGGGC CTGACCAGCT ACGCCGAGAC GATCTCGATC
TACGGCACCG AACAGGTCTT CGTCGACGGT GACGACACTC CGTTCTCCAA GGCGATCCTC
ACCGCCGCGT ACGCCTCGCG CGGGCTCAAG ATGCGGGTCA CCAGCGGCGG CGGCGCCGAG
GTGCTGATGG GTGCCGCCGA GAAGTGCTCG ATCCTCTACC TCGAATCGCG GTGTGTGTCG
CTGGCGCGGG CGCTGGGCTC GCAGGGTGTG CAGAACGGCG GCATCGACGG GGTCGGCGTG
GTGGCGTCGG TGCCCGAGGG CATGAAAGAA CTGCTCGCCG AGAACCTGAT GGTGATGATG
CGCGATCTGG AATCGTGTGC GGGCAACGAC AACCTGATCT CCGAATCCGA TATCCGCCGC
AGCGCGCACA CGCTGCCGGT GCTGCTGGCC GGCGCGGACT TCATCTTCTC CGGCTTCGGA
TCGATTCCGC GGTACGACAA CGCTTTTGCG CTGTCGAACT TCAACGCCGA CGATATGGAC
GACTTTTTGG TGCTGCAGCG GGACTGGGGT GCCGACGGTG GACTGCGCAC CGTGTCACGC
GAGCATCTGG CGCGGGTGCG GAGGCGGGCG GCGACGGCGG TGCAGGCCGT GTACCGCGAT
CTGGGCCTCG CCGACTTCGA CGACACCCGC ATCGACGCCG TGGTGGTGGC CAACGACTCC
CGCGATCTAC CCGCCGGGGA TCCGAAGGCC GTCGCGGAGG CGGCCACCGC GATCGAGGCC
AGGCAGCTCA CCGTGTTCGA CGTCGTCGCG GCGCTGCACC GCACCGGGTA TGCACCGGAG
GCCGAGGCGA TCATGCGGTT GACCCGCGAA CGCCTGCGCG GTGACCAGTT GCAGACCTCG
GCGATCTTCG ACGACCAGTT CCAGGTGCTG TCGAAGATCA CCGATCCGAA CGACTACGCC
GGACCCGGCA GCGGCTACAC GCCGACCGAG AAACGCCGTG CCGAGATCGA CGGTATCCGG
CAGGCCCGCA CGTCGGCCGA ACTCACCGCC GACCAGGCCG AACACCGGGG CCACGTCGTG
TTCTCCGACG TCGAACCGGC CCATCAGGGC AGCGATCCGC GCGAGGTGTG CATCGGCCTC
TCCCCGGCGC TGGGACGGTC GGTGTGGCTG ACACTGTGCG GTCTGACGGT CGGTGAGGTG
TTGCGCCAGC TCTCCGCCGG TCTCGAGGAG GAGGGGTGTG TGCCGCGCCT GGTCCGGGTC
CGGTCGACCA TCGACGTGGG GCTGATCGGG TTGACCGCCG CCCGGCTGTC CGGATCCGGT
ATCGGAATCG GGTTGCAGGG CAAGGGAACC GCACTCATCC ACCGGCGCGA CCTGGCGCCG
CTGGCGAACC TCGAACTGTT CAGCGTGGCC CCGCTGCTCA CCGCGAAGAT GTACCGCGAG
CTGGGCCGCA ACGCCGCCCG GCACGCCAAG GGGATGGCGC CACTGCCGAT CCTGGCCGGC
GGTACCGACG AATCCATCTC CGCCCGCTAC CACGCCCGAG CCGTGGCGTT GGTGGCGCTG
GAACGCCAGG CCTGCGAACC GGGGCAGGCA CCGATCACCG TGGAGGCCAA ACGAGTATGA
 
Protein sequence
MRILDAKPVN LDGFSVTDPA LGLVAMHSPH DPQPSLVVRD GRVVELDGRP AADFDVIDEF 
IARYGIDLTV AEEAMALDDA VLARMAVDVN VPRAEVVRLI GGTTPAKLAR VMAVMTPVEM
QMAMHKMRAR RTPSNQAHVT NQLDDPLLIA ADAASAVAYG FREVETTVPV FGDAPSNAIA
LLIGSQVGVP GAMAQCSIEE AMELRLGLRG LTSYAETISI YGTEQVFVDG DDTPFSKAIL
TAAYASRGLK MRVTSGGGAE VLMGAAEKCS ILYLESRCVS LARALGSQGV QNGGIDGVGV
VASVPEGMKE LLAENLMVMM RDLESCAGND NLISESDIRR SAHTLPVLLA GADFIFSGFG
SIPRYDNAFA LSNFNADDMD DFLVLQRDWG ADGGLRTVSR EHLARVRRRA ATAVQAVYRD
LGLADFDDTR IDAVVVANDS RDLPAGDPKA VAEAATAIEA RQLTVFDVVA ALHRTGYAPE
AEAIMRLTRE RLRGDQLQTS AIFDDQFQVL SKITDPNDYA GPGSGYTPTE KRRAEIDGIR
QARTSAELTA DQAEHRGHVV FSDVEPAHQG SDPREVCIGL SPALGRSVWL TLCGLTVGEV
LRQLSAGLEE EGCVPRLVRV RSTIDVGLIG LTAARLSGSG IGIGLQGKGT ALIHRRDLAP
LANLELFSVA PLLTAKMYRE LGRNAARHAK GMAPLPILAG GTDESISARY HARAVALVAL
ERQACEPGQA PITVEAKRV