Gene Mkms_3535 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3535 
Symbol 
ID4611465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp3723014 
End bp3728128 
Gene Length5115 bp 
Protein Length1704 aa 
Translation table11 
GC content73% 
IMG OID639793211 
Productbeta-ketoacyl synthase 
Protein accessionYP_939519 
Protein GI119869567 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG0300] Short-chain dehydrogenases of various substrate specificities
[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAGGG ACTTAGGCAA GGCTAACCAA TTGGCACCCG GCCTCCCCGA ACCCGTCGCG 
ATCGTCGGCA TCGGCTGCCG GCTGGCCGGC GACATCACGA CCCCCGCGGA CTTCTGGACG
TTCCTGCTCG ACGGGGGCAG TCGCGTGCGC GAGGTGCCCG CGGACCGCTG GGAGCCCTAT
CTGCGGCGCG ACCCCCGCAA CGCCGCGGTG CTGCGGGAGA CGACGCCGTG GGGCACCTTC
CTCGACGACC TGGCCGGTTT CGACGCCGAA TTCTTCGGTG TCTCACCGCG TGAAGCCGAA
CTGATGGATC CGCAGCAGCG ACTGGCGCTC GAAGTCTCGT GGGAGGCACT CGAACACGCC
GGGGTTCCGC CGCGCTCGCT GGCCGGCAGC GACACCGCGG TGCTGATGGG CGTGAACTCC
GACGACTACG GCAAGCTCAT CATGGAGGAC CTGCCGGGGA TCGAGGCGTG GACCGGGATC
GGCACGTCGC TGTGCGGTAT CGCCAACCGC GTCTCGCACC TGCTCGATCT GCGCGGCCCC
AGCGTCGCAC TCGATGCGGC GTGCGCGGCC TCGCTGGTGG CCGTGCACCA GGCGTGCCAA
CTGCTGCGGA CGGGGGAGAC GTCGCTGGCG CTCGCCGGCG GTGTCAGCGC ACTGATCGGG
CCCGGTCTGA CCCGGGTCCT CGACCAGGCG GGGGCGACCG CGCCCGACGG CCGGTGCAAG
AGCTTCGACG CCGACGCCGA CGGCTACGGC CGCGGTGAGG GCGCCGCGGT CGTGGTGCTC
AAGCGGTTGG CCGACGCCGT GCGGGACGGT GACCGGGTGA TCGCCGTCGT CCGCGGTGGG
GCGGTGGCCC AGGACGGACG CACGGTCGGC ATCATGTCGC CCAACGGCGC CGCCCAGGAG
GACCTGTTTC GGCGCACCTG TGCGACCTCC GGAATCGACC CGGCGACAGT CGGATTCGTC
GAGGCGCACG GGACCGGCAC GCCGACCGGC GATCCGGTCG AACTCGACGC GCTCGCCGCC
GTGTACGGGG CGCGGCGGGC CGACGGTGAA CCGTGCCGCG TCGGATCGGT CAAACCCAAC
ACCGGTCACC TCGAGGGCGG GGCCGGGGTG GTGGGGTTGA TCAAGGCGGC GATGGCGCTG
CAGCACGAGG CGATCCCCCC GACCGCCGGC GTCCGCACAC TCACCCCGGC CGTGGACTGG
GCGTCGAGTG GACTGCGGGT TCCGACCCGG GTCGAGGCGT GGCCGCGTCG CGACGATGTC
GCGCGCCGGG CCGCGGTGTG CAGCTACGGC TACGGCGGCA CCATCGCCCA TGTCCTGCTG
GAAGAGGCGC CGGTAGCCGT ACGCCCGGAG GCGGAGGCGG ACGCCGAACC CGTGGTCGTC
CCGCTGTCGG GGCGGTCGTC CGCGCGCCTG ACCCGCCACG CCGCCGCTCT CGCCGATCAC
CTTCGGCGGC AACCACATTC GGTGGGAGAG GTGGCCGCGA CCCTGTGGGC GCGGCGCTCC
CACGAACCGG TGCGCGCCGC GGTGATCGCA CACGACGGCG CCGACCTGGT GACCGGGCTC
GACGCGCTCG CCGACGACCG CCCGACACCG TCGGTTCTCA CCGGCAACGT ACTCGCGGGT
GCGGCCGACG GTGCGGTGTG GGTGTTCTCC GGACACGGAT CGCACTGGCC CGGCATGGGT
CGCGAACTGC TCTCGCACGA ACCGGCTTTC GCGGCGGTCA TCGACGCGGT GGAACCGGTG
TTCGCCGCCG AACTCGGGTT CTCGCCGCGT GCCGCGCTGC ACAGCGGTGA CCTCGGCGGA
ACCGATCGGG TGCAGGCGCT GACGTTCGCG ATGCAGGTCG GCCTGGCCGC GGTGCTGCGC
GAGCGCGGTC TGCGTCCGGC CGCGGTGATC GGCCACTCCG TCGGTGAGGT GGCCGCCAAC
GTCGTCGCCG GTGTCTTCGA CCTCGCCCAC GGGGCCGCGG TGGCCTGCTA CCGCGCACGG
GGCTTCCGAT CGGTGGCCGG CGCGGGGGCG ATGGCCCTGG TCCGTCTTCC CTTCGCCGAG
GCCGACCGGC GCCTCGGCGA CCGCACCGAT GTGGTGGCGG CGATCAGCGC CTCACCGGAG
TCGACGGTCA TCTCGGGCAC CGTCGAAGCG GTCGATGAGG TGTCGGCGCG CTGGACCGAC
GAGGGGATGA CGGTGCGCCG GGTGAACACC GACGTGGCGT TCCACAGCCC GGCGATGGAC
GGATTGACCG CCGAACTCGC CCGTCTCACA GCAGGTCTGG CGCCGACGAA GTCCGCCGGC
ATGCCGCTGT ACTCCACCGC GCTGCCCGAC CCCCGCTCGA CCGCGCCACG CGACCCCGAC
TACTGGGTGG CCAACCTGCG TGGCCGGGTG CGCTTCGCCG AAGCCGTGAC CGCCGCGGCC
GAGGACGGAC ACCGGCTGTT CCTCGAGGTG TCGGCGCATC CGGTGGTCGC GCACTCCATC
GCGGAGACGC TGGCGCACCA CGCGATCGAC GACCACGCGG TCGTGCCGGT GCTGCGGCGC
GAACAGCCCG AGCTGCCTGC CGTCGCCGCC GCCGTCGGAG CGCTGTACTG CCACGGCGCA
CCGGTCGCGC ACCGGGTCGA CACCGAAAAA CCCTGGGCCG CGGATCTTCC CGGAACGCAG
TGGGTGCACC GGCGACACTG GCGCACGCCC GCCGCACCCC CTGGCGGTCG CGGTGTGCAC
GAGCCGGATT CGCACACCCT GCTCGGCGGT CCGATGGAGG TGACGGGGGC GGTGCCCGCA
CGGGTGTGGC AGACACGGCT GGACTTGTCG ACGCGGCCCT ACCCGGGCGA CCACCCGGTG
CAGGGCACCG AGATCGTGCC CGCCGCCGTG TTGCTGAACA CGTTCCTGGC CGCGGCCGGA
ACCGATCTGG CCGACGTGCG GTTGCGCACA CCCGTGCCGC CGGCACGTGC GCGCGACATC
CAGGTGGTGT CGCAGGACCG CTCGCTGGCC CTGGCGTCCC GGGTGGTCGA CGACGGCGAG
GACGCCGACG GTGGCTGGTT GACCCACTGC ACCGCGCTGG CCGCGCCGGG TGGGGAACCC
GCGCTGACCG TGCTCGACGA GGACGAGATC CGCGCGCGGT GTCCGGAGGT GTTGCCGTCG
ACCCATGTCG TGGACACGTT GGCGACCCTC GGCGTGGCCG CGATGGGATT CGGTTGGCAG
GTGCTCGACC TGCACCGCGG TGACGGTGAG CTCTTCGCCC GTGTCGCGGC CGACGTCGAC
GGATCGACGC CGGCGACGTG GGCGGGACTG CTCGACGCCG CGACCTCGGC GGCCTCGACG
ATCTTCGACG GCCCGCCGCG ACTGCGCATG CCCGCCCGGA TCGAACGGGT GCACGTCCAC
GACAAACCAC CCGCCGTCGC GCTGCTGCAC GTACGCCGCC GCGAGGCGGG CACCGTCACC
GACGTGGTAC TCGCCGAGGA GTCCGGTGCG GTCTCGGTGT CCTTGACCGG CATGGCATTC
GAGGAGCTCG AGAATCCCAG CGGCCGCGAC ACGGCGCGGC TGTTGCACCA CGTCGCGTGG
CAACCGGTCG CATGGCCGGA CGCCGACCTG CCTGCCGAGG TGGTGCTCGT CGGCGGGGAC
GCCGCTACGC GGGCCTTCGT CACCCGCGAC CTCGACGAGG CCGGTGTGCC GCACCGATCT
GTCGGTCATG CAACGGAACT CGGTGCGCTC CCGAGCGGGT CGGTGGTGCT CGTGCTGCCG
CGCGCCGACG AGACACCGCA GACGTCGGCC GAACTCGTGC TCAGCACCGT GCAGCATCTC
GACGCGTCCG GTGCTCGAAC CAGGTTGTGG GTGCTGACCT CCGACGTGCA CGAAGGCGTC
AACCCGGCGC ACGCCCCGCT GTGGGGAATG GCGCGGGTGG CGGCGGCCGA ACATCCACAC
CTGTGGGGCG GCGTGCTCGA CATCACCGGG GACCGGTTGC CGGTGCGGGC GCTCGGCGCG
CTGCACGGGC ACGGTGTGGT GGTGGTGCGC GACGGCGTCG CGTACGCGGC GCGGTTGGCC
CACGCCGGAC CGGGGGAGGC GGCGCCGCTG CAGTGCTCGC CGGGCGGTAC CTACCTGATC
ACCGGCGGCA CCGGCGTGCT GGGGCTGCGG CTGGCGCAGC GCCTGGCCGA CCTGGGGGCG
CGGCGGCTGG TGCTGGTGTC GCGGTCTGGC ATTCCCGAGC GCAGCGCATG GCGCGACCAC
AGCGACCGGG AAGTGGTCGC CGTGGTCTCC GCCCTCGAAC AGCGGGGGGT GTCGGTGAAG
GTGGCGGCCG TCGACGTCGG CGCACCCGCA GCGGCCACCG CGCTGCGGTC GGCGCTGGTC
GACCTGCCGC CGGTGCGCGG AGTCATCCAC GCCGCGGGCG TGGAAGCCGG TGCGCTGCTG
GCCACGACGA CCGCGGACGA TCTCGACGCG GCGATGCGGC CGAAGGTCGC CGGGCTCAAC
ACGCTGCACG AGCTGTTCCC GCCCGGCGAG CTGGATTGGA TGGTGCTGTT CTCGTCGTGC
GGCTATCTGG CGGGCTTCCC CGGCCAGGGT GCCTACGCGT GCGGCAACGC CTACCTCGAC
GCCTTCGCCC GCCACCGCCG CCGCCTCGGC GACCGGACCA CCAGCGTCGC GTGGACGGCG
TGGCGGGGTA TGGGCATGGG CTCGGCCTCC GGATTCGTCG CCGCCCAGCT CGACGCGCTC
GGGATGGGCA CGGTGGGACT CGACGACGCG ATGCGGGCGC TGGACTCGGC GATGCGCGAC
GACGACCCCA ACGTGGTGGT GTTGCCGGTG CTGCCGTCGG CGGCATCGGT GCCGATCCTC
GCCGACGTCG CGCCCACCGA ATCCGCGGAG CCGGTCGCGG ACCGCGGTGA TGAGGACGTC
GCGGAGTGGG CGGCTCGCCA GGTGCTCACC GCGGTCTCCT CCGAACTCGG TTGCGCCGCA
GATGATGTCG ACCTACGGCT GCCGTTGGTG GAGATCGGTG TCGACTCCAT CATGACCGTC
GCCCTGCGGC GCCGGCTGGA GAAGCAGACC GGGCTGTCGC TGCCGCCGAC GCTGCTGTGG
GAGTACCCGA CCGCCGCGGC GGTGACCGAC CGGATCACCG AACTGCTGAC CGTCGAGGAC
GATTCGGCCG CATAA
 
Protein sequence
MSRDLGKANQ LAPGLPEPVA IVGIGCRLAG DITTPADFWT FLLDGGSRVR EVPADRWEPY 
LRRDPRNAAV LRETTPWGTF LDDLAGFDAE FFGVSPREAE LMDPQQRLAL EVSWEALEHA
GVPPRSLAGS DTAVLMGVNS DDYGKLIMED LPGIEAWTGI GTSLCGIANR VSHLLDLRGP
SVALDAACAA SLVAVHQACQ LLRTGETSLA LAGGVSALIG PGLTRVLDQA GATAPDGRCK
SFDADADGYG RGEGAAVVVL KRLADAVRDG DRVIAVVRGG AVAQDGRTVG IMSPNGAAQE
DLFRRTCATS GIDPATVGFV EAHGTGTPTG DPVELDALAA VYGARRADGE PCRVGSVKPN
TGHLEGGAGV VGLIKAAMAL QHEAIPPTAG VRTLTPAVDW ASSGLRVPTR VEAWPRRDDV
ARRAAVCSYG YGGTIAHVLL EEAPVAVRPE AEADAEPVVV PLSGRSSARL TRHAAALADH
LRRQPHSVGE VAATLWARRS HEPVRAAVIA HDGADLVTGL DALADDRPTP SVLTGNVLAG
AADGAVWVFS GHGSHWPGMG RELLSHEPAF AAVIDAVEPV FAAELGFSPR AALHSGDLGG
TDRVQALTFA MQVGLAAVLR ERGLRPAAVI GHSVGEVAAN VVAGVFDLAH GAAVACYRAR
GFRSVAGAGA MALVRLPFAE ADRRLGDRTD VVAAISASPE STVISGTVEA VDEVSARWTD
EGMTVRRVNT DVAFHSPAMD GLTAELARLT AGLAPTKSAG MPLYSTALPD PRSTAPRDPD
YWVANLRGRV RFAEAVTAAA EDGHRLFLEV SAHPVVAHSI AETLAHHAID DHAVVPVLRR
EQPELPAVAA AVGALYCHGA PVAHRVDTEK PWAADLPGTQ WVHRRHWRTP AAPPGGRGVH
EPDSHTLLGG PMEVTGAVPA RVWQTRLDLS TRPYPGDHPV QGTEIVPAAV LLNTFLAAAG
TDLADVRLRT PVPPARARDI QVVSQDRSLA LASRVVDDGE DADGGWLTHC TALAAPGGEP
ALTVLDEDEI RARCPEVLPS THVVDTLATL GVAAMGFGWQ VLDLHRGDGE LFARVAADVD
GSTPATWAGL LDAATSAAST IFDGPPRLRM PARIERVHVH DKPPAVALLH VRRREAGTVT
DVVLAEESGA VSVSLTGMAF EELENPSGRD TARLLHHVAW QPVAWPDADL PAEVVLVGGD
AATRAFVTRD LDEAGVPHRS VGHATELGAL PSGSVVLVLP RADETPQTSA ELVLSTVQHL
DASGARTRLW VLTSDVHEGV NPAHAPLWGM ARVAAAEHPH LWGGVLDITG DRLPVRALGA
LHGHGVVVVR DGVAYAARLA HAGPGEAAPL QCSPGGTYLI TGGTGVLGLR LAQRLADLGA
RRLVLVSRSG IPERSAWRDH SDREVVAVVS ALEQRGVSVK VAAVDVGAPA AATALRSALV
DLPPVRGVIH AAGVEAGALL ATTTADDLDA AMRPKVAGLN TLHELFPPGE LDWMVLFSSC
GYLAGFPGQG AYACGNAYLD AFARHRRRLG DRTTSVAWTA WRGMGMGSAS GFVAAQLDAL
GMGTVGLDDA MRALDSAMRD DDPNVVVLPV LPSAASVPIL ADVAPTESAE PVADRGDEDV
AEWAARQVLT AVSSELGCAA DDVDLRLPLV EIGVDSIMTV ALRRRLEKQT GLSLPPTLLW
EYPTAAAVTD RITELLTVED DSAA