Gene Mkms_5098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_5098 
Symbol 
ID4612781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp5341914 
End bp5347466 
Gene Length5553 bp 
Protein Length1850 aa 
Translation table11 
GC content69% 
IMG OID639794795 
Productmycolic acid condensase 
Protein accessionYP_941077 
Protein GI119871125 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATGT CTGAAACACC GAACAATTCG CCCTCCGCTG AGAATCAGCC GATCGTCGCC 
CAGGGCGAGG GTGGCCCGCT GCGTCCCGCG CAGGTCGACA TGACCGTCGC CGAGATGCGC
GAATGGCTGC GCAACTGGAT CGCCAACGCG ACCGGCCAGA ACGCCGACAA CATCGACGAG
CAGACCGCGA TGGTGGAACT CGGCCTGTCC TCGCGCGACG CGGTCGCGAT GGCCAGCGAC
ATCGAGGACC TCACCGGTGT CACGCTGACG GCGACCGTGG CGTTCCGCCA TCCGACCATC
GAGTCGTTGG CGACGGTGAT CATCGAGGGT GAGCCCGAGG TCGAGCACGA CGACGACGGT
ACCGACTGGA GCCGTGAGCG CGACGTCGAC GACATCGCGA TCGTGGGTCT CGCCACCCGC
TTCCCGGGCG ATATGAACAC CCCCGACGAG ATGTGGGAGG CGCTGCTCGA GGGTCGCGAT
GCGATCACCG ATCTGCCCGA GGGCCGGTGG GAGGAGTTCC TCGGTGAGCC GCGGATCGCC
GAGCGGGTCG CCAAGGCGGC CACCCGCGGC GGCTACCTGT CGGACATCAA GGGTTTCGAC
GCCGAGTTCT TCGCGCTGTC GAAGATGGAG GCCGACAACA TCGATCCGCA GCAGCGGATG
GCGCTCGAGC TGACGTGGGA GGCGCTCGAA CACGCCCGCA TCCCGGCGTC GAGCCTGCGC
GGTGAGCGGG TCGGCGTGTA CATCGGCGCG TCGAACAACG ACTACAGCTT CATGTCCGTG
GCCGATCCGG GCGTCGCGCA CCCGTACGCG ATCACCGGCA CCACCAGCTC GATCATCGCC
AACCGGGTGT CGTACTTCTA CGACTTCCGC GGCCCGTCGA TGGCGATCGA CACCGCATGT
TCGAGTTCGC TGGTCGCCGC CCACCAGGGG GTGGCCGCGC TGCGCGCAGG CGAGGCCGAC
GTCGCGGTGG TCGGCGGCGT CAACGCGCTG ATCACCCCGC TGGTGACCAT CGGGTTCGAC
GAGGTCGGCG GTGTGCTCGC ACCCGACGGC CGGATCAAGT CGTTCTCGCA GGACGCCAAC
GGCTACGCCC GCTCCGAGGG CGCAGGCATG CTGGTGCTCA AACGCCTCTC CGATGCGCGG
CGCGACGGTG ACGAGATCTA CGCCGTGATC GCGGGCAGCG CGGTGAACCA CGACGGCCGG
TCCAACGGTC TGCTGGCGCC CAACCCGGAC GCGCAGGCCG AGGTGCTGCG CAAGGCCTAC
AAGGACGCGG GCATCAACCC GCGCGACGTC GACTACATCG AGGCGCACGG CACCGGCACC
ATCCTCGGCG ACCCGATCGA GGCCGACGCG CTCGGCCGGG TGATCGGACG CAGTCGGCCG
GCCGACCAGC CCGCCCTGCT GGGTGCGGTG AAATCCAATG TGGGACACCT GGAGTCGGCG
GCAGGCGCGG CCAGCCTGGC GAAGGTGGCG CTGTCGCTGC GCAACGACAA GCTGCCCCCA
TCGATCAACT ACACCGGCCC GAACCCCTAC ATCGACTTCG ATGCCGTGCG GTTGAAGGTC
AACGACACGG TCAGTGACTG GCCGCGCTAC AGCGGGCACG CCATCGCCGG CGTCTCCGGC
TTCGGCTTCG GCGGCGCCAA CGCGCACATG GTGCTGCGCG AGGTGCTGCC CAGCGACCTG
GTCGAGCCCG AACCCGAGCA GGTGGTCGAG GTGACCGCCG AGCCGAATCA GCCCGCCGTG
TACGTGGGCG GGGTGCGGAT GGACGAGTAC GGCGAGTTCG TCGATGAGCC GCTTGCGCGA
AGAGAATCAG GCTTCGATGA GGACTCAGAC GACGACGGTC TCGACCGGCC CGCCGCAGCG
GTCGAGGACG ACTACGAGCT GCCCGGACTG ACCGACGAGG CCAAGCGTCT GCTCGAGGTC
GCGCGCGAAG AGCTCGAAGC GGCCGAACAG CCCGTTCCGC TTGTGCCGCT GGCGGTTTCG
GCGTTCCTGA CCTCACGCAA GAAGTCCGCG GCCGCCGAGC TGGCCGACTG GATGGACAGT
GAAGAGGGCC GGGCGTCGTC GCTGGAGTCG ATCGGCCGCG CGCTGTCGCG CCGCAACCAC
GGCCGCTCCC GCGCGGTCGT GATGGCCCGC GATCACGACG AGGCGATCAA GGGTCTGCGC
GCGCTGGCCG AGGGCAAGCA GAGCCCGAAC GTCTACAGCG CCGACGGTCC CGTGACGAAC
GGGCCGGTCT GGGTGCTCGC CGGTTTCGGC GCCCAGCACC GCAAGATGGG TAAGAGCCTC
TACCTGCGCA ACGAGGTGTT CGCCGACTGG ATCAACAAGG TCGACTCCCA CGTGCAGGAC
GAGCGGGGGC ACTCGATCCT CGAGCTCATC CTCGACGATG CCGTCGACTA CACCGACGAG
ACCACCGAGC TGCCGATCGA GAAGGTGCAG CTGGTCATCT TCGCGATCCA GGTGGCGCTG
GGCGAACTGC TCAAGCATCA CGGCGCCAAG CCGGGTGCGG TCATCGGCCA GTCGCTCGGT
GAGGCCGCCG CGGCCTACTT CTCGGGCGGC CTGTCGCTCG AGGACGCCAC CCGCGCCATC
TGTTCGCGCA GCCACCTCAT GGGTGAGGGC GAGGCAATGC TGTTCGGCGA GTACATCCGC
TTGATGGCGC TGGTGGAGTA CTCCGCCGAC GAGATCAAGA CGGTGTTCTC CGACTATCCC
GACCTCGAGG TGTGCGTGTA CGCCGCCCCG ACCCAGACCG TGATCGGCGG CCCGCCCGAG
CAGGTGGACG CGATCATCGC CAGGGCCGAG CAGGAGGGCA AGTTCGCCCG CAAGTTCCAG
ACCAAGGGCG CCAGCCACAC CTCGCAGATG GATCCGCTGC TCGGTGAGCT GGCCGCCGAA
CTGGTCGGAA TCACCCCGCA CCCGTTGCAG ATCGGCTACT ACTCGACGGT GCACGAGGGC
AAGTTCCTGC GGGCGGGCAG CGAGCCGATC CACGACGTGG ACTACTGGAA GAAGGGGCTG
CGCCACAGCG TCTACTTCAC CCACGGCATC CGCAACGCGG TCGACAACGG CCACACCACC
TTTCTCGAGC TCGCCCCGAA CCCGGTGGCG CTCATGCAGG TCGGGCTGAC CACGATGTCG
GCGGGTCTGC ATGACGGGCA GCTCATCGCG ACGCTGGCCC GCAAGCAGGA CGAGGTCGAC
TCGATGACCG CGGCCATGGC CCAGTTGTTC GTCCACGGCC ACGACCTCGA CATGCGCACC
CTGTTCCCGC GGCGTTCGCG CGGGCTGGCC GGTGCGCTGG ACTACGCGAA CATCCCGCCG
ACCCGGTTCC GCCGCAAGCC GCACTGGCTC GACGTGCGCT TCAGCGGGGA CAACGCCGGT
GTCATGCCGG GTAGCCACGT CGCCACCCCG GACGGCAGGC ACGTGTGGGA GTACTCGCCG
CGCGGTGCCG TGGACGCCCA GGCCCTGGCG GCGCTGGTGA AATCCGCTGC CTCGCAGGTG
TTCCCGGAGG CGGCCGTGAC GGCGGCGGAG CAGCGGGCGG TGCCCGGTGA CGGCGCCCGC
CTGGTGACCA CGCTGACCCG CCACCCCGGT GGGGCGTCGG TGCAGGTGCA CGCCCGCATG
GACAACGGAG GCGAATCTTC CTTCGCTCTG GTCTACGACG CGATCGTCAC CCGCGGTGGT
CAGGCCGTTG CCCTGCCCGC TGCGGTCGCC ACGGGGACCG TTGCTCCGCA AGCCGATTCG
CTGACTCCAG CGGCCGAACC CGAGGGCGGC GACGCGGCGA TCCTGTCCGA CAACCTCACC
CAGGGCGCGA ACCTCGGTGC GGGACTGGGC AAGTGGTCGC CGGACTCCGG TGAGACCATC
CACGACCGGC TCGGCACGAT CGTCGGCGGC GCCATGGGTT ACGAGCCCGA GGATCTGCCG
TGGGAGGTGC CGCTCATCGA GCTCGGCCTG GATTCCCTGA TGGCGGTGCG GATCAAGAAC
CGTGTCGAGT ACGACTTCGA CCTGCCGCCG ATCCAGCTGA CCGCGGTGCG CGACGCCAAC
CTCTATGCCG TCGAACAGCT CATCACCTAC GCGATCGAGC ATCGCGACGA GGTCGACCAG
CTGGCCGAAT CGCAGAAGGG TAAGACCGCC GAGGAGATCG CCGCCGAACA GGCCGAGCTG
ATGGGCGGGG CGTCGACGGT CGCCGAGCTC GAGGAGAAGC TGGCCGCGGC GGGACACCCG
CTGGGCGAAG CTGCCTCAGA GCAAGCTGCG GTGATGTCCG GGGCGAACCT GACGACGGTG
ACCACGCCGG CGCCGGATCC GCAGGCCGAA CCGAGCACCC AACAAGATTC GGCGATCCCG
GCCCCGCCGA CCGATCCGTC GGGACCGAAC ATCCCGCCGC CGCCGACCAA CCCGGCCGGA
CCGGACACCA CCGCGAAGTC GTCCGCCGCC AAGGCGGCCG CGCAGGTGCT GACCCAGGAG
GCCGTCACCG AGGCGCTGGG CGCCGACGTC CCACCGCGTG ACGCCGCCGA ACGCGTCACG
TTCGCCACCT GGGCGATCGT CACCGGCAAA TCGCCGGGCG GCATCTTCAA CGAGCTGCCG
AAGGTCGACG ACGCCACGGC GGCGAAGATG GCCGAACGGT TGACCGAACG AGCCGAAGGC
ACCATCACCG CCGACGACGT GAAGGCCGCC ACCACCATCG AGGATCTGGC CACCACGGTG
CGCGAACACC TCGAGGCGGG CAAGGTCGAC GGGTTCGTCC GCGTCCTGCG TGCGCCGCAG
GAAGGGAGCG ACCGGATCCC TGTGTTCGTG TTCCATCCGG CCGGCGGGTC GACGGTGGTG
TACGAACCGC TGATGAAGCG GCTGCCGCCG GACACCCCGA TCTACGGCAT CGAGCGGGTC
GAGGGCTCCG TGGAGGAGCG GGCCGCGGAG TACGTGCCGA AACTGCTCGA GATGAACGGG
TGGACCGAGG GCAGGTCCGG CGTGCCGTTC ATCCTGGCGG GCTGGTCGCT GGGCGGGGTG
CTGGCCTATG CGTGCGCCAT CGGGCTCAAG CAGGCCGGTG CGGACGTGCG GTTCGTCGGC
CTCATCGACG CGGTGCGCGC CGGTGAGGAG GTCCCGCAGA CCAAGGAGGA GACCCGCGCC
CGCTGGGAGC GCTACGCCCG GTTCGCCGAG CGCACCTTCA ACGTGCAGAT CCCGGAGATC
CCGTACGAGG AGCTGGAGAA CCTCGACGAC GAGGGTCAGG TGAGGTTCGT CATGGAGGCC
GTCGCGGCCA GCGGCGTGCA GATCCCCGGC GGCATCATCG AGCACCAGCG CACGTCGTAT
CTGGACAACC GGATGATCGA CACCGCGGAG ATCAAGCCGT ACGACGGTCA CGTCACCCTC
TACATGGCCG ACCGCTACCA CGACGACGCG ATCTACTTCG AACCGCGATA CGCCACCCGC
CAACCCGACG GCGGCTGGGG TGAGTATGTT TCGGAGCTGG AAGTGATCCC GATCGGCGGT
GAGCACATCC AGGCGATCGA CGAGCCGTAC ATCGCGAAGG TGGGTGCCCA CATGAGCGAG
GCGATCAACC GTATCGAGGC CCAGGGAAAG TAG
 
Protein sequence
MNMSETPNNS PSAENQPIVA QGEGGPLRPA QVDMTVAEMR EWLRNWIANA TGQNADNIDE 
QTAMVELGLS SRDAVAMASD IEDLTGVTLT ATVAFRHPTI ESLATVIIEG EPEVEHDDDG
TDWSRERDVD DIAIVGLATR FPGDMNTPDE MWEALLEGRD AITDLPEGRW EEFLGEPRIA
ERVAKAATRG GYLSDIKGFD AEFFALSKME ADNIDPQQRM ALELTWEALE HARIPASSLR
GERVGVYIGA SNNDYSFMSV ADPGVAHPYA ITGTTSSIIA NRVSYFYDFR GPSMAIDTAC
SSSLVAAHQG VAALRAGEAD VAVVGGVNAL ITPLVTIGFD EVGGVLAPDG RIKSFSQDAN
GYARSEGAGM LVLKRLSDAR RDGDEIYAVI AGSAVNHDGR SNGLLAPNPD AQAEVLRKAY
KDAGINPRDV DYIEAHGTGT ILGDPIEADA LGRVIGRSRP ADQPALLGAV KSNVGHLESA
AGAASLAKVA LSLRNDKLPP SINYTGPNPY IDFDAVRLKV NDTVSDWPRY SGHAIAGVSG
FGFGGANAHM VLREVLPSDL VEPEPEQVVE VTAEPNQPAV YVGGVRMDEY GEFVDEPLAR
RESGFDEDSD DDGLDRPAAA VEDDYELPGL TDEAKRLLEV AREELEAAEQ PVPLVPLAVS
AFLTSRKKSA AAELADWMDS EEGRASSLES IGRALSRRNH GRSRAVVMAR DHDEAIKGLR
ALAEGKQSPN VYSADGPVTN GPVWVLAGFG AQHRKMGKSL YLRNEVFADW INKVDSHVQD
ERGHSILELI LDDAVDYTDE TTELPIEKVQ LVIFAIQVAL GELLKHHGAK PGAVIGQSLG
EAAAAYFSGG LSLEDATRAI CSRSHLMGEG EAMLFGEYIR LMALVEYSAD EIKTVFSDYP
DLEVCVYAAP TQTVIGGPPE QVDAIIARAE QEGKFARKFQ TKGASHTSQM DPLLGELAAE
LVGITPHPLQ IGYYSTVHEG KFLRAGSEPI HDVDYWKKGL RHSVYFTHGI RNAVDNGHTT
FLELAPNPVA LMQVGLTTMS AGLHDGQLIA TLARKQDEVD SMTAAMAQLF VHGHDLDMRT
LFPRRSRGLA GALDYANIPP TRFRRKPHWL DVRFSGDNAG VMPGSHVATP DGRHVWEYSP
RGAVDAQALA ALVKSAASQV FPEAAVTAAE QRAVPGDGAR LVTTLTRHPG GASVQVHARM
DNGGESSFAL VYDAIVTRGG QAVALPAAVA TGTVAPQADS LTPAAEPEGG DAAILSDNLT
QGANLGAGLG KWSPDSGETI HDRLGTIVGG AMGYEPEDLP WEVPLIELGL DSLMAVRIKN
RVEYDFDLPP IQLTAVRDAN LYAVEQLITY AIEHRDEVDQ LAESQKGKTA EEIAAEQAEL
MGGASTVAEL EEKLAAAGHP LGEAASEQAA VMSGANLTTV TTPAPDPQAE PSTQQDSAIP
APPTDPSGPN IPPPPTNPAG PDTTAKSSAA KAAAQVLTQE AVTEALGADV PPRDAAERVT
FATWAIVTGK SPGGIFNELP KVDDATAAKM AERLTERAEG TITADDVKAA TTIEDLATTV
REHLEAGKVD GFVRVLRAPQ EGSDRIPVFV FHPAGGSTVV YEPLMKRLPP DTPIYGIERV
EGSVEERAAE YVPKLLEMNG WTEGRSGVPF ILAGWSLGGV LAYACAIGLK QAGADVRFVG
LIDAVRAGEE VPQTKEETRA RWERYARFAE RTFNVQIPEI PYEELENLDD EGQVRFVMEA
VAASGVQIPG GIIEHQRTSY LDNRMIDTAE IKPYDGHVTL YMADRYHDDA IYFEPRYATR
QPDGGWGEYV SELEVIPIGG EHIQAIDEPY IAKVGAHMSE AINRIEAQGK