Gene Mmcs_2835 details


Gene Information

Locus tag: Mmcs_2835
Symbol:
ID: 4111667
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 2994977
End bp: 2999719
Gene Length: 4743 bp
Protein Length: 1580 aa
Translation table: 11
GC content: 69%
IMG OID: 638031959
Product: beta-ketoacyl synthase
Protein accession: YP_639998
Protein GI: 108799801
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID:
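
The coordinates and lengths in the table above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 2994977
end_bp = 2999719
gene_length_bp = 4743
protein_length_aa = 1580


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Both comparisons hold for this record: 2999719 - 2994977 + 1 = 4743 bp, and 4743 / 3 - 1 = 1580 aa.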


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACACCGG CCACGCTCGA CGACGCGAGG CTGCGCGACT GGTTGGTCAC CTATCTGACC 
ACGCACGTCG AGTGCTCACC CGAGAGCATC GACTTCGACG CGTCGATGGC CGACCTCGGG
GTCGGCTCGC GCGATGCCGT CGTCCTGTCC GGTGAACTGG CGGAACTGCT CGGCAGGCCG
GTCTCGCCCG TCGACTTCTG GCAGCACCCG ACGATCAACA GTCTGATCGA GTTCCTCAGT
GCACCCGTCA CCGAGGTGGA GACGCAGGCC GTGGTCGAGG GGCCGAGGAT TTCGGGCACC
GAACCGATCG CCGTCATCGG GTTGGGCTGC CGCATGCCCG GCGGGATCTC CGATCCAGAT
GCGCTGTGGG ATCTACTCGC CGACGGCCGC TGCGCGGTGG GGAAGGTGCC GCCCGAACGC
TGGCAACCGT TCGACGACGG TTCCCCCGAG GTGGCGTCGG CACTGGCGGG AACCACGCGT
TGGGGGTCGT TCCTCGACGA CATCGCGGGC TTCGACGCGG ATTTCTTCGA CATCTCTTCG
CGTGAAGCCG TCAAGATGGA TCCGCAGCAG CGCCTGCTGC TGGAAGTGGC CTGGGAGGCG
CTGGAGCACG CCGGGATCCC CGCCGCATCG CTGCGCCGTT CGCAGACAGG CGTTTTCGCC
GGCGCATGCT TCAGCGAGTA CGGCTACCTC GCATCGACGG ATCTGCCGCG GGTCGACGCA
TGGAGTAACA CCGGCGGGGC GTTGAGCATC ATCGCCAACC GGCTCTCCTA CTTCCTCGAC
CTCCGTGGGC CGTCGATCAC GGTCGACACG GCCTGCTCGT CCTCGCTGGT CGCCGTCCAC
CTGGCCTGCC AGAGCCTGCG GTCGGGCGAC TCGAACCTCG CACTCGCAGC GGGGGTGAAC
CTGCTGCTCT CACCCGCCGT CTTCCACGGC TTCGATCAGG CCGGCGCCCT GTCACCCACG
GGAATGTGCC ATTCCTTCGA CGCGGCCGCC GACGGTTTCG TCCGTGGCGA AGGCTGCGGC
GTGGTCGTGC TCAAGCGGCT CCCGGATGCA CTGCGTGACG GTGACCGGGT GCTCGCCGTG
GTGCGCGGTT CGGCGATCAA CCAGGACGGC CGGTCCAACG GCCTGATGGC GCCGAACCCG
GCCGCGCAGA TGGCGGTGCT CCGGTCTGCG TGTGCGAACG CCGGCATCGA ACCGCAGGAC
ATGGACTACG TGGAGGCGCA CGGAACCGGC ACCTTCCTGG GTGACCCGAT CGAGGCCAGG
GCCCTCGGCT CGGTGATGGG CCGCGGGCGG CCGGCGACCT CGCCGCTGCT CGTCGGTGCG
GTCAAATCCA ACCTCGGGCA TCTCGAGGCC GCCGCCGGTG TGGCCGGATT CATCAAGACG
GTGATGGCGC TGCAGCGCGG CCGGATTCCC GGCAACGCCG GCTACGAGTC GCCGAATCCC
CACATCCCGT TCGACCAACT GCGCTTGAAA GTCGTTGACC ACGAACAAGA GTGGCCATCC
GTGTCGCGCG CACGCCGCGC CGGGGTGTCG TCGTTCGGTT TCGGCGGCAC CAACGCCCAC
GTCATCCTCG AGCAGGCGCC GGACGCGATC GCGGCCGAAC CGCACCCCGC CGCTGCGGTG
AGCACGTTGA TCGTGTCGGG TAAGTCCCCC GAGCGGATCG AAGCCGCCGC CGCCGCAGTG
GCCGAGTGGA TGTCCGGTCC CGGTGCGGGC GTCGCGCTGG GCGATGTGGC CCACACCCTC
AACCATCACC GCGCCCACCA CCAGTCCTTC GCCACGGTCT GTGCCCGGGA CGGCGTCGAC
GCCGTGGCAG GTCTGCAGGC GCTGGCCGCG CGCCTGCCCG CCGATGGCGT GGTGAAACCC
CATGAGGGGC CGTGTGGTTC GGGGACGGTG TTCGTGTTCT CGGGTCAGGG GTCGCAGTGG
GCCGGGATGG GTCGGCGGCT GTTGGCCGAT GAGCCGGTGT TCGCGGCGGC GGTGGCCGAG
TTGGAGCCGG TGTTCGTCGA GCAGGTCGGG TTCTCGCTGG CTCAGGTGCT CGCCGATGGT
GAGGCGGTCA CCGGGGATGC TCGGGTGCAG CCGGTGATCA TGGGGCTGCA GTTGGCGCTG
ACCGAGCTGT GGCGCTCCTA CGGGGTGACC CCGGATGCGG TGATCGGCCA CTCGATGGGT
GAGGTCACCG CGGCCGTCGT CGCCGGTGCG CTGAGCCCCA CCGAAGGTCT GAGGGTCATC
GCAGTGCGCT CGCGGCTGAT GTCCCGGCTG GCAGGCCAGG GCGCGGTCGC GCTGCTGACA
CTGGGCGCCG ATGCGGCGGA GGCGCTGATC GCCGATCATC CGGACGTCGC GGTGGCCGGG
TATGTGTCAC CGGGGCAGAC GGTCGTCGCC GGTCCGCCCG CAGAGGTCGA CGCGGTGATC
GCCGCGGTGC AGAGCCAGAA CCGGTTCGCC CGCCGGGTGA ACATGGAAGT CGCCTCCCAT
ACCGCCCTGA TGGATCCGAT CCTCGACGAA CTGCGGTCCG AACTGGCCGA CCTCACGCCG
AACACGACTG CGATTCCGTT CATCTCGACG GTCGAGGACA GCGCGACCCC GCTGCTGGAT
GCGGACTATT GGGTGGCCAA CGTGCGGCGG CCGGTACGGC TGAGCCAGGC GTTGGCCACC
GCCGCCGAGA GCCACACCAC ATTCGTCGAG ATCAGCGCGC ACCCGATGCT GACCACCGCG
GTGACCGAGA CGCTCGGCGA CCTGCACCAC CACGCGCTGG GCACGCTGTC TCGGGATACC
GACGACACCG TCACCTTCCA CACCAACCTG AACACCACTC ACACCACGCA TCCGCCGGTC
ACACCGCACC CGCCCGAACC ACACCCGGTG CTGCCCGCCA CGCCGTGGCA GCACAACCGG
TACTGGATGG ACCTGACTCC ACTGCGTCGC ACCGCGACTG ACGCTGCGCC GCAGGGGGAT
TCATCAGCGG GGGTGCTGCC CGCGGAGTGG AACTGCGAGC TGACGTGGCC GAGCCGGCCA
GTCGCCGGCG GGGAGCGCGT CGCCGGATCG TGGCTGGTCG TCGGGAACGC GGCTCTGGCA
GCCCAGATCC GGCGAGATCT GGGAGCCGGC GCAGAGGTGG CAGTCCTCGA CGAAGACACG
CCGGACACCC GGCTCGAGGA TGCGCTGGCC GCCGCCGACC ACGTGGTCTA TGCGCCCGCG
GTGCCTGCCG TTTTCGATGC CGCGCAGGGC CGTCGGCTCT TCGACGTCGC CCGTCGCATC
GCGGTCGCGA TGGCGAGGAT GACCGACCCG GGCCGCCTCA TCCTGCTGAC CCGCAACGCC
CAGCCCGTCA CCGAAGGCGA CCGCGCCAAC CCGGCACACG CGGTGCTGTG GGGTCTGGGC
CGCACTCTCG CGCTCGAGCA CCCCGAGATC TGGGACGCCG TGATCGATCT CGACGAGTTG
GTCCCAGACC GGTTGGCCGC CCGCTACCTG CTCGCCGAGG CGACGGCCGA GGGCGGCGAG
GACCAGGTCG TCTATCGCGA CGGGACGCGC CGGGTGGCCC GGTTACGCCG AACCCCGCTG
TCGCAGGCAT CCGGTGACGG GCTCGATCCG GCCGGCAGCC ACCTGGTCGT CGGGGCGACC
GGCAACATCG GCCCGCACCT GATCCAGCAG TTGGCCGATA TGGGGGCCAA GACCGTCGTC
GCGGTATCTC GGAACCCCGG TGACCGGCTG CGCGAACTCG GCGACACCCT CGCCGCGCGG
GGCGTCACCC TGGTCACCGT GGCCGCCGAC GCCGCCGACG AAGAGTCGAT GCGCGCGGTG
TTCGACCGCT TCGGCGCCGA TCTGCCCCCG CTGGCCGGAA TCTATCTGGC CGCCTTCGGG
GGAGGGCCGG TCATGTTGGC CGAAATGACC GACGACGACA TCACCGCGAT GTTCGCGCCC
AAGCTCGACG CGGTGGCGGT ACTGCACAGG CTGTCGCTGA CCACCGACGT CCAGCAATTC
GTGCTGTTCT CGTCGATCTC GGGGATTCTG GGCTCGCGAT GGCTGGCCCA TTACACCGCG
ACCACCACGT ACCTCGACGC CTTCGCCTAT GCGCGACGCG CCGCGGGACT GCCCGCCACC
GCCGTCAACT GGGGTCTGTG GAAGTCGTTG GCCGACAACT ACAGTGAGCA CGAACGGCAG
ATCACCGTGG AGTCCGGCCT CGAACCGATG CCCGACGAGG TGGCGATCCA GGCGTTGTGG
TCGATAACCG CGCCCGGCAC ACCCGCCCGC TCGACCGTGG TCGCCGCGGA CTGGCCGCGG
CTGGCCGCGG CCTACCGGAC GCGCGCCGCA CTGCGGATCG TCGACGAGTT GCTGCCGGTC
GAGAGCACCG ACGACGAACG CGCCGACACC CCGACGTCGG TTCCGGAGAC CGAATTCCGC
CGTGAACTGC GCGCATGCCC CGCCGACGAG CGAGGGTATC TGCTCAGCAC CCACATCCGT
GCGCTCGTCG CATCGTCGAT GGGGTTGTCC AGCGCCCAGC TGGTGGACCC GTCCGCGGGC
TTCTTCCAGT GCGGGATGGA CTCGCTGATG AGCGTCACCC TCAAGCGTGA GCTCGGCGAG
AGCCTCGGTG AGAGCCTGCC GGCGTCGGTG ATCTTCGATT ACCCGACCGT CGACGGACTC
ACCGAATACC TCGCCACGGT ATTGCCCGAA ATGCTCGAGA TCGCCGACGA AAGCGACGTC
GACGACTACG ACGAGTTCAG CGACGACGAA CTGCTCCAAC AACTCTCGGA AAGGTTGAGC
TGA
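
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before the sequence is used. As a minimal sketch, the helpers below (hypothetical names, not taken from any library) clean such text and compute a GC percentage of the kind reported in the GC content field. Only the first line of the sequence is used here, so the printed value describes that line alone and not the whole gene.

```python
def clean_sequence(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq_text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in seq)
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as printed (blocks of 10 bases).
first_line = "ATGACACCGG CCACGCTCGA CGACGCGAGG CTGCGCGACT GGTTGGTCAC CTATCTGACC"

print(len(clean_sequence(first_line)))   # number of bases on that line
print(round(gc_percent(first_line), 1))  # GC percentage of that line only
```

To obtain the figure for the entire gene, pass the full multi-line sequence text to `gc_percent`.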
 
Protein sequence
MTPATLDDAR LRDWLVTYLT THVECSPESI DFDASMADLG VGSRDAVVLS GELAELLGRP 
VSPVDFWQHP TINSLIEFLS APVTEVETQA VVEGPRISGT EPIAVIGLGC RMPGGISDPD
ALWDLLADGR CAVGKVPPER WQPFDDGSPE VASALAGTTR WGSFLDDIAG FDADFFDISS
REAVKMDPQQ RLLLEVAWEA LEHAGIPAAS LRRSQTGVFA GACFSEYGYL ASTDLPRVDA
WSNTGGALSI IANRLSYFLD LRGPSITVDT ACSSSLVAVH LACQSLRSGD SNLALAAGVN
LLLSPAVFHG FDQAGALSPT GMCHSFDAAA DGFVRGEGCG VVVLKRLPDA LRDGDRVLAV
VRGSAINQDG RSNGLMAPNP AAQMAVLRSA CANAGIEPQD MDYVEAHGTG TFLGDPIEAR
ALGSVMGRGR PATSPLLVGA VKSNLGHLEA AAGVAGFIKT VMALQRGRIP GNAGYESPNP
HIPFDQLRLK VVDHEQEWPS VSRARRAGVS SFGFGGTNAH VILEQAPDAI AAEPHPAAAV
STLIVSGKSP ERIEAAAAAV AEWMSGPGAG VALGDVAHTL NHHRAHHQSF ATVCARDGVD
AVAGLQALAA RLPADGVVKP HEGPCGSGTV FVFSGQGSQW AGMGRRLLAD EPVFAAAVAE
LEPVFVEQVG FSLAQVLADG EAVTGDARVQ PVIMGLQLAL TELWRSYGVT PDAVIGHSMG
EVTAAVVAGA LSPTEGLRVI AVRSRLMSRL AGQGAVALLT LGADAAEALI ADHPDVAVAG
YVSPGQTVVA GPPAEVDAVI AAVQSQNRFA RRVNMEVASH TALMDPILDE LRSELADLTP
NTTAIPFIST VEDSATPLLD ADYWVANVRR PVRLSQALAT AAESHTTFVE ISAHPMLTTA
VTETLGDLHH HALGTLSRDT DDTVTFHTNL NTTHTTHPPV TPHPPEPHPV LPATPWQHNR
YWMDLTPLRR TATDAAPQGD SSAGVLPAEW NCELTWPSRP VAGGERVAGS WLVVGNAALA
AQIRRDLGAG AEVAVLDEDT PDTRLEDALA AADHVVYAPA VPAVFDAAQG RRLFDVARRI
AVAMARMTDP GRLILLTRNA QPVTEGDRAN PAHAVLWGLG RTLALEHPEI WDAVIDLDEL
VPDRLAARYL LAEATAEGGE DQVVYRDGTR RVARLRRTPL SQASGDGLDP AGSHLVVGAT
GNIGPHLIQQ LADMGAKTVV AVSRNPGDRL RELGDTLAAR GVTLVTVAAD AADEESMRAV
FDRFGADLPP LAGIYLAAFG GGPVMLAEMT DDDITAMFAP KLDAVAVLHR LSLTTDVQQF
VLFSSISGIL GSRWLAHYTA TTTYLDAFAY ARRAAGLPAT AVNWGLWKSL ADNYSEHERQ
ITVESGLEPM PDEVAIQALW SITAPGTPAR STVVAADWPR LAAAYRTRAA LRIVDELLPV
ESTDDERADT PTSVPETEFR RELRACPADE RGYLLSTHIR ALVASSMGLS SAQLVDPSAG
FFQCGMDSLM SVTLKRELGE SLGESLPASV IFDYPTVDGL TEYLATVLPE MLEIADESDV
DDYDEFSDDE LLQQLSERLS
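
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following sketch illustrates this for the first 30 bases and for the last four codons. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons). The `translate` function and the table construction are illustrative and are not the database's own pipeline.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str, stop_at_stop: bool = True) -> str:
    """Translate a DNA sequence read in frame from its first base.

    Whitespace is ignored. Ambiguous bases such as N raise KeyError.
    An alternative start codon (GTG or TTG) would be read as V or L here,
    not as M; this gene starts with ATG, so that case does not arise.
    """
    seq = "".join(seq_text.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and stop_at_stop:
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and the last four codons of the gene sequence.
print(translate("ATGACACCGG CCACGCTCGA CGACGCGAGG"))  # MTPATLDDAR
print(translate("AGGTTGAGC TGA"))                      # RLS
```

The first result matches the opening block of the protein sequence (MTPATLDDAR), and the second matches its final three residues, with the terminal TGA read as a stop codon.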