Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmcs_2835 |
Symbol | |
ID | 4111667 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | - |
Start bp | 2994977 |
End bp | 2999719 |
Gene Length | 4743 bp |
Protein Length | 1580 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638031959 |
Product | beta-ketoacyl synthase |
Protein accession | YP_639998 |
Protein GI | 108799801 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACCGG CCACGCTCGA CGACGCGAGG CTGCGCGACT GGTTGGTCAC CTATCTGACC ACGCACGTCG AGTGCTCACC CGAGAGCATC GACTTCGACG CGTCGATGGC CGACCTCGGG GTCGGCTCGC GCGATGCCGT CGTCCTGTCC GGTGAACTGG CGGAACTGCT CGGCAGGCCG GTCTCGCCCG TCGACTTCTG GCAGCACCCG ACGATCAACA GTCTGATCGA GTTCCTCAGT GCACCCGTCA CCGAGGTGGA GACGCAGGCC GTGGTCGAGG GGCCGAGGAT TTCGGGCACC GAACCGATCG CCGTCATCGG GTTGGGCTGC CGCATGCCCG GCGGGATCTC CGATCCAGAT GCGCTGTGGG ATCTACTCGC CGACGGCCGC TGCGCGGTGG GGAAGGTGCC GCCCGAACGC TGGCAACCGT TCGACGACGG TTCCCCCGAG GTGGCGTCGG CACTGGCGGG AACCACGCGT TGGGGGTCGT TCCTCGACGA CATCGCGGGC TTCGACGCGG ATTTCTTCGA CATCTCTTCG CGTGAAGCCG TCAAGATGGA TCCGCAGCAG CGCCTGCTGC TGGAAGTGGC CTGGGAGGCG CTGGAGCACG CCGGGATCCC CGCCGCATCG CTGCGCCGTT CGCAGACAGG CGTTTTCGCC GGCGCATGCT TCAGCGAGTA CGGCTACCTC GCATCGACGG ATCTGCCGCG GGTCGACGCA TGGAGTAACA CCGGCGGGGC GTTGAGCATC ATCGCCAACC GGCTCTCCTA CTTCCTCGAC CTCCGTGGGC CGTCGATCAC GGTCGACACG GCCTGCTCGT CCTCGCTGGT CGCCGTCCAC CTGGCCTGCC AGAGCCTGCG GTCGGGCGAC TCGAACCTCG CACTCGCAGC GGGGGTGAAC CTGCTGCTCT CACCCGCCGT CTTCCACGGC TTCGATCAGG CCGGCGCCCT GTCACCCACG GGAATGTGCC ATTCCTTCGA CGCGGCCGCC GACGGTTTCG TCCGTGGCGA AGGCTGCGGC GTGGTCGTGC TCAAGCGGCT CCCGGATGCA CTGCGTGACG GTGACCGGGT GCTCGCCGTG GTGCGCGGTT CGGCGATCAA CCAGGACGGC CGGTCCAACG GCCTGATGGC GCCGAACCCG GCCGCGCAGA TGGCGGTGCT CCGGTCTGCG TGTGCGAACG CCGGCATCGA ACCGCAGGAC ATGGACTACG TGGAGGCGCA CGGAACCGGC ACCTTCCTGG GTGACCCGAT CGAGGCCAGG GCCCTCGGCT CGGTGATGGG CCGCGGGCGG CCGGCGACCT CGCCGCTGCT CGTCGGTGCG GTCAAATCCA ACCTCGGGCA TCTCGAGGCC GCCGCCGGTG TGGCCGGATT CATCAAGACG GTGATGGCGC TGCAGCGCGG CCGGATTCCC GGCAACGCCG GCTACGAGTC GCCGAATCCC CACATCCCGT TCGACCAACT GCGCTTGAAA GTCGTTGACC ACGAACAAGA GTGGCCATCC GTGTCGCGCG CACGCCGCGC CGGGGTGTCG TCGTTCGGTT TCGGCGGCAC CAACGCCCAC GTCATCCTCG AGCAGGCGCC GGACGCGATC GCGGCCGAAC CGCACCCCGC CGCTGCGGTG AGCACGTTGA TCGTGTCGGG TAAGTCCCCC GAGCGGATCG AAGCCGCCGC CGCCGCAGTG GCCGAGTGGA TGTCCGGTCC CGGTGCGGGC GTCGCGCTGG GCGATGTGGC CCACACCCTC AACCATCACC GCGCCCACCA CCAGTCCTTC GCCACGGTCT GTGCCCGGGA CGGCGTCGAC GCCGTGGCAG GTCTGCAGGC GCTGGCCGCG CGCCTGCCCG CCGATGGCGT GGTGAAACCC CATGAGGGGC CGTGTGGTTC GGGGACGGTG TTCGTGTTCT CGGGTCAGGG GTCGCAGTGG GCCGGGATGG GTCGGCGGCT GTTGGCCGAT GAGCCGGTGT TCGCGGCGGC GGTGGCCGAG TTGGAGCCGG TGTTCGTCGA GCAGGTCGGG TTCTCGCTGG CTCAGGTGCT CGCCGATGGT GAGGCGGTCA CCGGGGATGC TCGGGTGCAG CCGGTGATCA TGGGGCTGCA GTTGGCGCTG ACCGAGCTGT GGCGCTCCTA CGGGGTGACC CCGGATGCGG TGATCGGCCA CTCGATGGGT GAGGTCACCG CGGCCGTCGT CGCCGGTGCG CTGAGCCCCA CCGAAGGTCT GAGGGTCATC GCAGTGCGCT CGCGGCTGAT GTCCCGGCTG GCAGGCCAGG GCGCGGTCGC GCTGCTGACA CTGGGCGCCG ATGCGGCGGA GGCGCTGATC GCCGATCATC CGGACGTCGC GGTGGCCGGG TATGTGTCAC CGGGGCAGAC GGTCGTCGCC GGTCCGCCCG CAGAGGTCGA CGCGGTGATC GCCGCGGTGC AGAGCCAGAA CCGGTTCGCC CGCCGGGTGA ACATGGAAGT CGCCTCCCAT ACCGCCCTGA TGGATCCGAT CCTCGACGAA CTGCGGTCCG AACTGGCCGA CCTCACGCCG AACACGACTG CGATTCCGTT CATCTCGACG GTCGAGGACA GCGCGACCCC GCTGCTGGAT GCGGACTATT GGGTGGCCAA CGTGCGGCGG CCGGTACGGC TGAGCCAGGC GTTGGCCACC GCCGCCGAGA GCCACACCAC ATTCGTCGAG ATCAGCGCGC ACCCGATGCT GACCACCGCG GTGACCGAGA CGCTCGGCGA CCTGCACCAC CACGCGCTGG GCACGCTGTC TCGGGATACC GACGACACCG TCACCTTCCA CACCAACCTG AACACCACTC ACACCACGCA TCCGCCGGTC ACACCGCACC CGCCCGAACC ACACCCGGTG CTGCCCGCCA CGCCGTGGCA GCACAACCGG TACTGGATGG ACCTGACTCC ACTGCGTCGC ACCGCGACTG ACGCTGCGCC GCAGGGGGAT TCATCAGCGG GGGTGCTGCC CGCGGAGTGG AACTGCGAGC TGACGTGGCC GAGCCGGCCA GTCGCCGGCG GGGAGCGCGT CGCCGGATCG TGGCTGGTCG TCGGGAACGC GGCTCTGGCA GCCCAGATCC GGCGAGATCT GGGAGCCGGC GCAGAGGTGG CAGTCCTCGA CGAAGACACG CCGGACACCC GGCTCGAGGA TGCGCTGGCC GCCGCCGACC ACGTGGTCTA TGCGCCCGCG GTGCCTGCCG TTTTCGATGC CGCGCAGGGC CGTCGGCTCT TCGACGTCGC CCGTCGCATC GCGGTCGCGA TGGCGAGGAT GACCGACCCG GGCCGCCTCA TCCTGCTGAC CCGCAACGCC CAGCCCGTCA CCGAAGGCGA CCGCGCCAAC CCGGCACACG CGGTGCTGTG GGGTCTGGGC CGCACTCTCG CGCTCGAGCA CCCCGAGATC TGGGACGCCG TGATCGATCT CGACGAGTTG GTCCCAGACC GGTTGGCCGC CCGCTACCTG CTCGCCGAGG CGACGGCCGA GGGCGGCGAG GACCAGGTCG TCTATCGCGA CGGGACGCGC CGGGTGGCCC GGTTACGCCG AACCCCGCTG TCGCAGGCAT CCGGTGACGG GCTCGATCCG GCCGGCAGCC ACCTGGTCGT CGGGGCGACC GGCAACATCG GCCCGCACCT GATCCAGCAG TTGGCCGATA TGGGGGCCAA GACCGTCGTC GCGGTATCTC GGAACCCCGG TGACCGGCTG CGCGAACTCG GCGACACCCT CGCCGCGCGG GGCGTCACCC TGGTCACCGT GGCCGCCGAC GCCGCCGACG AAGAGTCGAT GCGCGCGGTG TTCGACCGCT TCGGCGCCGA TCTGCCCCCG CTGGCCGGAA TCTATCTGGC CGCCTTCGGG GGAGGGCCGG TCATGTTGGC CGAAATGACC GACGACGACA TCACCGCGAT GTTCGCGCCC AAGCTCGACG CGGTGGCGGT ACTGCACAGG CTGTCGCTGA CCACCGACGT CCAGCAATTC GTGCTGTTCT CGTCGATCTC GGGGATTCTG GGCTCGCGAT GGCTGGCCCA TTACACCGCG ACCACCACGT ACCTCGACGC CTTCGCCTAT GCGCGACGCG CCGCGGGACT GCCCGCCACC GCCGTCAACT GGGGTCTGTG GAAGTCGTTG GCCGACAACT ACAGTGAGCA CGAACGGCAG ATCACCGTGG AGTCCGGCCT CGAACCGATG CCCGACGAGG TGGCGATCCA GGCGTTGTGG TCGATAACCG CGCCCGGCAC ACCCGCCCGC TCGACCGTGG TCGCCGCGGA CTGGCCGCGG CTGGCCGCGG CCTACCGGAC GCGCGCCGCA CTGCGGATCG TCGACGAGTT GCTGCCGGTC GAGAGCACCG ACGACGAACG CGCCGACACC CCGACGTCGG TTCCGGAGAC CGAATTCCGC CGTGAACTGC GCGCATGCCC CGCCGACGAG CGAGGGTATC TGCTCAGCAC CCACATCCGT GCGCTCGTCG CATCGTCGAT GGGGTTGTCC AGCGCCCAGC TGGTGGACCC GTCCGCGGGC TTCTTCCAGT GCGGGATGGA CTCGCTGATG AGCGTCACCC TCAAGCGTGA GCTCGGCGAG AGCCTCGGTG AGAGCCTGCC GGCGTCGGTG ATCTTCGATT ACCCGACCGT CGACGGACTC ACCGAATACC TCGCCACGGT ATTGCCCGAA ATGCTCGAGA TCGCCGACGA AAGCGACGTC GACGACTACG ACGAGTTCAG CGACGACGAA CTGCTCCAAC AACTCTCGGA AAGGTTGAGC TGA
|
Protein sequence | MTPATLDDAR LRDWLVTYLT THVECSPESI DFDASMADLG VGSRDAVVLS GELAELLGRP VSPVDFWQHP TINSLIEFLS APVTEVETQA VVEGPRISGT EPIAVIGLGC RMPGGISDPD ALWDLLADGR CAVGKVPPER WQPFDDGSPE VASALAGTTR WGSFLDDIAG FDADFFDISS REAVKMDPQQ RLLLEVAWEA LEHAGIPAAS LRRSQTGVFA GACFSEYGYL ASTDLPRVDA WSNTGGALSI IANRLSYFLD LRGPSITVDT ACSSSLVAVH LACQSLRSGD SNLALAAGVN LLLSPAVFHG FDQAGALSPT GMCHSFDAAA DGFVRGEGCG VVVLKRLPDA LRDGDRVLAV VRGSAINQDG RSNGLMAPNP AAQMAVLRSA CANAGIEPQD MDYVEAHGTG TFLGDPIEAR ALGSVMGRGR PATSPLLVGA VKSNLGHLEA AAGVAGFIKT VMALQRGRIP GNAGYESPNP HIPFDQLRLK VVDHEQEWPS VSRARRAGVS SFGFGGTNAH VILEQAPDAI AAEPHPAAAV STLIVSGKSP ERIEAAAAAV AEWMSGPGAG VALGDVAHTL NHHRAHHQSF ATVCARDGVD AVAGLQALAA RLPADGVVKP HEGPCGSGTV FVFSGQGSQW AGMGRRLLAD EPVFAAAVAE LEPVFVEQVG FSLAQVLADG EAVTGDARVQ PVIMGLQLAL TELWRSYGVT PDAVIGHSMG EVTAAVVAGA LSPTEGLRVI AVRSRLMSRL AGQGAVALLT LGADAAEALI ADHPDVAVAG YVSPGQTVVA GPPAEVDAVI AAVQSQNRFA RRVNMEVASH TALMDPILDE LRSELADLTP NTTAIPFIST VEDSATPLLD ADYWVANVRR PVRLSQALAT AAESHTTFVE ISAHPMLTTA VTETLGDLHH HALGTLSRDT DDTVTFHTNL NTTHTTHPPV TPHPPEPHPV LPATPWQHNR YWMDLTPLRR TATDAAPQGD SSAGVLPAEW NCELTWPSRP VAGGERVAGS WLVVGNAALA AQIRRDLGAG AEVAVLDEDT PDTRLEDALA AADHVVYAPA VPAVFDAAQG RRLFDVARRI AVAMARMTDP GRLILLTRNA QPVTEGDRAN PAHAVLWGLG RTLALEHPEI WDAVIDLDEL VPDRLAARYL LAEATAEGGE DQVVYRDGTR RVARLRRTPL SQASGDGLDP AGSHLVVGAT GNIGPHLIQQ LADMGAKTVV AVSRNPGDRL RELGDTLAAR GVTLVTVAAD AADEESMRAV FDRFGADLPP LAGIYLAAFG GGPVMLAEMT DDDITAMFAP KLDAVAVLHR LSLTTDVQQF VLFSSISGIL GSRWLAHYTA TTTYLDAFAY ARRAAGLPAT AVNWGLWKSL ADNYSEHERQ ITVESGLEPM PDEVAIQALW SITAPGTPAR STVVAADWPR LAAAYRTRAA LRIVDELLPV ESTDDERADT PTSVPETEFR RELRACPADE RGYLLSTHIR ALVASSMGLS SAQLVDPSAG FFQCGMDSLM SVTLKRELGE SLGESLPASV IFDYPTVDGL TEYLATVLPE MLEIADESDV DDYDEFSDDE LLQQLSERLS
|
| |