Gene Information |
Locus tag | Mmcs_5010 |
Symbol | |
ID | 4113839 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | - |
Start bp | 5304292 |
End bp | 5309844 |
Gene Length | 5553 bp |
Protein Length | 1850 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638034168 |
Product | mycolic acid condensase |
Protein accession | YP_642170 |
Protein GI | 108801973 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
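The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only; it assumes the Start/End coordinates are 1-based and inclusive, and that the reported gene length includes the terminal stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
start_bp = 5304292
end_bp = 5309844
gene_length_bp = 5553
protein_length_aa = 1850


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the fields reported in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions both checks print True: 5309844 - 5304292 + 1 = 5553 bp, and 5553 / 3 - 1 = 1850 aa.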
| |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.398099 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACATGT CTGAAACACC GAACAATTCG CCCTCCGCTG AGAATCAGCC GATCGTCGCC CAGGGCGAGG GTGGCCCGCT GCGTCCCGCG CAGGTCGACA TGACCGTCGC CGAGATGCGC GAATGGCTGC GCAACTGGAT CGCCAACGCG ACCGGCCAGA ACGCCGACAA CATCGACGAG CAGACCGCGA TGGTGGAACT CGGCCTGTCC TCGCGCGACG CGGTCGCGAT GGCCAGCGAC ATCGAGGACC TCACCGGTGT CACGCTGACG GCGACCGTGG CGTTCCGCCA TCCGACCATC GAGTCGTTGG CGACGGTGAT CATCGAGGGT GAGCCCGAGG TCGAGCACGA CGACGACGGT ACCGACTGGA GCCGTGAGCG CGACGTCGAC GACATCGCGA TCGTGGGTCT CGCCACCCGC TTCCCGGGCG ATATGAACAC CCCCGACGAG ATGTGGGAGG CGCTGCTCGA GGGTCGCGAT GCGATCACCG ATCTGCCCGA GGGCCGGTGG GAGGAGTTCC TCGGTGAGCC GCGGATCGCC GAGCGGGTCG CCAAGGCGGC CACCCGCGGC GGCTACCTGT CGGACATCAA GGGTTTCGAC GCCGAGTTCT TCGCGCTGTC GAAGATGGAG GCCGACAACA TCGATCCGCA GCAGCGGATG GCGCTCGAGC TGACGTGGGA GGCGCTCGAA CACGCCCGCA TCCCGGCGTC GAGCCTGCGC GGTGAGCGGG TCGGCGTGTA CATCGGCGCG TCGAACAACG ACTACAGCTT CATGTCCGTG GCCGATCCGG GCGTCGCGCA CCCGTACGCG ATCACCGGCA CCACCAGCTC GATCATCGCC AACCGGGTGT CGTACTTCTA CGACTTCCGC GGCCCGTCGA TGGCGATCGA CACCGCATGT TCGAGTTCGC TGGTCGCCGC CCACCAGGGG GTGGCCGCGC TGCGCGCAGG CGAGGCCGAC GTCGCGGTGG TCGGCGGCGT CAACGCGCTG ATCACCCCGC TGGTGACCAT CGGGTTCGAC GAGGTCGGCG GTGTGCTCGC ACCCGACGGC CGGATCAAGT CGTTCTCGCA GGACGCCAAC GGCTACGCCC GCTCCGAGGG CGCAGGCATG CTGGTGCTCA AACGCCTCTC CGATGCGCGG CGCGACGGTG ACGAGATCTA CGCCGTGATC GCGGGCAGCG CGGTGAACCA CGACGGCCGG TCCAACGGTC TGCTGGCGCC CAACCCGGAC GCGCAGGCCG AGGTGCTGCG CAAGGCCTAC AAGGACGCGG GCATCAACCC GCGCGACGTC GACTACATCG AGGCGCACGG CACCGGCACC ATCCTCGGCG ACCCGATCGA GGCCGACGCG CTCGGCCGGG TGATCGGACG CAGTCGGCCG GCCGACCAGC CCGCCCTGCT GGGTGCGGTG AAATCCAATG TGGGACACCT GGAGTCGGCG GCAGGCGCGG CCAGCCTGGC GAAGGTGGCG CTGTCGCTGC GCAACGACAA GCTGCCCCCA TCGATCAACT ACACCGGCCC GAACCCCTAC ATCGACTTCG ATGCCGTGCG GTTGAAGGTC AACGACACGG TCAGTGACTG GCCGCGCTAC AGCGGGCACG CCATCGCCGG CGTCTCCGGC TTCGGCTTCG GCGGCGCCAA CGCGCACATG GTGCTGCGCG AGGTGCTGCC CAGCGACCTG GTCGAGCCCG AACCCGAGCA GGTGGTCGAG GTGACCGCCG AGCCGAATCA GCCCGCCGTG TACGTGGGCG GGGTGCGGAT GGACGAGTAC GGCGAGTTCG TCGATGAGCC GCTTGCGCGA 
AGAGAATCAG GCTTCGATGA GGACTCAGAC GACGACGGTC TCGACCGGCC CGCCGCAGCG GTCGAGGACG ACTACGAGCT GCCCGGACTG ACCGACGAGG CCAAGCGTCT GCTCGAGGTC GCGCGCGAAG AGCTCGAAGC GGCCGAACAG CCCGTTCCGC TTGTGCCGCT GGCGGTTTCG GCGTTCCTGA CCTCACGCAA GAAGTCCGCG GCCGCCGAGC TGGCCGACTG GATGGACAGT GAAGAGGGCC GGGCGTCGTC GCTGGAGTCG ATCGGCCGCG CGCTGTCGCG CCGCAACCAC GGCCGCTCCC GCGCGGTCGT GATGGCCCGC GATCACGACG AGGCGATCAA GGGTCTGCGC GCGCTGGCCG AGGGCAAGCA GAGCCCGAAC GTCTACAGCG CCGACGGTCC CGTGACGAAC GGGCCGGTCT GGGTGCTCGC CGGTTTCGGC GCCCAGCACC GCAAGATGGG TAAGAGCCTC TACCTGCGCA ACGAGGTGTT CGCCGACTGG ATCAACAAGG TCGACTCCCA CGTGCAGGAC GAGCGGGGGC ACTCGATCCT CGAGCTCATC CTCGACGATG CCGTCGACTA CACCGACGAG ACCACCGAGC TGCCGATCGA GAAGGTGCAG CTGGTCATCT TCGCGATCCA GGTGGCGCTG GGCGAACTGC TCAAGCATCA CGGCGCCAAG CCGGGTGCGG TCATCGGCCA GTCGCTCGGT GAGGCCGCCG CGGCCTACTT CTCGGGCGGC CTGTCGCTCG AGGACGCCAC CCGCGCCATC TGTTCGCGCA GCCACCTCAT GGGTGAGGGC GAGGCAATGC TGTTCGGCGA GTACATCCGC TTGATGGCGC TGGTGGAGTA CTCCGCCGAC GAGATCAAGA CGGTGTTCTC CGACTATCCC GACCTCGAGG TGTGCGTGTA CGCCGCCCCG ACCCAGACCG TGATCGGCGG CCCGCCCGAG CAGGTGGACG CGATCATCGC CAGGGCCGAG CAGGAGGGCA AGTTCGCCCG CAAGTTCCAG ACCAAGGGCG CCAGCCACAC CTCGCAGATG GATCCGCTGC TCGGTGAGCT GGCCGCCGAA CTGGTCGGAA TCACCCCGCA CCCGTTGCAG ATCGGCTACT ACTCGACGGT GCACGAGGGC AAGTTCCTGC GGGCGGGCAG CGAGCCGATC CACGACGTGG ACTACTGGAA GAAGGGGCTG CGCCACAGCG TCTACTTCAC CCACGGCATC CGCAACGCGG TCGACAACGG CCACACCACC TTTCTCGAGC TCGCCCCGAA CCCGGTGGCG CTCATGCAGG TCGGGCTGAC CACGATGTCG GCGGGTCTGC ATGACGGGCA GCTCATCGCG ACGCTGGCCC GCAAGCAGGA CGAGGTCGAC TCGATGACCG CGGCCATGGC CCAGTTGTTC GTCCACGGCC ACGACCTCGA CATGCGCACC CTGTTCCCGC GGCGTTCGCG CGGGCTGGCC GGTGCGCTGG ACTACGCGAA CATCCCGCCG ACCCGGTTCC GCCGCAAGCC GCACTGGCTC GACGTGCGCT TCAGCGGGGA CAACGCCGGT GTCATGCCGG GTAGCCACGT CGCCACCCCG GACGGCAGGC ACGTGTGGGA GTACTCGCCG CGCGGTGCCG TGGACGCCCA GGCCCTGGCG GCGCTGGTGA AATCCGCTGC CTCGCAGGTG TTCCCGGAGG CGGCCGTGAC GGCGGCGGAG CAGCGGGCGG TGCCCGGTGA CGGCGCCCGC CTGGTGACCA CGCTGACCCG CCACCCCGGT GGGGCGTCGG TGCAGGTGCA CGCCCGCATG GACAACGGAG 
GCGAATCTTC CTTCGCTCTG GTCTACGACG CGATCGTCAC CCGCGGTGGT CAGGCCGTTG CCCTGCCCGC TGCGGTCGCC ACGGGGACCG TTGCTCCGCA AGCCGATTCG CTGACTCCAG CGGCCGAACC CGAGGGCGGC GACGCGGCGA TCCTGTCCGA CAACCTCACC CAGGGCGCGA ACCTCGGTGC GGGACTGGGC AAGTGGTCGC CGGACTCCGG TGAGACCATC CACGACCGGC TCGGCACGAT CGTCGGCGGC GCCATGGGTT ACGAGCCCGA GGATCTGCCG TGGGAGGTGC CGCTCATCGA GCTCGGCCTG GATTCCCTGA TGGCGGTGCG GATCAAGAAC CGTGTCGAGT ACGACTTCGA CCTGCCGCCG ATCCAGCTGA CCGCGGTGCG CGACGCCAAC CTCTATGCCG TCGAACAGCT CATCACCTAC GCGATCGAGC ATCGCGACGA GGTCGACCAG CTGGCCGAAT CGCAGAAGGG TAAGACCGCC GAGGAGATCG CCGCCGAACA GGCCGAGCTG ATGGGCGGGG CGTCGACGGT CGCCGAGCTC GAGGAGAAGC TGGCCGCGGC GGGACACCCG CTGGGCGAAG CTGCCTCAGA GCAAGCTGCG GTGATGTCCG GGGCGAACCT GACGACGGTG ACCACGCCGG CGCCGGATCC GCAGGCCGAA CCGAGCACCC AACAAGATTC GGCGATCCCG GCCCCGCCGA CCGATCCGTC GGGACCGAAC ATCCCGCCGC CGCCGACCAA CCCGGCCGGA CCGGACACCA CCGCGAAGTC GTCCGCCGCC AAGGCGGCCG CGCAGGTGCT GACCCAGGAG GCCGTCACCG AGGCGCTGGG CGCCGACGTC CCACCGCGTG ACGCCGCCGA ACGCGTCACG TTCGCCACCT GGGCGATCGT CACCGGCAAA TCGCCGGGCG GCATCTTCAA CGAGCTGCCG AAGGTCGACG ACGCCACGGC GGCGAAGATG GCCGAACGGT TGACCGAACG AGCCGAAGGC ACCATCACCG CCGACGACGT GAAGGCCGCC ACCACCATCG AGGATCTGGC CACCACGGTG CGCGAACACC TCGAGGCGGG CAAGGTCGAC GGGTTCGTCC GCGTCCTGCG TGCGCCGCAG GAAGGGAGCG ACCGGATCCC TGTGTTCGTG TTCCATCCGG CCGGCGGGTC GACGGTGGTG TACGAACCGC TGATGAAGCG GCTGCCGCCG GACACCCCGA TCTACGGCAT CGAGCGGGTC GAGGGCTCCG TGGAGGAGCG GGCCGCGGAG TACGTGCCGA AACTGCTCGA GATGAACGGG TGGACCGAGG GCAGGTCCGG CGTGCCGTTC ATCCTGGCGG GCTGGTCGCT GGGCGGGGTG CTGGCCTATG CGTGCGCCAT CGGGCTCAAG CAGGCCGGTG CGGACGTGCG GTTCGTCGGC CTCATCGACG CGGTGCGCGC CGGTGAGGAG GTCCCGCAGA CCAAGGAGGA GACCCGCGCC CGCTGGGAGC GCTACGCCCG GTTCGCCGAG CGCACCTTCA ACGTGCAGAT CCCGGAGATC CCGTACGAGG AGCTGGAGAA CCTCGACGAC GAGGGTCAGG TGAGGTTCGT CATGGAGGCC GTCGCGGCCA GCGGCGTGCA GATCCCCGGC GGCATCATCG AGCACCAGCG CACGTCGTAT CTGGACAACC GGATGATCGA CACCGCGGAG ATCAAGCCGT ACGACGGTCA CGTCACCCTC TACATGGCCG ACCGCTACCA CGACGACGCG ATCTACTTCG AACCGCGATA CGCCACCCGC CAACCCGACG GCGGCTGGGG 
TGAGTATGTT TCGGAGCTGG AAGTGATCCC GATCGGCGGT GAGCACATCC AGGCGATCGA CGAGCCGTAC ATCGCGAAGG TGGGTGCCCA CATGAGCGAG GCGATCAACC GTATCGAGGC CCAGGGAAAG TAG
|
Protein sequence | MNMSETPNNS PSAENQPIVA QGEGGPLRPA QVDMTVAEMR EWLRNWIANA TGQNADNIDE QTAMVELGLS SRDAVAMASD IEDLTGVTLT ATVAFRHPTI ESLATVIIEG EPEVEHDDDG TDWSRERDVD DIAIVGLATR FPGDMNTPDE MWEALLEGRD AITDLPEGRW EEFLGEPRIA ERVAKAATRG GYLSDIKGFD AEFFALSKME ADNIDPQQRM ALELTWEALE HARIPASSLR GERVGVYIGA SNNDYSFMSV ADPGVAHPYA ITGTTSSIIA NRVSYFYDFR GPSMAIDTAC SSSLVAAHQG VAALRAGEAD VAVVGGVNAL ITPLVTIGFD EVGGVLAPDG RIKSFSQDAN GYARSEGAGM LVLKRLSDAR RDGDEIYAVI AGSAVNHDGR SNGLLAPNPD AQAEVLRKAY KDAGINPRDV DYIEAHGTGT ILGDPIEADA LGRVIGRSRP ADQPALLGAV KSNVGHLESA AGAASLAKVA LSLRNDKLPP SINYTGPNPY IDFDAVRLKV NDTVSDWPRY SGHAIAGVSG FGFGGANAHM VLREVLPSDL VEPEPEQVVE VTAEPNQPAV YVGGVRMDEY GEFVDEPLAR RESGFDEDSD DDGLDRPAAA VEDDYELPGL TDEAKRLLEV AREELEAAEQ PVPLVPLAVS AFLTSRKKSA AAELADWMDS EEGRASSLES IGRALSRRNH GRSRAVVMAR DHDEAIKGLR ALAEGKQSPN VYSADGPVTN GPVWVLAGFG AQHRKMGKSL YLRNEVFADW INKVDSHVQD ERGHSILELI LDDAVDYTDE TTELPIEKVQ LVIFAIQVAL GELLKHHGAK PGAVIGQSLG EAAAAYFSGG LSLEDATRAI CSRSHLMGEG EAMLFGEYIR LMALVEYSAD EIKTVFSDYP DLEVCVYAAP TQTVIGGPPE QVDAIIARAE QEGKFARKFQ TKGASHTSQM DPLLGELAAE LVGITPHPLQ IGYYSTVHEG KFLRAGSEPI HDVDYWKKGL RHSVYFTHGI RNAVDNGHTT FLELAPNPVA LMQVGLTTMS AGLHDGQLIA TLARKQDEVD SMTAAMAQLF VHGHDLDMRT LFPRRSRGLA GALDYANIPP TRFRRKPHWL DVRFSGDNAG VMPGSHVATP DGRHVWEYSP RGAVDAQALA ALVKSAASQV FPEAAVTAAE QRAVPGDGAR LVTTLTRHPG GASVQVHARM DNGGESSFAL VYDAIVTRGG QAVALPAAVA TGTVAPQADS LTPAAEPEGG DAAILSDNLT QGANLGAGLG KWSPDSGETI HDRLGTIVGG AMGYEPEDLP WEVPLIELGL DSLMAVRIKN RVEYDFDLPP IQLTAVRDAN LYAVEQLITY AIEHRDEVDQ LAESQKGKTA EEIAAEQAEL MGGASTVAEL EEKLAAAGHP LGEAASEQAA VMSGANLTTV TTPAPDPQAE PSTQQDSAIP APPTDPSGPN IPPPPTNPAG PDTTAKSSAA KAAAQVLTQE AVTEALGADV PPRDAAERVT FATWAIVTGK SPGGIFNELP KVDDATAAKM AERLTERAEG TITADDVKAA TTIEDLATTV REHLEAGKVD GFVRVLRAPQ EGSDRIPVFV FHPAGGSTVV YEPLMKRLPP DTPIYGIERV EGSVEERAAE YVPKLLEMNG WTEGRSGVPF ILAGWSLGGV LAYACAIGLK QAGADVRFVG LIDAVRAGEE VPQTKEETRA RWERYARFAE RTFNVQIPEI PYEELENLDD EGQVRFVMEA VAASGVQIPG GIIEHQRTSY LDNRMIDTAE IKPYDGHVTL YMADRYHDDA IYFEPRYATR 
QPDGGWGEYV SELEVIPIGG EHIQAIDEPY IAKVGAHMSE AINRIEAQGK
|