Gene Information |
Locus tag | Mkms_4872 |
Symbol | |
ID | 4615831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | + |
Start bp | 5100400 |
End bp | 5104335 |
Gene Length | 3936 bp |
Protein Length | 1311 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639794563 |
Product | non-ribosomal peptide synthetase |
Protein accession | YP_940852 |
Protein GI | 119870900 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain; [TIGR02353] non-ribosomal peptide synthetase terminal domain of unknown function |
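The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 5100400
end_bp = 5104335

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# A CDS of N bases holds N // 3 codons; the final stop codon
# does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 3936 0 1311
```

Under these assumptions the result agrees with the listed gene length of 3936 bp and protein length of 1311 aa.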
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.169627 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.377916 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTGACAG CAGCGGTTCC CGGCGCCGGC CTCCCCCACC AGTACCTCCA GTCCACGTTC GCCGCCGCGC CACGCACCCT CGTCGACATC CTCCACGAGA CCGCCGCCCG CTATCCCGAC GCTCCCGCCA TCGACGACGG CACCGTCCAG CTGACCTACG CCGAGTTGAT CGCCGACATC GAGGACAGCG TCGAGTGGCT GGCCGCCCGC GGTATCGGGC GCGGTGACCG CGTCGGCATC CGGATGCCGT CCGGCACCTA CGCGCTGTAT GTCGCGATCC TGTCCACCCT GGCCACCGGC GCCGCCTACG TCCCCGTCGA CGCCGACGAT CCCGACGAGC GCGCACAACT GGTGTTCACC GAGGCCAACG TGGTCGCCGT CATCACCGAG AGGGGCCTGG TCCGCGGGCC CGGATCGTCG CGCGGGTGGC GCGCGGTCGC GCCGCTGAGC CGCGACGACG CGTGGATCAT CTTCACCTCC GGCTCCACCG GCGTCCCCAA GGGCGTGGCC GTCACCCACC GCAGCGCCGC CGCCTTCGTC GACGCCGAGG CCAAGATGTT CCTCACCGAC AATCCGCTGG CCCCCGGTGA CCGGGTGCTC GCCGGATTGT CGGTCGCGTT CGACGCCTCG TGTGAGGAGA TGTGGCTGGC GTGGCGGCAT GGCGCCTGCC TGGTGCCCGC ACCGCGCTCG CTCGTGCGCA GCGGTATGGA TCTGGGGCCG TGGCTGGTGT CCCGCGACAT CACCGTCGTC TCGACGGTGC CGACGCTCGC GGCGCTGTGG CCTGCCGAGG CGCTCGAAGC GGTCCGGCTG TTGATCTTCG GCGGGGAGGC CTGCCCCCCG GATCTGGCCG AACGGCTCGC GGTTGAGGGG CGGGAGGTCT GGAACACCTA CGGTCCGACC GAGGCGACGG TGGTGGCGTC CGCAGCCCGG CTCGACGGCA GGTCGCCGGT GAGCATCGGC CTGCCACTGC CCGGCTGGGA TCTGGCGGTC GTCGACAAGG ACGGTGCGCC GGTGCCCTTC GGCGAGGTCG GCGAACTGGT GATCGGCGGG GTCGGGCTGG CCCGCTACCT CGACCCGGAG AAAGACGCCG AGAAGTACGC CCCGATGCCG GCTCTCGGCT GGGACCGCGC GTACCGCAGC GGTGACCTGG TGCGGTTGGA GGCCGAGGGT CTCTACTTCA TGGGACGGGC CGACGATCAA GTGAAGGTCG GCGGTCGGCG CATCGAACTC GGCGAGGTCG ATGCGGCGCT GGTGCACCTG CCCGGGGTGA GCGGCGGCGC GGCCGCGGTG CGCCGCACCT CCAGCGGGAC ACCGCTGCTC GTGGGTTACG TGGCCAGCGC GGATCCGTCG TTCGACCTGG CCGCGGCCCG CGCCGCACTC GCCGAGGCGC TGCCCGCGGC GCTGGTCCCG CGCCTGGTCC TGGTCGACGA ACTGCCCACC CGCACGTCGG GGAAGGTCGA CCGCGACGCG CTGCCCTGGC CGGTGGCCGG CGCCGCGGAT GCGGCGCCCG CCATGAGCGG CACGATGGGC TGGCTGGCCG GACTGTGGCA GGACGTGCTC GCCGCCCCGG TCGACGGCCC CGAGGCGGAC TTCTTCGCCT GCGGCGGCGG CAGCCTGTCG GCGGCCCAGC TGGTGGTCGC GCTGCGGCAG CGCTACCCGA TGGTGACGGT GGCCGACCTC TATGACCACC CGCGCCTGGG GTCGCTGGCC GGATACCTCG ACGAACTCGC CCCGCCGCCG CAGGTGGTGA CCCGCGAGGT CGCACCGACA CCGCGGCTCA CGCAGGCCGC CCAGGTGGTG 
CTGACGCTGC CGCTGGCCAC GCTGACCGCA CTGCAGTGGG TCGTGTGGCT GGCGCTGCTC AACAACGTCG CCGCCGAATT CTCGTTGGTG TCGTGGGCGG TCCCGCTGAA CTGGTGGGTG ATGCTGGCCG GTTTCGTGCT GTTCGTCTCG CCCGTGGGCC GGATGGGCAT CGCGGTGCTG TTCGCGCGCA TGCTGCTGGG CAACCTGCAA CCGGGCACCT ACCGACGCGG CGGTGCCGTC CACCTGAGGG TGTGGCTGGC CGAACGACTC GCCGCGGCCA GCGGCGCCGA GAACCTCGCG GGTGCTCCGT GGATGGTCTA CTACGCTCGC GCGCTGGGCA ATTCGGTCGG CAAGGAAGTC GACCTGCATT CGGCGCCCCC TGTGACGGGC ATGCTCAAGA TCGGGCACCG CTCCTCGGTG GAGCCCGAAG TCGACCTGGC CGGGCACTGG ATCGACGGCG ACCAGTTCCA CGTGGGGCAC ATCACCGTCG GCAACGACGC CACCATCGGC GCGCGCAGCA CCCTACTGCC CGGTGCGGTC GTCGGCAAGA ACGCCGACGT GGCCGCGGGG TCGGCCGTCA CCGGCAAGGT CAAGAACGGC CAGTATTGGA CGGGTTCACC GGCGGCGAAG TCCGGGAAGG CCCGCCACCC GTGGCCGGGT CACCGCCCGC CGCGCGCGCC GCTGTGGGTG GCCGTCTACG GGCTGACATC GATGCTGCTG GGCGGGCTGC CGCTGCTCGC GCTCGCGGCC GGACTGGCCG TGATCGGCTG GGGTGTGCGC GACACCGCGT CGGTGTCCGA CGCGATCCTT CCGGCGCTGC TGTGGACGCC GGTCGCCACG GTCGTCGCGG TGCTCCTCTA CGCGGCGTTG ACAGTGGCGG GCGTGCGGCT GCTGTCGCTC GGACTGCGGG AGGGCTACCA CCCGGTGCGC AGCCGGATCG GTTGGCAGGT GTGGGCCACC GAGCGGCTCC TCGACGCCGC CCGCAACTAT CTGTTCCCGC TGTACGCCAG CCTGCTGACC CCGTGGTGGC TGCGCGTGCT GGGTGCGCAG GTGGGCAGCG GCACCGAGAT CTCCACAGCG CTGTTCACGC CGAAGTTCAC CGTGGTGGAG GACGGCGCGT TCCTCGCCGA CGACACCATG GTCGCGTCCT ACGAACTGGG CGGCGGCTGG ATCCACGTCG CGAAGGCCAC CGTCGGCAGG CGCGCCTTCC TCGGCAACTC GGGCATCACC CAGCCGGGCC GCAAGGTGCC CGACGACGGT CTGGTGGCGG TGCTGTCGGC GACACCGCGC AAGGCGAAGG CCGGATCCTC CTGGCTGGGC AGTCCGCCGG TCCGGTTGCG CCGCCAGCCC ACCGCCGCCG ACGCGCTGCG CACCTTCCAC CCCACCCCCC GACTGAAGGC GATGCGCGGT GCGGTCGAGA CGTGCCGGCT GATCCCGGTG ATCGTCACGT TCGCGATCGG GGTCGGAGTG CTGCTGTCGT TGCAGGCGTT GGTGTTTCAT CTGGGCTACC TGTGGGCCGC GGTGCTGGGC GGAGTGGTCC TGATGGTGGC CGGGGCGGTC GCCGGCGCCG TCGCCGTGCT GGCGAAGCGG GTGGTGATCG GACGGATCGA GGCAATCGAG CACCCGCTGT GGTCGTCGTT CGTATGGCGT AACGAGGTGT CCGACACGTT CGTCGAGACG GTGGCCGCGC CGTGGTTCGC CCGTGCCGCC AGTGGCACGC CGGTGATGAA CCTCTGGCTG CGCGGCCTCG GGGCGAAGAT CGGCCGCGGC GTCTGGTGTG AGACCTACTG GCTGCCGGAG GCGGATCTGG 
TCACGCTCGG CGAGGGCGCC ACCGTCAACC GCGGGTGCGT GGTGCAGACC CACCTGTTCC ACGACCGCAT CATGCGGATG GACACCGTCG TCCTCGAGGA CGGGGCCACG CTGGGACCGC ACTGCGTGGC GCTGCCCGCG GCCCGGGTGG GCGCGGGTGG GACGGTGGGA CCGGGCTCGC TGGTCATGCG CGGCGACGAG GTCCCGGCCT CGACGCGCTG GCAGGGCAAT CCGATCGCGC CGTGGAATCC GTTGCGCAAG AAGCGCGGCG AGGCGCCGGC GAAGAAGAAG AAGGCCGCCC GTGACCCCGA GGACTCGGCC GCGTGA
|
Protein sequence | MVTAAVPGAG LPHQYLQSTF AAAPRTLVDI LHETAARYPD APAIDDGTVQ LTYAELIADI EDSVEWLAAR GIGRGDRVGI RMPSGTYALY VAILSTLATG AAYVPVDADD PDERAQLVFT EANVVAVITE RGLVRGPGSS RGWRAVAPLS RDDAWIIFTS GSTGVPKGVA VTHRSAAAFV DAEAKMFLTD NPLAPGDRVL AGLSVAFDAS CEEMWLAWRH GACLVPAPRS LVRSGMDLGP WLVSRDITVV STVPTLAALW PAEALEAVRL LIFGGEACPP DLAERLAVEG REVWNTYGPT EATVVASAAR LDGRSPVSIG LPLPGWDLAV VDKDGAPVPF GEVGELVIGG VGLARYLDPE KDAEKYAPMP ALGWDRAYRS GDLVRLEAEG LYFMGRADDQ VKVGGRRIEL GEVDAALVHL PGVSGGAAAV RRTSSGTPLL VGYVASADPS FDLAAARAAL AEALPAALVP RLVLVDELPT RTSGKVDRDA LPWPVAGAAD AAPAMSGTMG WLAGLWQDVL AAPVDGPEAD FFACGGGSLS AAQLVVALRQ RYPMVTVADL YDHPRLGSLA GYLDELAPPP QVVTREVAPT PRLTQAAQVV LTLPLATLTA LQWVVWLALL NNVAAEFSLV SWAVPLNWWV MLAGFVLFVS PVGRMGIAVL FARMLLGNLQ PGTYRRGGAV HLRVWLAERL AAASGAENLA GAPWMVYYAR ALGNSVGKEV DLHSAPPVTG MLKIGHRSSV EPEVDLAGHW IDGDQFHVGH ITVGNDATIG ARSTLLPGAV VGKNADVAAG SAVTGKVKNG QYWTGSPAAK SGKARHPWPG HRPPRAPLWV AVYGLTSMLL GGLPLLALAA GLAVIGWGVR DTASVSDAIL PALLWTPVAT VVAVLLYAAL TVAGVRLLSL GLREGYHPVR SRIGWQVWAT ERLLDAARNY LFPLYASLLT PWWLRVLGAQ VGSGTEISTA LFTPKFTVVE DGAFLADDTM VASYELGGGW IHVAKATVGR RAFLGNSGIT QPGRKVPDDG LVAVLSATPR KAKAGSSWLG SPPVRLRRQP TAADALRTFH PTPRLKAMRG AVETCRLIPV IVTFAIGVGV LLSLQALVFH LGYLWAAVLG GVVLMVAGAV AGAVAVLAKR VVIGRIEAIE HPLWSSFVWR NEVSDTFVET VAAPWFARAA SGTPVMNLWL RGLGAKIGRG VWCETYWLPE ADLVTLGEGA TVNRGCVVQT HLFHDRIMRM DTVVLEDGAT LGPHCVALPA ARVGAGGTVG PGSLVMRGDE VPASTRWQGN PIAPWNPLRK KRGEAPAKKK KAARDPEDSA A
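The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG can serve as an initiation codon. The sketch below illustrates this on the first 30 and last 18 bases of the gene sequence only. The helper `translate` and the `START_CODONS` set are written for this illustration and are not taken from the record; the codon table is built from the standard NCBI ordering, and the start codon list is the one commonly given for table 11.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons commonly listed for translation table 11 (assumption).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(cds, start_codons=START_CODONS):
    """Translate a coding sequence, reading an initial start codon as M."""
    cds = cds.replace(" ", "").upper()
    usable = len(cds) - len(cds) % 3  # ignore any incomplete trailing codon
    protein = []
    for i in range(0, usable, 3):
        codon = cds[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in start_codons:
            aa = "M"  # an initiation codon is read as methionine
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


head = "GTGGTGACAG CAGCGGTTCC CGGCGCCGGC"  # first 30 bases of the gene
tail = "GAGGACTCGGCCGCGTGA"                # last 18 bases of the gene

print(translate(head))  # MVTAAVPGAG
print(translate(tail))  # EDSAA
```

Both fragments match the corresponding ends of the protein sequence listed above. Translating the full 3936 bp sequence the same way would be expected to give the 1311 aa product, although that is not run here.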