Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mkms_3528 |
Symbol | |
ID | 4611458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | - |
Start bp | 3703064 |
End bp | 3707449 |
Gene Length | 4386 bp |
Protein Length | 1461 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639793204 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_939512 |
Protein GI | 119869560 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01720] non-ribosomal peptide synthase domain TIGR01720 [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.567451 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGCGGA CCCAGGCCGC TGCCAACGCG ATCGAGGACG TGATGGCGCT GAGCCCGCTG CAGCAGGGGC TGTACTCGAT GACGATGCTC GCGGACTCCG AGCACGCCGA CGGCGATCCC CGTCCGGATC CCTACGTGAT CGCGATGGCC GCCGACGTCA CCGGGACGCT GGACACCGCC CTGCTGCGCG ACTGCGCGAC GGCGCTGCTG ACGCGGCATC CGAACCTGCG GGCCGGCTTC TTCCGCGGTG ACCTGCCCCG TCCGGTCCAG GTCGTCCCCG CCCGCGTCGA CCTGCCCTGG CAGCATGTGC GCGCCCGGAC CGCCGAGAAG GCGGCCGAAC TCGAGCACCG TGAGCGACGG CGCCCCTTCG ACCTCGAACG GGGGCCTGCG ATCCGGTTTC TGCTCATCGA ATTGCCGGGT CCTGCTTGGC GATTGGTGAT CGTCGCGCAC CACATCGTGA TCGACGGCTG GTCGCTTCCG CTGTTCGTCG GCGAAATGCT GACGCTCTAC CGCGCCGGCG GGGATCCGGG TGCGCTGGGC GACCCGCCGC GGCCCTACCG TGACTACATC GGCTGGCTGG CCGGTCGTGA CCAGGCGGCG AGTCGGGCGC TGTGGACCGA GCACCTCGCC GGCATCGACG CTCCGACGCT GCTCACCCCG GCGCTGTCCG CCGTCGAACC CGCCGCCGGG CTGCCGCGGC GCACCGAGGT CGCACTGGAC GCCGGCGCCA GCCGACGGCT CACCGAGACG GCCCGCTCGC GGGGCGTGAC CGTCAACACG CTGGTGCAGA TGGCCTGGGC GGCAATGCTG TCGGTGTTCA CCGACCGCCG TGACGTGACG TTCGGGATGA CGGTGTCGGG ACGCCCGGGT GAACTCGCGG GCGTCGAGAC GATGGTCGGC CTGTTCATCA ACACCGTCCC GCTGCGGGTG CGGCTCGACC CGGTGGACAC CGTTGGCAGT CAATGCCTTT CATTGCAGCG CACCGCCGCC GCGCTGCGCG AGCACAGCTA CCTCGCCCAC TCCGAGCTGC GGGCGCTGGC CGGGGTGGGG GAGATGTTCG ACTCGCTGCT GGTCTACGAG AACTTCCCAC CGGGCGGGCT GGTCGGCGGC GACCACCGCT TCGAAGCCGC CGGCGCCACC TTCATCCCCG CGGCCCTCGA GAGCCTGTCG CACTTCCCGG TCACCATCGC CGCCCACATG GCCGACGAGC GGTTGACCGT ATTGGTGGAG ACCCTCGACG GCGCCCTGGG CGCGCTGACC CCGCACGACG TCGGGCAGGG CCTGCTCGAC ACGGTCGAGC GGCTGATCGC CCTCTGGGAC CGGCCACTTC GCGACGTGCG GATGCTGTCC GGCGACGACC CTGCGGTGCC GGCTGTCACG GAACCGCCGC CGCGGACCGG CGTGCACTGC GCGTTCACCG AGGCGGCGCG CCGCACCCCG GATGCGGTCG CGCTCAGCTG GGACGGCGGC ACGCTCACCT ACCGCGAGGT CGACGCGGCC GCCGACCGGC TGGCCGCCGG CCTGACCGCA CGCGGTGTCG GCGCCGAAAC GCCGGTCGCC GTGAGGCTCT CGCGCGGACC CGATTACGTG ATCGCGATGC TTGCCGTCCT CAAGGCGGGC GGCATGATCG TGCCGCTCGA TCCCGGCATG GCCGGGGAGC GGATCGAGGA GATCCTGCGC CAGACTGCGG CGCCCGTCGT CGTCGACGAT GCGCTGTCGG CCGGCGTCGG TGCACCCGAC GGGGCGTGGG CACCGGCGAC CGTCGCCCCG GGGCAGGCCG CCTATGCGGT GTTCACCTCC GGCACCACCG GCATCCCCAA GGGTGTGGTC GGCACCCACG ACGCGGTGCT CGCCTACGCA GACGACCACG CCCGCCACGT CCTACGGCCG GCGGCGACAC GGCTGGGCCG TCCGCTCCGG ATCGCGCACG CCTGGTCGTT CACCTTCGAC GCAGCGTGGC AGCCGCTCGT CGCGTTGTTC GAGGGGCATT CGGTGCACAT CATCGGCGAC GCGGTCCAGC GCGATGCCGA AGCGCTGGTC GACACCATCG ACCGGTATGA CATCGACATG ATCGACACCA CCCCGTCGAT GTTCGCTCAG CTCAAGGCTT TCGGGTTGAT GTCGCGGGTG CCGCTGGCGG TGCTGGCACT CGGTGGCGAA GCCGTGGGCT CCGGCGCGTG GCGGTTCATC CGCGAGGAGT GCGCGCGCAC CTCGATGACG GCCTTCAACT GTTACGGCCC CACCGAGACC ACCGTGGAGT CCGTCGTCGC CGCCATCGCG GAACACCCCC AGCCGGTGAT CGGATCGCCG ACCCGCCACG CCCGCGCCTA CGTGCTCGAC GCCTGGCTGC GTCCGGTTCC CGACGGCGTC GCGGGGGAGC TCTACCTGTC CGGCGCCCAA CTGGCCCGCG GCTATCTGGA CCGGGCCTGC GAGACGGCAG GCCGGTTCGT CGCAGACCCG TTCCTGGCCG GGAACCGGAT GTACCGCACC GGCGACGTGG TACGCCGTGA CGCCACCGGC GCCCTGCAGT ACCTGGGCCG CAGCGACGAT CAGGTCAAGA TCCGCGGCTT CCGGGTCGAA CCCGGCGAGG TGTGCGCGGT GTTGCAGACC CATCCCGCCG TGCGGGCCGC CCATGTCACC GTGCGCCGCC ACGGCGCCGG GCCCCGATTG ACGGCCTATG CGGCCACCGG AGGCACCGAC GTCGCCGTCG CCGAACTGCG CCACATGCTC AGCACGCGGC TGCCGCGCTA TCTGGTCCCG CACCACATCG CGGTGCTCGA CGAACTGCCG CTGACCGCGC ACGGCAAGAT CGACGACGCC ACGCTCGCCA CGTACGACGC CGCCGCGGCC GGGCCGGCGG CCGCCCCCGA GACGCCCACC GAGGCCGCCC TGGCCGAGGC GGTCGCCGAA CTGCTGGGCA CCACCGCGGT GGACGTGAAC GCGGATCTGC TCTCGCTGGG ACTGGACAGC ATCGTGGCAC TCTCGGTGGT GCAGGCCGCG CGCCGCCGCG GTGTCGCGCT GCGGGCCCGA CTCATGCTCG AGTGCGGTTC GATCCGCGAA CTCGCCACGG CCGTCGACGC CGAGGCCGGT ATCGAATCCT CCTCGCAGGC AACCGAAGCG GCCTCGTCGG ACCCGATCCC GGTGCTGCCC AACGTGCACT GGCTCTACGA GTACGGCCAC CCGCGCCGGC TGGCGCAGAC CGAGGCGATC CCGCTGCCCG ACGGCATCAC CGCCGACGCG CTGCGCACCG TGCTGCGCAC CGTCCTCGAC GGCCACGAGG TGCTGCGCAC CCGGTTCGAC CGCACCGCCA TGACGCTGGT GCCGCAGGCG GAGACGCCCG CCGACGAGCT GCTGTCGGAA GTCGCGGTGA GCGGTGACCT CGCCGACGCG GTCGCCGAAC ACACCGAAAC CGCGGTGCAG AGGCTGGATC CGGAACGTGG CCGGCTGTGG TCGGCGCTGT GGCTGCGCCC GGCGCACGGC CCGGGGGTGC TCGTGGTGAC CGCGCACGTG TTGGCGATCG ATCCGGCGTC GTGGCGGGTC GTCATCGGTG AACTCGACAC CGCCTGGCAC GCGCTGAGGG CAGGGCGCAC ACCCACACCC ACACGCGAAC ACACCACCTA CCGGCAGTGG TCGGCGCGCG TGCGCGAGCG GGCCCACCGG CTCGACACGG CCGACTTCTG GGCCGCGCAG CTCGACGGCG CCGACCCGGA ACTCGGGGCG CGCCGAGTGC GCCCGGAGAC CGACCGCGCA GGCGATCTCG AGGTCAGCGT CGCCATCACC GAACCCGATC TCACCGCCCG GCTGATCGCC TCGGCGCAGG CCATGCCGAC GCTGCTGACC CTGGCGGCGG CGCGCACCGT CACCGCCTGG CGACGGCGCC GCGGTCAACC CACACCGCCG CCGCTGCTGG CGCTGGAGAC CCACGGCCGC GCCGACAGCG CGGTGAGCGC CGACGCCGAC ACCGGCGACA CCGTCGGACT GTTCAGCGCG ATCTACCCCG TACGGGTCGA TCCCGATGCC GGGCCGGTCG AGATCCCCGG TGAGGGAATC GATTTCGGTC TGCTGCGATA CCTGCGGGCC GACACCGCGG AACGGCTCTG CGCCTACCGG CAACCGCAGC TGCTGCTGAA CTACCTGGGC CGCGCCGACG TCGGCGGCGC CGGGACGTTC AACGTCGACA AGGGTCTGCT GCGCGCGGTG TCGGTGCTAC CGGAACCGGA ACTGGCCGTC CGGCACGAGG TGACCGTGAT GGCCGCGATC CTTCCCCAGG GTGACGCGCC CGTCCTGGCC ACCCAGTGGC GGACGCTGCC CGAGGTGCTC TCCGCCGACG ACGTCGCGGT CCTGCAGACC CTGTGGCAGG ACGCGCTGCA GGAGGTTGCC CGATGA
|
Protein sequence | MTRTQAAANA IEDVMALSPL QQGLYSMTML ADSEHADGDP RPDPYVIAMA ADVTGTLDTA LLRDCATALL TRHPNLRAGF FRGDLPRPVQ VVPARVDLPW QHVRARTAEK AAELEHRERR RPFDLERGPA IRFLLIELPG PAWRLVIVAH HIVIDGWSLP LFVGEMLTLY RAGGDPGALG DPPRPYRDYI GWLAGRDQAA SRALWTEHLA GIDAPTLLTP ALSAVEPAAG LPRRTEVALD AGASRRLTET ARSRGVTVNT LVQMAWAAML SVFTDRRDVT FGMTVSGRPG ELAGVETMVG LFINTVPLRV RLDPVDTVGS QCLSLQRTAA ALREHSYLAH SELRALAGVG EMFDSLLVYE NFPPGGLVGG DHRFEAAGAT FIPAALESLS HFPVTIAAHM ADERLTVLVE TLDGALGALT PHDVGQGLLD TVERLIALWD RPLRDVRMLS GDDPAVPAVT EPPPRTGVHC AFTEAARRTP DAVALSWDGG TLTYREVDAA ADRLAAGLTA RGVGAETPVA VRLSRGPDYV IAMLAVLKAG GMIVPLDPGM AGERIEEILR QTAAPVVVDD ALSAGVGAPD GAWAPATVAP GQAAYAVFTS GTTGIPKGVV GTHDAVLAYA DDHARHVLRP AATRLGRPLR IAHAWSFTFD AAWQPLVALF EGHSVHIIGD AVQRDAEALV DTIDRYDIDM IDTTPSMFAQ LKAFGLMSRV PLAVLALGGE AVGSGAWRFI REECARTSMT AFNCYGPTET TVESVVAAIA EHPQPVIGSP TRHARAYVLD AWLRPVPDGV AGELYLSGAQ LARGYLDRAC ETAGRFVADP FLAGNRMYRT GDVVRRDATG ALQYLGRSDD QVKIRGFRVE PGEVCAVLQT HPAVRAAHVT VRRHGAGPRL TAYAATGGTD VAVAELRHML STRLPRYLVP HHIAVLDELP LTAHGKIDDA TLATYDAAAA GPAAAPETPT EAALAEAVAE LLGTTAVDVN ADLLSLGLDS IVALSVVQAA RRRGVALRAR LMLECGSIRE LATAVDAEAG IESSSQATEA ASSDPIPVLP NVHWLYEYGH PRRLAQTEAI PLPDGITADA LRTVLRTVLD GHEVLRTRFD RTAMTLVPQA ETPADELLSE VAVSGDLADA VAEHTETAVQ RLDPERGRLW SALWLRPAHG PGVLVVTAHV LAIDPASWRV VIGELDTAWH ALRAGRTPTP TREHTTYRQW SARVRERAHR LDTADFWAAQ LDGADPELGA RRVRPETDRA GDLEVSVAIT EPDLTARLIA SAQAMPTLLT LAAARTVTAW RRRRGQPTPP PLLALETHGR ADSAVSADAD TGDTVGLFSA IYPVRVDPDA GPVEIPGEGI DFGLLRYLRA DTAERLCAYR QPQLLLNYLG RADVGGAGTF NVDKGLLRAV SVLPEPELAV RHEVTVMAAI LPQGDAPVLA TQWRTLPEVL SADDVAVLQT LWQDALQEVA R
|
| |