Gene Mjls_5172 details


Gene Information

Locus tag: Mjls_5172
Symbol:
ID: 4880870
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 5414805
End bp: 5418740
Gene Length: 3936 bp
Protein Length: 1311 aa
Translation table: 11
GC content: 72%
IMG OID: 640142482
Product: non-ribosomal peptide synthetase
Protein accession: YP_001073427
Protein GI: 126437736
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
            [TIGR02353] non-ribosomal peptide synthetase terminal domain of unknown function
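
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names `span_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the gene record.
START_BP = 5414805
END_BP = 5418740
GENE_LENGTH_BP = 3936
PROTEIN_LENGTH_AA = 1311


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS, minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for this record: 5418740 - 5414805 + 1 = 3936 bp,
# and 3936 / 3 - 1 = 1311 aa.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```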


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.066843
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTGACAG CAGCGGTTCC CGGCGCCGGC CTCCCCCACC AGTACCTCCA GTCCACGTTC 
GCCGCCGCGC CACGCACCCT CGTCGACATC CTCCACGAGA CCGCCGCCCG CTATCCCGAC
GCTCCCGCCA TCGACGACGG CACCGTCCAG CTGACCTACG CCGAGTTGAT CGCCGACATC
GAGGACAGCG TCGAGTGGCT GGCCGCCCGC GGTATCGGGC GCGGTGACCG CGTCGGCATC
CGGATGCCGT CCGGCACCTA CGCGCTGTAT GTCGCGATCC TGTCCACCCT GGCCACCGGC
GCCGCCTACG TCCCCGTCGA CGCCGACGAT CCCGACGAGC GCGCACAACT GGTGTTCACC
GAGGCCAACG TGGTCGCCGT CATCACCGAG AGGGGCCTGG TCCGCGGGCC CGGATCGTCG
CGCGGGTGGC GCGCGGTCGC GCCGCTGAGC CGCGACGACG CGTGGATCAT CTTCACCTCC
GGCTCCACCG GCGTCCCCAA GGGCGTGGCC GTCACCCACC GCAGCGCCGC CGCCTTCGTC
GACGCCGAGG CCAAGATGTT CCTCACCGAC AATCCGCTGG CCCCCGGTGA CCGGGTGCTC
GCCGGATTGT CGGTCGCGTT CGACGCCTCG TGTGAGGAGA TGTGGCTGGC GTGGCGGCAT
GGCGCCTGCC TGGTGCCCGC ACCGCGCTCG CTCGTGCGCA GCGGTATGGA TCTGGGGCCG
TGGCTGGTGT CCCGCGACAT CACCGTCGTC TCGACGGTGC CGACGCTCGC GGCGCTGTGG
CCTGCCGAGG CGCTCGAAGC GGTCCGGCTG CTGATCTTCG GCGGGGAGGC CTGCCCCCCG
GATCTGGCCG AACGGCTCGC GGTTGAGGGG CGGGAGGTCT GGAACACCTA CGGTCCGACC
GAGGCGACGG TGGTGGCGTC CGCAGCCCGG CTCGACGGCA GGTCGCCGGT GAGCATCGGC
CTGCCACTGC CCGGCTGGGA TCTGGCGGTC GTCGACAAGG ACGGTGCGCC GGTGCCCTTC
GGCGAGGTCG GCGAACTGGT GATCGGCGGG GTCGGGCTGG CCCGCTACCT CGACCCGGAG
AAAGACGCCG AGAAGTACGC CCCGATGCCG ACTCTCGGCT GGGACCGCGC GTACCGCAGC
GGCGACCTGG TGCGGTTGGA GGCCGAGGGT CTCTACTTCA TGGGACGGGC CGACGATCAG
GTGAAGGTCG GCGGTCGGCG CATCGAACTC GGCGAGGTCG ATGCGGCGCT GGTGCACCTG
CCCGGGGTGA GCGGCGGCGC GGCCGCGGTG CGCCGCACCT CCAGCGGGAC ACCGCTGCTG
GTGGGCTACG TGGCCAGCGC GGATCCGTCG TTCGACCTGG CCGCGGCCCG CGCCGCACTC
GCCGAGGCGC TGCCCGCGGC ACTGGTCCCG CGCCTGGTCC TGGTCGACGA ACTGCCCACC
CGCACGTCGG GGAAGGTCGA CCGCGACGCG CTGCCCTGGC CGGTGGCCGG CGCCGAGGAT
GCGGCGCCCG CCATGAGCGG CACGATGGGC TGGCTGGCCG GACTGTGGCA CGACGTGCTC
GCCGCCCCGG TCGACGGCCC CGAGGCGGAC TTCTTCGCCT GCGGCGGCGG CAGCCTGTCG
GCGGCCCAGC TGGTGGTCGC GCTGCGGCAG CGCTACCCGA TGGTGACCGT GGCCGACCTC
TATGACCACC CGCGCCTGGG GTCGCTGGCC GGATACCTCG ACGAACTCGA CCCGCCGCCG
CAGGTGGTGA CCCGCGAGGT CGCACCGACA CCGCGGCTCA CGCAGGCCGC CCAGGTGGTG
CTGACGCTGC CGCTGGCCAC GCTGACCGCA CTGCAGTGGG TCGTGTGGCT GGCGCTGCTG
AACAACGTCG CCGCCGAATT CTCGTTGGTG TCGTGGGCGG TCCCGCTGAA CTGGTGGGTG
GTGCTGGCCG GTTTCGTGCT GTTCGTCTCG CCCGTGGGCC GGATGGGCAT CGCGGTGCTG
TTCGCGCGCA TGCTGCTGGG CAACCTGCAA CCGGGCACCT ACCGTCGCGG CGGTGCCGTC
CACCTGAGGG TGTGGCTGGC CGAACGACTC GCCGCGGCCA GCGGCGCCGA GAACCTCGCG
GGTGCGCCGT GGATGGTCTA CTACGCTCGC GCGCTTGGCA ATTCGGTCGG CAAGGAAGTC
GACCTGCATT CGGCGCCGCC GGTGACGGGC ATGCTCAAGA TCGGGCACCG CTCCTCGGTG
GAGCCCGAAG TCGACCTGTC CGGGCACTGG ATCGACGGCG ACCAGTTCCA CGTGGGGCAC
ATCACCATCG GCAACGACGC CACCATCGGC GCGCGCAGCA CCCTGCTGCC CGGTGCGGCG
GTCGGCAAGA ACGCCGACGT GGCCGCCGGG TCGGCCGTCA CCGGCAAGGT CAAGAACGGC
CAGTACTGGA CGGGTTCACC GGCGGCGAAG TCCGGGAAGG CCCGCCACCC GTGGCCGGGT
CACCGCCCGC CGCGCGCGCC GCTGTGGGTG GCCGTCTACG GGCTGACATC GATGCTGCTC
GGCGGGCTGC CGCTGCTCGC GCTCGCGGCC GGACTGGCCG TGATCGGCTG GGGTGTGCGC
GACACCGCGT CGGTGTCCGA CGCGATCCTT CCGGCGCTGC TGTGGACGCC GGTCGCCACG
GTCGTCGCGG TGCTCCTCTA CGCGGCGTTG ACAGTGGCGG GCGTGCGGCT GCTGTCGCTC
GGACTGCGGG AGGGCTACCA CCCGGTGCGC AGCCGGATCG GTTGGCAGGT GTGGGCCACC
GAGCGGCTCC TCGACGCCGC CCGCAACTAT CTGTTCCCGC TGTACGCCAG CCTGCTGACC
CCGTGGTGGC TGCGCGTGCT GGGCGCGCAG GTGGGCAGCG GCACCGAGAT CTCCACAGCG
CTGTTCACGC CGAAGTTCAC CGTCGTGGAG GACGGTGCGT TCCTCGCCGA CGACACCATG
GTCGCGTCCT ACGAACTGGG CGGCGGCTGG ATCCACGTCG CGAAGGCCAC CGTCGGCAGG
CGCGCCTTCC TCGGCAACTC GGGCATCACC CAGCCGGGCC GCAAGGTGCC CGACGACGGC
CTGGTGGCGG TGCTGTCGGC GACACCGCGC AAGGCGAAGG CCGGATCCTC CTGGCTGGGC
AGTCCGCCGG TCCGGTTGCG CCGCCGGCCC ACCGCCGCCG ACGCGCTGCG CACCTTCCAC
CCCACCCCCC GACTGAAGGC GATGCGCGGT GCGGTCGAGA CGTGCCGGCT GATTCCGGTG
ATCGTCACGT TCGCGATCGG GGTCGGAGTG CTGCTGTCGT TGCAGGCGTT GGTGTTTCAC
CTGGGCTACC TGTGGGCCGC AGTCCTGGGC GGAGTGGTCC TGATGGTGGC CGGGGCGGTC
GCCGGCGCCG TCGCCGTGCT GGCGAAGCGG GTGGTGATCG GACGGATCGA GGCCATCGAG
CACCCGCTGT GGTCGTCGTT CGTATGGCGT AACGAGGTGT CCGACACGTT CGTCGAGACG
GTGGCCGCGC CGTGGTTCGC CCGTGCCGCC AGTGGCACAC CGGTGATGAA CCTCTGGCTG
CGCGGCCTCG GCGCGAAGAT CGGCCGCGGC GTCTGGTGTG AGACCTACTG GCTGCCGGAG
GCGGATCTGG TCACCCTCGG CGAGGGCGCC ACCGTCAACC GCGGGTGCGT GGTGCAGACC
CACCTGTTCC ACGACCGCAT CATGCGGATG GACACTGTCG TCCTCGAGGA CGGGGCCACG
CTGGGACCGC ACTGCGTGGC GCTGCCCGCG GCCCGGGTGG GCGCGGGCGC GACGGTGGGA
CCGGGCTCGC TGGTCATGCG CGGCGACGAG GTCCCGGCCT CGACCCGCTG GCAGGGCAAT
CCGATCGCGC CGTGGAATCC GTTGCGCAAG AAGCGCGGCG AGGCGTCGGC GAAGAAGAAG
AAAGCCGCCC GTGACCCCGA GGACTCGGCC GCGTGA
 
Protein sequence
MVTAAVPGAG LPHQYLQSTF AAAPRTLVDI LHETAARYPD APAIDDGTVQ LTYAELIADI 
EDSVEWLAAR GIGRGDRVGI RMPSGTYALY VAILSTLATG AAYVPVDADD PDERAQLVFT
EANVVAVITE RGLVRGPGSS RGWRAVAPLS RDDAWIIFTS GSTGVPKGVA VTHRSAAAFV
DAEAKMFLTD NPLAPGDRVL AGLSVAFDAS CEEMWLAWRH GACLVPAPRS LVRSGMDLGP
WLVSRDITVV STVPTLAALW PAEALEAVRL LIFGGEACPP DLAERLAVEG REVWNTYGPT
EATVVASAAR LDGRSPVSIG LPLPGWDLAV VDKDGAPVPF GEVGELVIGG VGLARYLDPE
KDAEKYAPMP TLGWDRAYRS GDLVRLEAEG LYFMGRADDQ VKVGGRRIEL GEVDAALVHL
PGVSGGAAAV RRTSSGTPLL VGYVASADPS FDLAAARAAL AEALPAALVP RLVLVDELPT
RTSGKVDRDA LPWPVAGAED AAPAMSGTMG WLAGLWHDVL AAPVDGPEAD FFACGGGSLS
AAQLVVALRQ RYPMVTVADL YDHPRLGSLA GYLDELDPPP QVVTREVAPT PRLTQAAQVV
LTLPLATLTA LQWVVWLALL NNVAAEFSLV SWAVPLNWWV VLAGFVLFVS PVGRMGIAVL
FARMLLGNLQ PGTYRRGGAV HLRVWLAERL AAASGAENLA GAPWMVYYAR ALGNSVGKEV
DLHSAPPVTG MLKIGHRSSV EPEVDLSGHW IDGDQFHVGH ITIGNDATIG ARSTLLPGAA
VGKNADVAAG SAVTGKVKNG QYWTGSPAAK SGKARHPWPG HRPPRAPLWV AVYGLTSMLL
GGLPLLALAA GLAVIGWGVR DTASVSDAIL PALLWTPVAT VVAVLLYAAL TVAGVRLLSL
GLREGYHPVR SRIGWQVWAT ERLLDAARNY LFPLYASLLT PWWLRVLGAQ VGSGTEISTA
LFTPKFTVVE DGAFLADDTM VASYELGGGW IHVAKATVGR RAFLGNSGIT QPGRKVPDDG
LVAVLSATPR KAKAGSSWLG SPPVRLRRRP TAADALRTFH PTPRLKAMRG AVETCRLIPV
IVTFAIGVGV LLSLQALVFH LGYLWAAVLG GVVLMVAGAV AGAVAVLAKR VVIGRIEAIE
HPLWSSFVWR NEVSDTFVET VAAPWFARAA SGTPVMNLWL RGLGAKIGRG VWCETYWLPE
ADLVTLGEGA TVNRGCVVQT HLFHDRIMRM DTVVLEDGAT LGPHCVALPA ARVGAGATVG
PGSLVMRGDE VPASTRWQGN PIAPWNPLRK KRGEASAKKK KAARDPEDSA A
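
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following minimal sketch shows one way to strip that layout, compute GC content, and translate a coding sequence. It is not the database's own method. The codon assignments are the standard ones, which translation table 11 shares; `ALT_STARTS` is an assumed subset of the table 11 alternative start codons, read as M in the first position. All names (`clean`, `gc_content`, `translate`) are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base over TCAG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: a subset of alternative start codons; read as M at position 1.
ALT_STARTS = {"GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence up to (not including) the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        if index == 0 and codon in ALT_STARTS:
            amino_acid = "M"
        protein.append(amino_acid)
    return "".join(protein)


# The first 21 bases of the gene sequence give the first 7 residues.
print(translate("GTGGTGACAG CAGCGGTTCC C"))  # MVTAAVP
```

Applied to the start of the gene sequence, the sketch reproduces the opening residues of the listed protein, including the GTG start codon read as M.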