Gene Mjls_4595 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_4595 
Symbol 
ID4880294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp4818713 
End bp4821835 
Gene Length3123 bp 
Protein Length1040 aa 
Translation table11 
GC content64% 
IMG OID640141898 
ProductMMPL domain-containing protein 
Protein accessionYP_001072851 
Protein GI126437160 
COG category[R] General function prediction only 
COG ID[COG2409] Predicted drug exporters of the RND superfamily 
TIGRFAM ID[TIGR00833] Transport protein
[TIGR03057] X-X-X-Leu-X-X-Gly heptad repeats 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGACG GCACTGCAGA TTCCCCAGGC CAGGCCACGC GCCGTGGCCA GCGGGACAAC 
GGATCCCCCA ACCCGGCCGA AACCCGCGAG TACAGCGAAC GGTTGGCAGC GCTAGCCCGA
TTCACCCTCC GACACAAAGC GCTGGTCATC GGGGTATGGC TAGGCGCCGC GGTCGTCCTC
GCGTTGCTGT TCCCCCAGCT AGAAACCGTG GTGCGCCAAC AGTCGGTGGA CCTCATCCCT
CGCGATGCGC CGTCCTTGCA GACGGTTGAC CGCATGAGCG CCGCTTTCGG CGAAGAAGGC
TCAAAAACCA TGGTGTTCGT CGCCATGGAA GACCCCAATG GCTTGACTCC GACTGCACGG
CAGCGCTACG GCGAACTCGT GCGCCGGTTG CAGGGCGAAG GCAACCACGT CCTGCTGGTG
CAGGACCTAC TGTCCGATCC GATTACCGAA GCCCAGGCAG TCAGCGCCGA CCGCAAAGCT
TGGTACCTGC CCGTAGGTGT CACCGGCACC CTCGGAGACC CCACGGCCGC CGAATCGGTA
AACGCGGTGC GCAACATCGC CGCCGAGGTC TTCACCGGCT CGACCACGAC TGTGCAAGTG
ACGGGACCAC CGGCAACCTT CAGCGACATG ATCGCATCCG CCGAGCACGA CCTGCTGCTG
ATCTCAATCG CTACCGCGGG CGTGATCGCC CTGATCCTGC TGATCGTCTA CCGGTCGGTG
TTCACCGCGC TGTTACCGCT GCTAGTAATC GGGTTGAGCC TGGCAGTCGG GCGCGGCGTG
CTGTCCGCCC TCGGCGAGAT GGGCATGCCC GTCTCCCAGT TCACCGTCGC CTTCATGACT
GCGATCCTGC TCGGCGCGGG AACCGACTAC ACCGTATTTC TGATCAGCCG ATACCACGAA
CAGCGCCGCG CCCAAGTACC CGCCGATCAG GCCGTCATCC ACGCCACCGC CAGCATCGGG
CGCGTCATCC TCGCCTCCGC CGCCACCGTC GCACTCGCGT TCCTGGCCAT GGTCTTCGCG
CGGCTCAGCG TCTTCGCCGC CCTTGGCCCC GCGTGTGCCA TCGCCGTACT GTTCGGATTT
CTGGCCACCG TAACCCTGCT GCCACCGGTG CTGTCGCTAG CCGCCAAACG CGGCATCGGT
GAACCCAAAC CCGATCGCAC CCGCCGCTAC TGGAACAGCG TCGCCGTCGC CGTGGTCCGC
CGTCCCGTGC CACTACTCAT CGTCAGCCTG GTCATCTTGC TCGCCCTGTC GGCAGCGGCA
GCAACCATCA AAATCAGCTA CGACGACCGC AAGGGCCAAC CAGACACCAC GGCCAGCAAC
CTGGGCTACC ACCTGCTGGA CCGCCACTTC CGCAAAGACG TCGTCATCAG CGAATTCCTC
GTCGTGGAAA ATCCGACCGA CATGCGGACC GGCAAAGGAC TGGCCGATCT TGACGAGATG
GCCTCCCGCG TCTCCCAGAT CCCCGGCGTC ACCAAGGTTT CCGGAGTCAC CCGCCCCACC
GGAGAGCGCC TCGACCAAGC AGAACTGGCC TGGCAGAACG GCCAGATCGG CGACAAAATG
GCCGGCGCCG TCGCCGAGGG CAACTCCCGC AAGGACGACC TCACCAAACT CACCGACGGC
GCCGACCAGC TCGCCGACGG CCTCGCCCAA CTCGACAGCA CCGTGCGCAC CGCCTTCACA
CCACTAGCCG GAATCCTCAC CCAAGCCCAA TCCGCAGGAA CTCAGGTCAA CCAATTCCGG
CCGCTGCTGC AACAACTTTC CGCCACTGCC CCCGCCGTCG ACCAAGCCAT CCAATCCGGC
CCAGGACTAC GACCGCTGGC CAACCAAGCC CAAAACGCAA TCACTCAACT CGACCCACTC
GTCGGCGCCC TCAACACCTC ACCCTGGTGC GCCACCACCC CACAATGCGC CCAAATCCGC
AACCAGGTCC AGATCCTGGT CACTCTGCGC GACAACGGAT TCTTCAACCA AATCGCCGAC
CTCGGGGACC GCTACGATCC CGCCACCAAT GCCACCGTTG GTGGCACCCT CGCCAACGTC
CAGAACGCAG TCGCCTCACT GGACAAGGCA TTCGGAGCCC TGGGTGACCC CGCCGACCTG
ACCACAAATC TCCGCCGATT GCAGGACGGA ATCGGACAGC TGGCCTCCGG CGCCCAAGCA
CTCGCCACCG GCGTCCGCAC CCTCGCCGAC AGCAACATCG AAATGCTGTC CGGCATGAGC
CAGATCGCCA CCCAACTACA GAACTCCTCG CGCGCAGCGG CCGACTCCGA CTCCTCGAGC
GGTTTCTACC TGCCCGCCAA CGCATTCGAG AACCGGCAAT TCACCGACGT CGCCGAACAA
TTCCTCTCAC CGGACGGCAA AACAGCGCGG TTCATGATCG AAAGCAGCCA CGACCCATAC
AGCGTTGAAG CCATGGACCT CGCCAGCCGC ATCACCGACA CCGCCAACAC CGCACGACCC
AACACGTCAC TCGCCGACGC CACCGTGTCC GTAGCCGGCT TCCCCGCCGT CAACTCCGAT
ATCCAACGAC TCCTCTGGGC CGACTTCGCA CAACTAGCCA TCGCCACCAT CATCATCGTC
GGCGTCATCC TGGTCCTACT ACTGCGCGCA CTCCTAGCAC CGCTCTACCT ACTAGGCACC
GTCGTGCTCA ACTACCTCGC ATCACTCGGC ATCGGCGTCG TAGTCTTCCA ATGGGGACTG
GGCCACGAAA TCGCCTGGCC CGTACCGCTG CTGGCGTTCA TCATCCTCGT CGCCGTCGGC
GCCGACTACA ACATGCTGCT CGTCTCACGG CTCCGCGAAG AATCCGGAAC CAACATCCGC
GTCGGCGTCC TGCGCACCGT GGCAAACACC GGAGCCGTCA TCACCTCCGC TGGCCTCATA
TTCGCCGCCA GCATGTTCGG CCTCATGGTC GGCTCAGTCG CCATCATGAT CCAAGCCGGC
CTCATCATCG GCTTCGGGCT GCTGCTCGAC ACCTTCCTCG TGCGCACCCT CACCGTGCCC
GCCATCGCCA CACTCCTACG CGAAGCCAGC TGGTGGCCCA CCAAAGCAAC AAACCCGCGA
CCCGGTCAGA CCAACCCAGG AGGCATTAAG GACGCACCTC ACGTGCTGGT CCAGGAGAGG
TGA
 
Protein sequence
MTDGTADSPG QATRRGQRDN GSPNPAETRE YSERLAALAR FTLRHKALVI GVWLGAAVVL 
ALLFPQLETV VRQQSVDLIP RDAPSLQTVD RMSAAFGEEG SKTMVFVAME DPNGLTPTAR
QRYGELVRRL QGEGNHVLLV QDLLSDPITE AQAVSADRKA WYLPVGVTGT LGDPTAAESV
NAVRNIAAEV FTGSTTTVQV TGPPATFSDM IASAEHDLLL ISIATAGVIA LILLIVYRSV
FTALLPLLVI GLSLAVGRGV LSALGEMGMP VSQFTVAFMT AILLGAGTDY TVFLISRYHE
QRRAQVPADQ AVIHATASIG RVILASAATV ALAFLAMVFA RLSVFAALGP ACAIAVLFGF
LATVTLLPPV LSLAAKRGIG EPKPDRTRRY WNSVAVAVVR RPVPLLIVSL VILLALSAAA
ATIKISYDDR KGQPDTTASN LGYHLLDRHF RKDVVISEFL VVENPTDMRT GKGLADLDEM
ASRVSQIPGV TKVSGVTRPT GERLDQAELA WQNGQIGDKM AGAVAEGNSR KDDLTKLTDG
ADQLADGLAQ LDSTVRTAFT PLAGILTQAQ SAGTQVNQFR PLLQQLSATA PAVDQAIQSG
PGLRPLANQA QNAITQLDPL VGALNTSPWC ATTPQCAQIR NQVQILVTLR DNGFFNQIAD
LGDRYDPATN ATVGGTLANV QNAVASLDKA FGALGDPADL TTNLRRLQDG IGQLASGAQA
LATGVRTLAD SNIEMLSGMS QIATQLQNSS RAAADSDSSS GFYLPANAFE NRQFTDVAEQ
FLSPDGKTAR FMIESSHDPY SVEAMDLASR ITDTANTARP NTSLADATVS VAGFPAVNSD
IQRLLWADFA QLAIATIIIV GVILVLLLRA LLAPLYLLGT VVLNYLASLG IGVVVFQWGL
GHEIAWPVPL LAFIILVAVG ADYNMLLVSR LREESGTNIR VGVLRTVANT GAVITSAGLI
FAASMFGLMV GSVAIMIQAG LIIGFGLLLD TFLVRTLTVP AIATLLREAS WWPTKATNPR
PGQTNPGGIK DAPHVLVQER