Gene Mkms_4139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4139 
Symbol 
ID4612079 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4368003 
End bp4370312 
Gene Length2310 bp 
Protein Length769 aa 
Translation table11 
GC content69% 
IMG OID639793823 
ProductMMPL domain-containing protein 
Protein accessionYP_940121 
Protein GI119870169 
COG category[R] General function prediction only 
COG ID[COG2409] Predicted drug exporters of the RND superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTGTCAC GAATTGCCAG GTCGGCGATC GCCTCGCCGA AACGGATCCT GTTCGTCGCC 
GCGCTGGTGA TGGCGGCGGC CGGGCTGCTC GGCCTTCCGG TGAGCCAGAC CCTGTCCGCG
GGCGGTTTCC AGGACCCGAC GTCGGAGTCG GCGCGCGCCA CCCAGACGCT CGTCGAGAAG
TTCGACCAGG GCGACATGGA CCTGATCATC AGCGTGACCT CAGACGACGG CGCCCAGAGC
CCGGCGGCCA CCGCGGTCGG CACCGCCATC GCCGCCGAAC TCGAGGCCTC ACCGGACGTC
GCCGGCGTGG CCTCCACGTG GACCGCACCG CGCGAGGCGG TGCCGGCGCT GCTCAGCAAG
GACCGCAAGA CCGGGTTGAT CATCGCCGGG ATCACCGGCG GCGAGAGCGG TGCGCAGAAG
CACGCCAAGG AGCTCACCGA CCGGCTGGTG TACGACCGCG ACGGCGTGAC GGTGCGCGCG
GGCGGCGAGG CCATGACCTA CGTCGAGATC AACCGGCAGA CCGAGAAAGA CCTGCTGACC
ATGGAGCTGA TCGCGATCCC GCTCAGCTTC GCCGTGCTGG TGTGGGTCTT CGGCGGGCTG
CTGGCCGCGG CGCTGCCGGT GGTGGTCGGG TTGTTCGCGA TCGTGGGCAC GATGGCCGTC
CTGCGGCTGC TCACGATGGT CACCGACGTG TCGATCTTCG CGCTCAACCT GTCCATCGCG
ATGGGTCTCG CCCTCGCGAT CGACTACACC CTGCTCATCG TGAGCCGGTT CCGCGACGAA
CTCGCCGACG GTGTCGACCG GGACCGCGCA CTGATCCGCA CCATGGCGAC CGCGGGCCGG
ACCGTGTTGT TCTCCGCGCT GACCGTCGCG CTGTCGATGG CCACGATGGT GCTGTTCCCG
ATGTACTTCC TGAAGTCGTT CGCCTACGCC GGGCTGGCGG TCGTGGGGTT CGCCGCGACG
GCCGCGATCG TGGTCGCCCC GGCCGCGATC GCGCTCCTCG GAAACCGGCT CGACGACTTC
GACGTGCGCC GGCTCGCGCG CCGCCTGCTC GGCAGACCCG AGCCCACGCC CAAGTCCGTC
GAGGAGACGT TCTGGTACCG GTCGACGAAG TACGTCCAGC GCCGGTCCAT CCCGATCGGC
GTGGCGATCG TGGCGCTGCT GCTGGTGCTC GGCGCACCGT TCCTCAGCAT CAAATGGGGC
TTCCCCGACG ACCGGGTGCT GCCGCAGTCC TCGTCGGCGC GTCAGGTCGG CGACGAACTG
CGCACCAACT TCGCCGTCGA CACCGCCACC GACGTCACCG TCGTCGTTCC GGACACCGGG
GGACTGACAC CGGCGGACAT CGACCGCTAC GCCGCGGACC TCTCGCGTGT CCCGGACGTG
CTGAGCGTGG CCGCCCCGGG CGGCACCTAC GTCACCGGCG CCCGGGTCGG CCCGCCCTCG
GCGGCCAGCG GGCTGGGCGC AGGCAGCGCC TACCTCACCG TCGCCACGAA CACCGAACTG
TTCTCCGACG AGTCGGAACG ACAACTCGAC GCGTTGCACG CCGTCGCGCC ACCCGGCGGG
ACTGCCGTCC AGTTCACCGG CACCGCTCAG ATCAACCGCG ACAGTTCGGA GTCGATCACC
TCGCGGCTGC CGCTCGTGCT CGGCATCATC GCGGGAATCA GCTTCGTACT GCTGTTCCTG
CTGACCGGCA GCGTGGTGCT CCCGCTCAAG GCGCTGCTGC TCAACGTGTT GTCGTTGTCG
GCCGCGTTCG GTGCGCTGGT GTGGATCTTC CAGGAGGGCC ACCTCGGGGC GCTCGGCACG
ACGCCGACGG GGACGCTGGT CGCCAACATG CCGGTGCTGC TGTTCTGTAT CGCGTTCGGT
CTGTCGATGG ACTACGAGGT CTTCCTGATG TCGCGGATCC GGGAGTTCTG GCTGAAGTCC
GGCCGCACCC GCCAAGACGG GACCGTACCT GCTCCTCGCG TCCGCAATGA TCTTTCTGCT
CCTCGCGTCC GCAATGACGA GAGCGTGGCG CTCGGCCTCG CCCGCACCGG CCGCGTCGTC
ACCGCCGCCG CCTTGTTGAT GACGATCTCG TTCGCCGCGC TGATCGCCGC GCAGGTGTCG
TTCATGCGGA TGTTCGGCGT CGGGTTCACG CTGGCCGTGC TCGCAGACGC GACGCTGATC
CGCATGCTGC TGGTGCCGGC CTTCATGCAT CTGATGGGCC GCTGGAACTG GTGGGCACCC
AAGCCGCTGG CACGCCTGCA CGAACGGTTC GGCATCAGCG AATCGGCCGA ACCCGAGTAC
GCCGAACCGG TGGCGGCGCG CCCGGCCTGA
 
Protein sequence
MLSRIARSAI ASPKRILFVA ALVMAAAGLL GLPVSQTLSA GGFQDPTSES ARATQTLVEK 
FDQGDMDLII SVTSDDGAQS PAATAVGTAI AAELEASPDV AGVASTWTAP REAVPALLSK
DRKTGLIIAG ITGGESGAQK HAKELTDRLV YDRDGVTVRA GGEAMTYVEI NRQTEKDLLT
MELIAIPLSF AVLVWVFGGL LAAALPVVVG LFAIVGTMAV LRLLTMVTDV SIFALNLSIA
MGLALAIDYT LLIVSRFRDE LADGVDRDRA LIRTMATAGR TVLFSALTVA LSMATMVLFP
MYFLKSFAYA GLAVVGFAAT AAIVVAPAAI ALLGNRLDDF DVRRLARRLL GRPEPTPKSV
EETFWYRSTK YVQRRSIPIG VAIVALLLVL GAPFLSIKWG FPDDRVLPQS SSARQVGDEL
RTNFAVDTAT DVTVVVPDTG GLTPADIDRY AADLSRVPDV LSVAAPGGTY VTGARVGPPS
AASGLGAGSA YLTVATNTEL FSDESERQLD ALHAVAPPGG TAVQFTGTAQ INRDSSESIT
SRLPLVLGII AGISFVLLFL LTGSVVLPLK ALLLNVLSLS AAFGALVWIF QEGHLGALGT
TPTGTLVANM PVLLFCIAFG LSMDYEVFLM SRIREFWLKS GRTRQDGTVP APRVRNDLSA
PRVRNDESVA LGLARTGRVV TAAALLMTIS FAALIAAQVS FMRMFGVGFT LAVLADATLI
RMLLVPAFMH LMGRWNWWAP KPLARLHERF GISESAEPEY AEPVAARPA