Gene Mkms_4044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4044 
Symbol 
ID4611984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4267296 
End bp4269236 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content70% 
IMG OID639793728 
ProductEmrB/QacA family drug resistance transporter 
Protein accessionYP_940026 
Protein GI119870074 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily
[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCTCCA CGTCCGAAGC GCGACCGGTG ACCGGTGACC CATGGCATGC GCTGTGGGCC 
ATGATGGTCG GCTTCTTCAT GATCCTGGTC GACGCCACCA TCGTGGCGGT CGCCAACCCG
GTGCTCATGG AGAAGCTGGG CGCCGACTAC GACGCGGTGA TCTGGGTGAC CAGCGCCTAT
CTGCTGGCCT ACGCGGTCCC GCTGCTGGTA GCGGGTCGGC TGGGGGACCG GTTCGGGCCC
AAGAACGTCT ACCTGGTCGG TCTGGCCGTA TTCACCGCCG CCTCGCTGTG GTGCGGGCTG
GCCGGGTCCA TCGACACGCT GATCGCCGCA CGGGTCGTCC AGGGCGTCGG CGCGGCACTG
CTGACGCCGC AGACCCTGTC GACGATCACC CGCATCTTCC CGCCCGAGCG GCGCGGTGTG
GCGATGAGTG TGTGGGGTGC GACGGCCGGG GTGGCCACGC TGGTGGGGCC GCTGGCGGGT
GGCGTGCTGG TGGACCACCT CGGTTGGCAG TGGATCTTCT TCGTCAACGT GCCGGTCGGG
GTGGCGGGCC TGGCGTTGGC GTTCTGGCGG GTGCCCGCGC TGACCACCAC CGCGCACCGG
TTCGACCTGC TCGGGGTGCT GCTGTCCGGG GTCGGGATGT TCGCGATCGT CTTCGGGCTG
CAGGAGGGGC AGTCCCACGG CTGGCAGCCC TGGATCTGGG TGGTCATCGT GGCCGGCGTC
GCGGTGATGG CCGGGTTCGT CTACTGGCAG TCGGTCAACC CCCACGAACC GTTGATCCCG
TTGCGGATCT TCGGTGACCG CGACTTCTCG CTGTCCAGCT TCGGCGTCGC CGTCATCGGC
TTCGTGGTGA CCGGCATGAT CGTGCCCGCG ATGTTCTTCG CCCAGGCGGT GTGTGGGCTG
TCGCCGACCG AGTCGGCGCT GCTGACCGCG CCGATGGCCA TCACCAGTGG AGTGCTGGCA
CCCGCGGTCG GGCGAATCGT CGACCGCGCC CATCCGCGAC CGATCGTCGG GTTCGGGTTC
TCGGCGCTGG CGATCGGGCT GACCTGGTTG TCGATCGAGA TGACGCCGGA TACGCCGATC
TGGCGGCTGG TCCTGCCGTT CCTCGTGATG GGTATCGGTA TGGCGTTCAT CTGGTCGCCG
CTGGCGGCCA CCGCGACCCG GAATCTGGCA CCGCATCTCG CCGGCGCCGG GTCCGGCGTC
TACAACGCGA CGCGTCAGGT CGGGTCGGTG CTGGGCAGCG CCGGCATGGC GGCGTTCATG
ACGTCGCGGA TCAGCGCGGA GATGCCGTCG GCGCAGGGCG GGGCGCCGCG CGGGGAGGGG
GCGGTGTCGG CGCTCCCCGA ATTCCTTCAG GTGCCGTTCG CCGCCGCGAT GTCGCAGTCG
CTGCTGCTGC CGGCGTTCGT GGCGTTGATC GGGGTGGTCG CGGCGATCTT CCTGCGCGGC
TTCGGCGAGG TATCGGCGCC GGTTGTCGCC GCGCCGGTGG CGCGCGCCGA CCCCCGGGAC
GACTCCGTCG ACGACAGCCA CGGCTATGAC GACGACGACT ACCTCGAGTA CGCCGTCAGC
TGGGACGATC TTCAGTTCAC CGAACCCATC TCGACCCGTC CTGAGGTCGG CGCCGACGAC
AGTGTCACGA CGCCGCTCGC AACGCGGGGC CGCCGCGCGG AGGCGGCCGC CGAGCCGTCC
GGTGCCGACG ATCCGTGGCG CCGTGTGCTC GACGAACTGC TACCGGAGCC GCCGGCCCGG
CCCGAGGCCG AGCCGATCGG CTTCGCGCAC AACGGTTTCC ATGTGGAGGG AGAGGAGACG
CCGGTCGACG ACAGACGGGG CCGGCGATAC CGAGACGACG GCGACGCTGC ACCCGCATGG
CTGCGCGAGT TCGGTGAACG CTCGCGCCGC GGAACGGACA CCCCCTCCGG CGGCCGTCAT
TCACGCCGCG ACGGCGACTG A
 
Protein sequence
MFSTSEARPV TGDPWHALWA MMVGFFMILV DATIVAVANP VLMEKLGADY DAVIWVTSAY 
LLAYAVPLLV AGRLGDRFGP KNVYLVGLAV FTAASLWCGL AGSIDTLIAA RVVQGVGAAL
LTPQTLSTIT RIFPPERRGV AMSVWGATAG VATLVGPLAG GVLVDHLGWQ WIFFVNVPVG
VAGLALAFWR VPALTTTAHR FDLLGVLLSG VGMFAIVFGL QEGQSHGWQP WIWVVIVAGV
AVMAGFVYWQ SVNPHEPLIP LRIFGDRDFS LSSFGVAVIG FVVTGMIVPA MFFAQAVCGL
SPTESALLTA PMAITSGVLA PAVGRIVDRA HPRPIVGFGF SALAIGLTWL SIEMTPDTPI
WRLVLPFLVM GIGMAFIWSP LAATATRNLA PHLAGAGSGV YNATRQVGSV LGSAGMAAFM
TSRISAEMPS AQGGAPRGEG AVSALPEFLQ VPFAAAMSQS LLLPAFVALI GVVAAIFLRG
FGEVSAPVVA APVARADPRD DSVDDSHGYD DDDYLEYAVS WDDLQFTEPI STRPEVGADD
SVTTPLATRG RRAEAAAEPS GADDPWRRVL DELLPEPPAR PEAEPIGFAH NGFHVEGEET
PVDDRRGRRY RDDGDAAPAW LREFGERSRR GTDTPSGGRH SRRDGD