Gene EcSMS35_0983 details


Gene Information

Locus tag: EcSMS35_0983
Symbol: mdtD
ID: 6144223
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 992067
End bp: 993482
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 55%
IMG OID: 641615870
Product: multidrug efflux system protein MdtE
Protein accession: YP_001743062
Protein GI: 170682817
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
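
The start and end positions, gene length, and protein length listed above can be cross-checked with a short calculation. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for such records, but not stated here) and that the coding sequence ends in a single stop codon. The function names are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(992067, 993482)
print(length, protein_length(length))
```

Under these assumptions the values come out as 1416 bp and 471 aa, which agrees with the Gene Length and Protein Length fields.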


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.500741
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value: 0.355503
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGATC TTCCCGACAG CACCCGCTGG CAATTGTGGA TTGTGGCTTT CGGCTTCTTT 
ATGCAGTCGC TGGACACCAC CATCGTTAAC ACCGCGCTTC CCTCAATGGC GCAAAGCCTC
GGGGAAAGTC CGTTGCATAT GCACATGGTC ATCGTATCTT ATGTGCTGAC CGTGGCTGTG
ATGCTGCCCG CCAGCGGCTG GCTGGCGGAC AAAGTCGGCG TGCGCAATAT TTTCTTTACC
GCCATCGTGC TGTTTACTCT CGGTTCACTA TTTTGCGCAC TTTCCGGCAC GCTGAACGAA
CTGTTGCTGG CACGCGCGTT ACAGGGCGTT GGCGGTGCGA TGATGGTGCC GGTCGGCAGG
TTGACGGTGA TGAAAATCGT ACCGCGCGAG CAGTATATGG CGGCGATGAC CTTTGTCACA
CTGCCCGGTC AGGTCGGTCC GCTGCTTGGC CCGGCGCTCG GCGGTCTGCT GGTGGAGTAC
GCATCGTGGC ACTGGATCTT TTTGATCAAC ATTCCGGTGG GGATTATTGG GGCGATCGCC
ACACTGATGT TAATGCCGAA CTACACCATG CAGACGCGGC GCTTCGATCT CTCCGGCTTT
TTATTGCTGG CGGTTGGCAT GGCGGTATTA ACCCTGGCGC TGGACGGCAG TAAAGGTACA
GGTTTATCGC CGCTGGCGAT TGCTGGCCTG GTCGCAGTTG GCGTGGTAGC ACTGGTGCTT
TATCTGCTGC ACGCCAGAAA TAACAACCGC GCTCTTTTCA GTCTGAAACT GTTCCGTACT
CGTACCTTTT CGCTGGGTCT GGCGGGGAGT TTTGCCGGAC GTATCGGTAG CGGCATGTTG
CCCTTTATGA CGCCGGTTTT CCTGCAAATT GGCCTTGGCT TCTCACCGTT TCATGCCGGA
CTGATGATGA TCCCGATGGT GCTCGGCAGC ATGGGGATGA AGCGAATTGT GGTACAGGTA
GTGAATCGCT TTGGTTATCG TCGGGTACTG GTGGCGACCA CGCTGGGTCT GTCGCTGGTC
ACCCTGTTGT TTATGACCAC TGCTCTGCTG GGCTGGTACT ACGTTTTGCC GTTCGTCCTG
TTTTTACAAG GGATGGTCAA CTCGACGCGT TTCTCCTCCA TGAACACCCT GACGCTGAAA
GATCTCCCGG ACAATCTGGC GAGCAGCGGA AACAGCCTGC TGTCGATGAT TATGCAATTG
TCGATGAGTA TCGGCGTCAC TATCGCCGGG CTGTTGCTGG GACTTTTTGG TTCACAGCAT
GTCAGCGTCG ACAGCGGCAC CACACAAACC GTCTTTATGT ACACCTGGCT TAGCATGGCG
TTTATCATCG CCCTTCCGGC GTTCATCTTT GCCAGAGTGC CGAACGATAC GCATCAAAAT
GTAGCTATTT CGCGGCGAAA AAGGAGCGCG CAATGA
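
The gene sequence is printed in space-separated groups of ten bases, so it needs to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC percentage. The helper names are hypothetical, and how the database computed or rounded its own GC content value is not stated, so treat this as an illustration rather than the record's method.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a small example.
first_line = "ATGACAGATC TTCCCGACAG CACCCGCTGG CAATTGTGGA TTGTGGCTTT CGGCTTCTTT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full sequence block to `clean_sequence` gives a single string whose length can be compared with the Gene Length field.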
 
Protein sequence
MTDLPDSTRW QLWIVAFGFF MQSLDTTIVN TALPSMAQSL GESPLHMHMV IVSYVLTVAV 
MLPASGWLAD KVGVRNIFFT AIVLFTLGSL FCALSGTLNE LLLARALQGV GGAMMVPVGR
LTVMKIVPRE QYMAAMTFVT LPGQVGPLLG PALGGLLVEY ASWHWIFLIN IPVGIIGAIA
TLMLMPNYTM QTRRFDLSGF LLLAVGMAVL TLALDGSKGT GLSPLAIAGL VAVGVVALVL
YLLHARNNNR ALFSLKLFRT RTFSLGLAGS FAGRIGSGML PFMTPVFLQI GLGFSPFHAG
LMMIPMVLGS MGMKRIVVQV VNRFGYRRVL VATTLGLSLV TLLFMTTALL GWYYVLPFVL
FLQGMVNSTR FSSMNTLTLK DLPDNLASSG NSLLSMIMQL SMSIGVTIAG LLLGLFGSQH
VSVDSGTTQT VFMYTWLSMA FIIALPAFIF ARVPNDTHQN VAISRRKRSA Q
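
The protein sequence corresponds to the gene sequence translated with translation table 11. As a rough illustration, the sketch below builds a codon table from the standard 64-letter amino acid string in TCAG order and translates a coding sequence up to the first stop codon. It assumes an ATG start, so it does not handle the alternative start codons that table 11 allows, and it only accepts the unambiguous bases A, C, G and T. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (assignments used by NCBI table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACAGATC TTCCCGACAG CACCCGCTGG"))  # MTDLPDSTRW
```

The result for the first ten codons matches the first ten residues of the protein sequence above.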