Gene EcolC_1564 details

Gene Information

Locus tag: EcolC_1564
Symbol:
ID: 6065361
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1730226
End bp: 1731641
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 55%
IMG OID: 641600980
Product: multidrug efflux system protein MdtE
Protein accession: YP_001724550
Protein GI: 170019596
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
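
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1730226
end_bp = 1731641
gene_length_bp = 1416
protein_length_aa = 471

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the reported protein length.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # prints: 1416 0 471
```

Under these assumptions the computed span equals the reported gene length, the span is a whole number of codons, and the derived protein length equals the reported 471 aa.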


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.886181
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.734188
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGATC TTCCCGACAG CACCCGTTGG CAATTGTGGA TTGTGGCTTT CGGCTTCTTT 
ATGCAGTCGC TGGACACCAC CATCGTAAAC ACCGCCCTTC CCTCAATGGC GCAAAGCCTC
GGGGAAAGTC CGTTGCATAT GCACATGGTC ATTGTCTCTT ATGTGCTGAC CGTGGCGGTG
ATGCTGCCCG CCAGCGGCTG GCTGGCGGAC AAAGTCGGCG TGCGCAATAT TTTCTTTACC
GCCATCGTGC TGTTTACTCT CGGTTCACTG TTTTGCGCGC TTTCCGGCAC GCTGAACGAA
CTGTTGCTGG CACGCGCGTT ACAGGGCGTT GGCGGCGCGA TGATGGTGCC GGTCGGCAGA
TTGACGGTGA TGAAAATCGT ACCGCGCGAG CAATATATGG CGGCGATGAC CTTTGTCACG
TTACCCGGTC AGGTCGGTCC GCTGCTCGGT CCGGCGCTCG GCGGTCTGCT GGTGGAGTAC
GCATCGTGGC ACTGGATCTT TTTGATCAAC ATTCCGGTGG GGATTATCGG TGCGATCGCC
ACATTGCTGT TAATGCCAAA CTACACCATG CAGACGCGGC GCTTTGATCT CTCCGGATTT
TTATTGCTGG CGGTTGGCAT GGCGGTATTA ACCCTGGCGC TGGACGGCAG TAAAGGTACA
GGTTTATCGC CGCTGACGAT TGCAGGCCTG GTCGCAGTTG GCGTGGTGGC ACTGGTGCTT
TATCTGCTGC ACGCCAGAAA TAACAACCGT GCCCTGTTCA GTCTGAAACT GTTCCGTACT
CGTACCTTTT CGCTGGGCCT GGCGGGGAGC TTTGCCGGAC GTATTGGCAG TGGCATGTTG
CCCTTTATGA CACCGGTTTT CCTGCAAATT GGCCTCGGTT TCTCGCCGTT TCATGCCGGA
CTGATGATGA TCCCGATGGT GCTTGGCAGC ATGGGAATGA AGCGAATTGT GGTACAGGTG
GTGAATCGCT TTGGTTATCG TCGGGTACTG GTAGCGACCA CGCTGGGTCT GTCGCTGGTC
ACCCTGTTGT TTATGACTAC CGCCCTGCTG GGCTGGTACT ACGTTTTGCC GTTCGTCCTG
TTTTTACAAG GGATGGTCAA CTCGACGCGT TTCTCCTCCA TGAACACCCT GACGCTGAAA
GATCTCCCGG ACAATCTGGC GAGCAGCGGC AACAGCCTGC TGTCGATGAT TATGCAATTG
TCGATGAGTA TCGGCGTCAC TATCGCCGGG CTGTTGCTGG GACTTTTTGG TTCACAGCAT
GTCAGCGTCG ACAGCGGCAC CACACAAACC GTCTTTATGT ACACCTGGCT TAGCATGGCG
TTGATCATCG CCCTTCCGGC GTTCATCTTT GCCAGAGTGC CGAACGATAC GCATCAAAAT
GTAGCTATTT CGCGGCGAAA AAGGAGCGCG CAATGA
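
The gene sequence above is grouped in blocks of ten bases, so the whitespace has to be removed before any computation. The sketch below shows one way a GC fraction, such as the GC content field reported for this gene, could be computed from pasted text. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample is only the first three blocks of the sequence, so its value is not expected to match the figure for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases, and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence, used as a short sample.
sample = clean_sequence("ATGACAGATC TTCCCGACAG CACCCGTTGG")
print(len(sample), round(gc_fraction(sample), 3))  # prints: 30 0.567
```

To apply this to the whole gene, pass the full multi-line sequence text to `clean_sequence` first.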
 
Protein sequence
MTDLPDSTRW QLWIVAFGFF MQSLDTTIVN TALPSMAQSL GESPLHMHMV IVSYVLTVAV 
MLPASGWLAD KVGVRNIFFT AIVLFTLGSL FCALSGTLNE LLLARALQGV GGAMMVPVGR
LTVMKIVPRE QYMAAMTFVT LPGQVGPLLG PALGGLLVEY ASWHWIFLIN IPVGIIGAIA
TLLLMPNYTM QTRRFDLSGF LLLAVGMAVL TLALDGSKGT GLSPLTIAGL VAVGVVALVL
YLLHARNNNR ALFSLKLFRT RTFSLGLAGS FAGRIGSGML PFMTPVFLQI GLGFSPFHAG
LMMIPMVLGS MGMKRIVVQV VNRFGYRRVL VATTLGLSLV TLLFMTTALL GWYYVLPFVL
FLQGMVNSTR FSSMNTLTLK DLPDNLASSG NSLLSMIMQL SMSIGVTIAG LLLGLFGSQH
VSVDSGTTQT VFMYTWLSMA LIIALPAFIF ARVPNDTHQN VAISRRKRSA Q
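
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, assuming the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for internal codons (the tables differ in which codons may act as start codons, and this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative; an established library such as Biopython would normally be preferred for real work.

```python
from itertools import product

# Codons enumerated in TCAG order, third base varying fastest, paired with
# the amino acids of the standard genetic code ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, one codon at a time."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First three blocks and the final line of the gene sequence above.
print(translate("ATGACAGATC TTCCCGACAG CACCCGTTGG"))
print(translate("GTAGCTATTT CGCGGCGAAA AAGGAGCGCG CAATGA"))
```

The first call reproduces the opening residues of the protein sequence (MTDLPDSTRW), and the second reproduces its final residues followed by `*` for the terminal TGA stop codon.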