Gene ECH74115_3598 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3598 
SymbolemrY 
ID6967306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3312891 
End bp3314429 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content42% 
IMG OID643387395 
Productmultidrug resistance protein Y 
Protein accessionYP_002271854 
Protein GI209400997 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily
[TIGR01168] Gram-positive signal peptide, YSIRK family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.0257273 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATCA CTAAATCAAC TCCGGCACCA TTAACCGGTG GGACGTTATG GTGCGTCACT 
ATTGCATTGT CATTAGCGAC GTTTATGCAA ATGTTGGATT CCACTATTTC TAACGTCGCA
ATACCGACAA TATCTGGCTT TCTGGGAGCA TCAACAGACG AAGGCACCTG GGTTATCACC
TCGTTTGGCG TAGCAAATGC CATTGCGATC CCTGTTACCG GCAGGCTGGC ACAAAGAATA
GGCGAATTAA GGTTATTTTT ACTTTCAGTC ACTTTTTTTT CGCTGTCTTC CTTAATGTGT
AGTTTATCGA TCAATCTTGA TGTGCTGATA TTTTTTAGAG TCGTTCAGGG GTTAATGGCA
GGGCCGTTAA TTCCACTGTC ACAGAGTTTA TTATTAAGGA ATTATCCACC AGAAAAAAGA
ACATTTGCTC TGGCATTATG GTCAATGACC GTGATTATCG CTCCGATATG TGGGCCGATA
TTGGGCGGTT ATATTTGTGA TAACTTTAGC TGGGGTTGGA TATTTTTAAT CAATGTCCCT
ATGGGGATTA TCGTCCTGAC ATTATGCTTA ACCTTACTTA AAGGAAGAGA AACTGAGACT
TCACCGGTCA AAATGAATCT ACCAGGACTG ACCCTGTTAG TGCTCGGTGT TGGTGGCTTG
CAAATTATGC TTGATAAAGG GCGCGATCTG GATTGGTTCA ACTCGAGTAC AATAATAATA
TTAACAGTAG TATCAGTTAT TTTTCTGATC TCTTTAGTCA TTTGGGAGTC GACCTCAGAG
AACCCGATTC TTGATCTCAG TTTGTTTAAG TCCCGTAACT TCACCATTGG TATTGTGAGT
ATCACATGCG CGTATTTATT TTACTCTGGA GCGATCGTCC TTATGCCGCA GTTACTCCAG
GAAACGATGG GGTATAATGC GATATGGGCC GGACTTGCTT ATGCGCCCAT CGGCATCATG
CCACTATTAA TTTCACCTTT GATAGGACGT TATGGCAACA AAATAGACAT GCGGGTGTTG
GTGACATTCA GTTTTTTGAT GTATGCGGTT TGCTATTACT GGCGTTCTGT GACATTTATG
CCAACGATTG ATTTTACAGG TATCATTTTG CCGCAGTTTT TTCAGGGATT CGCCGTTGCC
TGTTTCTTTT TACCCTTAAC AACGATTTCG TTTTCAGGCT TGCCAGATAA TAAATTTGCC
AATGCCTCGA GTATGAGTAA TTTTTTTCGT ACCTTGTCAG GATCAGTTGG TACGTCGTTG
ACAATGACGC TGTGGGGACG ACGCGAATCG TTACACCATA GTCAGTTGAC AGAAACCATC
GATCAATTTA ACCCCGTGTT TAATTCATCG TCACAAATTA TGGATAAATA CTATGGTTCG
CTTTCAGGAG TTCTTAATGA AATTAATAAT GAAATAACCC AGCAGTCACT TTCTATTTCT
GCAAATGAGA TTTTCCGTAT GGCGGCTATT GCTTTTATCT TACTTACGGT TTTGGTTTGG
TTTGCGAAAC CGCCGTTTAC AGCGAAAGGC GTTGGGTGA
 
Protein sequence
MAITKSTPAP LTGGTLWCVT IALSLATFMQ MLDSTISNVA IPTISGFLGA STDEGTWVIT 
SFGVANAIAI PVTGRLAQRI GELRLFLLSV TFFSLSSLMC SLSINLDVLI FFRVVQGLMA
GPLIPLSQSL LLRNYPPEKR TFALALWSMT VIIAPICGPI LGGYICDNFS WGWIFLINVP
MGIIVLTLCL TLLKGRETET SPVKMNLPGL TLLVLGVGGL QIMLDKGRDL DWFNSSTIII
LTVVSVIFLI SLVIWESTSE NPILDLSLFK SRNFTIGIVS ITCAYLFYSG AIVLMPQLLQ
ETMGYNAIWA GLAYAPIGIM PLLISPLIGR YGNKIDMRVL VTFSFLMYAV CYYWRSVTFM
PTIDFTGIIL PQFFQGFAVA CFFLPLTTIS FSGLPDNKFA NASSMSNFFR TLSGSVGTSL
TMTLWGRRES LHHSQLTETI DQFNPVFNSS SQIMDKYYGS LSGVLNEINN EITQQSLSIS
ANEIFRMAAI AFILLTVLVW FAKPPFTAKG VG