Gene EcSMS35_2517 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2517 
SymbolemrY 
ID6146019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2571458 
End bp2572996 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content42% 
IMG OID641617389 
Productmultidrug resistance protein Y 
Protein accessionYP_001744560 
Protein GI170682814 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily
[TIGR01168] Gram-positive signal peptide, YSIRK family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATCA CTAAATCAAC TCCGGCACCA TTAACCGGTG GGACGTTATG GTGCGTTACT 
ATTGCATTGT CATTAGCGAC GTTTATGCAA ATGTTGGATT CCACTATTTC TAACGTCGCA
ATACCGACAA TATCTGGCTT TCTGGGAGCA TCAACAGACG AAGGCACCTG GGTTATCACT
TCGTTTGGCG TAGCAAATGC CATTGCGATC CCTGTTACCG GCAGGCTGGC ACAAAGAATA
GGCGAATTAA GGTTATTTTT ACTTTCAGTC ACTTTTTTTT CGCTGTCTTC CTTAATGTGT
AGTTTATCGA CCAATCTTGA TGTGCTGATA TTTTTTAGAG TCGTTCAGGG ATTAATGGCG
GGGCCGTTAA TTCCACTGTC ACAGAGTTTA TTATTAAGGA ATTATCCACC AGAAAAAAGA
ACATTTGCTC TGGCATTATG GTCTATGACC GTGATTATCG CGCCGATATG TGGGCCGATA
TTGGGCGGTT ATATTTGTGA TAACTTTAGC TGGGGTTGGA TATTTTTAAT CAATGTCCCT
ATGGGGATTA TCGTCCTGAC ATTATGCTTA ACCTTACTTA AAGGAAGAGA AACCGAGACT
TCACCGGTCA AAATGAATCT ACCAGGACTG ACCCTGTTAG TGCTCGGTGT TGGTGGCTTG
CAAATTATGC TTGATAAAGG GCGCGATCTG GATTGGTTCA ACTCGAGTAC AATAATAATA
CTAACAGTAG TATCAGTTAT TTCTCTGATC TCTTTAGTCA TTTGGGAGTC GACCTCAGAG
AACCCGATTC TTGATCTCAG TTTGTTTAAG TCCCGTAATT TCACCATTGG TATTGTGAGT
ATCACATGCG CGTATTTATT TTACTCTGGA GCGATCGTCC TTATGCCGCA GTTACTCCAG
GAAACGATGG GGTATAATGC GATATGGGCC GGGCTTGCTT ATGCGCCCAT CGGTATCATG
CCGCTATTAA TATCACCTTT GATAGGACGT TATGGCAACA AAATAGATAT GCGGTTGTTG
GTGACATTTA GTTTTTTGAT GTATGCGGTT TGCTATTACT GGCGTTCTGT GACATTTATG
CCAACGATTG ATTTTACAGG CATCATTTTG CCACAGTTTT TTCAGGGATT CGCCGTTGCC
TGTTTCTTTT TACCCTTAAC AACGATTTCG TTTTCAGGCT TGCCAGATAA TAAATTTGCC
AATGCCTCGA GTATGAGTAA TTTTTTTCGC ACCTTGTCAG GATCAGTTGG TACGTCGTTG
ACAATGACGC TGTGGGGACG CCGAGAATCA TTACACCATA GTCAGTTGAC AGCAACCATC
GATCAATTTA ACCCCGTGTT TAATTCATCG TCACAAATTA TGGATAAATA CTATGGTTCG
CTTTCAGGAG TTCTTAATGA AATTAATAAT GAAATAACCC AGCAGTCACT TTCTATTTCT
GCAAATGAGA TTTTCCGTAT GGCGGCTATT GCTTTTATCT TACTTACGGT TTTGGTTTGG
TTTGCGAAAC CGCCGTTTAC AGCGAAAGGC GTTGGGTGA
 
Protein sequence
MAITKSTPAP LTGGTLWCVT IALSLATFMQ MLDSTISNVA IPTISGFLGA STDEGTWVIT 
SFGVANAIAI PVTGRLAQRI GELRLFLLSV TFFSLSSLMC SLSTNLDVLI FFRVVQGLMA
GPLIPLSQSL LLRNYPPEKR TFALALWSMT VIIAPICGPI LGGYICDNFS WGWIFLINVP
MGIIVLTLCL TLLKGRETET SPVKMNLPGL TLLVLGVGGL QIMLDKGRDL DWFNSSTIII
LTVVSVISLI SLVIWESTSE NPILDLSLFK SRNFTIGIVS ITCAYLFYSG AIVLMPQLLQ
ETMGYNAIWA GLAYAPIGIM PLLISPLIGR YGNKIDMRLL VTFSFLMYAV CYYWRSVTFM
PTIDFTGIIL PQFFQGFAVA CFFLPLTTIS FSGLPDNKFA NASSMSNFFR TLSGSVGTSL
TMTLWGRRES LHHSQLTATI DQFNPVFNSS SQIMDKYYGS LSGVLNEINN EITQQSLSIS
ANEIFRMAAI AFILLTVLVW FAKPPFTAKG VG