Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2517 |
Symbol | emrY |
ID | 6146019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2571458 |
End bp | 2572996 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641617389 |
Product | multidrug resistance protein Y |
Protein accession | YP_001744560 |
Protein GI | 170682814 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily [TIGR01168] Gram-positive signal peptide, YSIRK family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAATCA CTAAATCAAC TCCGGCACCA TTAACCGGTG GGACGTTATG GTGCGTTACT ATTGCATTGT CATTAGCGAC GTTTATGCAA ATGTTGGATT CCACTATTTC TAACGTCGCA ATACCGACAA TATCTGGCTT TCTGGGAGCA TCAACAGACG AAGGCACCTG GGTTATCACT TCGTTTGGCG TAGCAAATGC CATTGCGATC CCTGTTACCG GCAGGCTGGC ACAAAGAATA GGCGAATTAA GGTTATTTTT ACTTTCAGTC ACTTTTTTTT CGCTGTCTTC CTTAATGTGT AGTTTATCGA CCAATCTTGA TGTGCTGATA TTTTTTAGAG TCGTTCAGGG ATTAATGGCG GGGCCGTTAA TTCCACTGTC ACAGAGTTTA TTATTAAGGA ATTATCCACC AGAAAAAAGA ACATTTGCTC TGGCATTATG GTCTATGACC GTGATTATCG CGCCGATATG TGGGCCGATA TTGGGCGGTT ATATTTGTGA TAACTTTAGC TGGGGTTGGA TATTTTTAAT CAATGTCCCT ATGGGGATTA TCGTCCTGAC ATTATGCTTA ACCTTACTTA AAGGAAGAGA AACCGAGACT TCACCGGTCA AAATGAATCT ACCAGGACTG ACCCTGTTAG TGCTCGGTGT TGGTGGCTTG CAAATTATGC TTGATAAAGG GCGCGATCTG GATTGGTTCA ACTCGAGTAC AATAATAATA CTAACAGTAG TATCAGTTAT TTCTCTGATC TCTTTAGTCA TTTGGGAGTC GACCTCAGAG AACCCGATTC TTGATCTCAG TTTGTTTAAG TCCCGTAATT TCACCATTGG TATTGTGAGT ATCACATGCG CGTATTTATT TTACTCTGGA GCGATCGTCC TTATGCCGCA GTTACTCCAG GAAACGATGG GGTATAATGC GATATGGGCC GGGCTTGCTT ATGCGCCCAT CGGTATCATG CCGCTATTAA TATCACCTTT GATAGGACGT TATGGCAACA AAATAGATAT GCGGTTGTTG GTGACATTTA GTTTTTTGAT GTATGCGGTT TGCTATTACT GGCGTTCTGT GACATTTATG CCAACGATTG ATTTTACAGG CATCATTTTG CCACAGTTTT TTCAGGGATT CGCCGTTGCC TGTTTCTTTT TACCCTTAAC AACGATTTCG TTTTCAGGCT TGCCAGATAA TAAATTTGCC AATGCCTCGA GTATGAGTAA TTTTTTTCGC ACCTTGTCAG GATCAGTTGG TACGTCGTTG ACAATGACGC TGTGGGGACG CCGAGAATCA TTACACCATA GTCAGTTGAC AGCAACCATC GATCAATTTA ACCCCGTGTT TAATTCATCG TCACAAATTA TGGATAAATA CTATGGTTCG CTTTCAGGAG TTCTTAATGA AATTAATAAT GAAATAACCC AGCAGTCACT TTCTATTTCT GCAAATGAGA TTTTCCGTAT GGCGGCTATT GCTTTTATCT TACTTACGGT TTTGGTTTGG TTTGCGAAAC CGCCGTTTAC AGCGAAAGGC GTTGGGTGA
|
Protein sequence | MAITKSTPAP LTGGTLWCVT IALSLATFMQ MLDSTISNVA IPTISGFLGA STDEGTWVIT SFGVANAIAI PVTGRLAQRI GELRLFLLSV TFFSLSSLMC SLSTNLDVLI FFRVVQGLMA GPLIPLSQSL LLRNYPPEKR TFALALWSMT VIIAPICGPI LGGYICDNFS WGWIFLINVP MGIIVLTLCL TLLKGRETET SPVKMNLPGL TLLVLGVGGL QIMLDKGRDL DWFNSSTIII LTVVSVISLI SLVIWESTSE NPILDLSLFK SRNFTIGIVS ITCAYLFYSG AIVLMPQLLQ ETMGYNAIWA GLAYAPIGIM PLLISPLIGR YGNKIDMRLL VTFSFLMYAV CYYWRSVTFM PTIDFTGIIL PQFFQGFAVA CFFLPLTTIS FSGLPDNKFA NASSMSNFFR TLSGSVGTSL TMTLWGRRES LHHSQLTATI DQFNPVFNSS SQIMDKYYGS LSGVLNEINN EITQQSLSIS ANEIFRMAAI AFILLTVLVW FAKPPFTAKG VG
|
| |