Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2560 |
Symbol | |
ID | 6968803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 2418400 |
End bp | 2419773 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643386427 |
Product | transporter, major facilitator family |
Protein accession | YP_002270909 |
Protein GI | 209397060 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000336159 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAAAG TTCAGGCCGA CGGCCTGCCA TTGCCCCAGC GATACGGTGC GATATTAACC ATTGTGATTG GTATTTCGAT GGCTGTCCTT GACGGCGCAA TCGCCAACGT CGCCCTGCCA ACAATCGCCA CGGACCTTCA TGCCACGCCA GCCAGTTCCA TCTGGGTAGT GAACGCCTAT CAAATCGCCA TTGTCATCTC CCTGCTCTCG TTTTCGTTTC TGGGTGATAT GTTTGGCTAT CGACGTATTT ATAAATGCGG TCTGGTCGTT TTTCTGTTGT CTTCACTGTT CTGCGCCCTT TCTGATTCGC TGCAAATACT CACCCTTGCG CGTGTCATAC AAGGTTTCGG CGGTGCAGCG TTGATGAGCG TTAATACCGC CCTTATCCGC CTGATCTATC CACAACGTTT TCTGGGTAGA GGGATGGGCA TAAACTCGTT TATTGTTGCC GTCTCTTCTG CTGCCGGGCC GACAATTGCT GCAGCAATCC TCTCCATCGC ATCCTGGAAA TGGTTATTTT TAATCAACGT ACCGTTGGGT ATTATCGCCC TGCTTCTGGC GATGCGTTTT CTGCCACCTA ATGGTTCTCG CGCCAGTAAA CCCCGTTTCG ACCTGCCCAG CGCCGTGATG AACGCGTTAA CCTTCGGCCT GCTTATCACT GCGTTGAGTG GTTTCGCTCA GGGGCAATCG CTGACGTTAA TTGCTGCGGA ACTGGTTGTA ATGGTTGTCG TTGGTATTTT CTTTATTCGC CGCCAGCTTT CTCTTCCAGT ACCGCTGCTA CCGGTGGATT TACTGCGTAT CCCGCTGTTT TCACTTTCTA TTTGCACATC TGTTTGCTCT TTCTGCGCAC AAATGCTGGC AATGGTTTCC CTGCCCTTTT ACCTGCAAAC CGTGCTCGGG CGTAGTGAAG TCGAAACAGG TTTACTTCTG ACACCGTGGC CGTTAGCAAC GATGGTGATG GCTCCGTTGG CAGGCTATTT GATTGAACGC GTACATGCAG GATTGCTGGG TGCTTTAGGA TTATTCATCA TGGCTGCGGG GCTTTTTTCC CTGGTTCTGC TGCCCGCGTC ACCTGCGGAT ATCAATATTA TCTGGCCGAT GATCTTATGT GGCGCTGGAT TTGGCTTGTT CCAGTCACCC AATAACCACA CCATTATTAC CTCCGCGCCT CGCGAACGTA GCGGTGGAGC CAGTGGCATG TTAGGAACGG CTCGTCTACT GGGTCAGAGT AGCGGCGCGG CGCTGGTGGC GCTGATGCTA AATCAGTTTG GAGATAATGG TACACACGTC TCGCTGATGG CTGCGGCTAT TCTGGCAGTG ATTGCTGCCT GTATCAGTGG GTTACGTATC ACTCAGCCAA GATCCAGGGC ATAA
|
Protein sequence | MPKVQADGLP LPQRYGAILT IVIGISMAVL DGAIANVALP TIATDLHATP ASSIWVVNAY QIAIVISLLS FSFLGDMFGY RRIYKCGLVV FLLSSLFCAL SDSLQILTLA RVIQGFGGAA LMSVNTALIR LIYPQRFLGR GMGINSFIVA VSSAAGPTIA AAILSIASWK WLFLINVPLG IIALLLAMRF LPPNGSRASK PRFDLPSAVM NALTFGLLIT ALSGFAQGQS LTLIAAELVV MVVVGIFFIR RQLSLPVPLL PVDLLRIPLF SLSICTSVCS FCAQMLAMVS LPFYLQTVLG RSEVETGLLL TPWPLATMVM APLAGYLIER VHAGLLGALG LFIMAAGLFS LVLLPASPAD INIIWPMILC GAGFGLFQSP NNHTIITSAP RERSGGASGM LGTARLLGQS SGAALVALML NQFGDNGTHV SLMAAAILAV IAACISGLRI TQPRSRA
|
| |