Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5105 |
Symbol | emrD |
ID | 6967803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4747495 |
End bp | 4748685 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643388777 |
Product | multidrug resistance protein D |
Protein accession | YP_002273203 |
Protein GI | 209396441 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00880] Multidrug resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.336287 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATAATGA AAAGGCATAG AAACATCAAT TTGTTATTGA TGTTGGTATT ACTCGTGGCC GTCGGTCAGA TGGCGCAAAC CATTTATATT CCAGCTATTG CCGATATGGC GCGCGATCTC AACGTTCGTG AAGGGGCGGT GCAGAGCGTA ATGGGCGCTT ATCTGCTGAC TTACGGTGTC TCACAGCTGT TTTATGGCCC GATTTCCGAC CGTGTGGGTC GCCGACCGGT GATCCTCGTC GGAATGTCCA TTTTTATGCT GGCAACGCTG GTCGCGGTCA CGACCTCCAG TTTGACAGTA TTGATTGCCG CCAGCGCGAT GCAGGGGATG GGCACCGGCG TTGGCGGCGT AATGGCGCGT ACTTTGCCGC GTGATTTATA TGAACGGACA CAGTTGCGCC ACGCTAACAG CCTGTTAAAC ATGGGAATTC TTGTCAGTCC GTTGCTCGCA CCGCTAATCG GCGGTCTGCT GGATACGATG TGGAACTGGC GCGCCTGTTA TCTCTTTTTG TTGGTACTTT GTGCCGGTGT GACCTTCAGT ATGGCCCGCT GGATGCCGGA AACGCGTCCG GTCGACGCAC CGCGCACGCG CCTGCTTACC AGTTATAAAA CGCTTTTCGG TAACAGCGGT TTTAACTGTT ATTTGCTGAT GCTGATTGGC GGTCTGGCCG GGATTGCCGC CTTTGAAGCC TGCTCCGGCG TGCTGATGGG CGCGGTGTTA GGGCTGAGCA GTATGACGGT CAGTATTTTG TTTATTCTGC CGATTCCGGC GGCATTTTTT GGGGCATGGT TTGCCGGACG TCCTAATAAA CGCTTCTCAA CGTTGATGTG GCAGTCGGTT ATCTGCTGCC TGCTGGCTGG CTTACTGATG TGGATCCCCG ACTGGTTTGG CGTGATGAAT GTCTGGACGC TGCTCGTTCC CGCCGCGCTG TTCTTTTTCG GTGCCGGGAT GCTGTTTCCG CTGGCGACCA GCGGCGCGAT GGAGCCGTTC CCTTTCCTGG CGGGCACGGC TGGCGCGCTG GTCGGCGGTC TACAAAACAT TGGTTCCGGC GTGCTGGCGT CGCTCTCTGC GATGTTGCCG CAAACCGGTC AGGGCAGCCT GGGGTTGTTG ATGACCTTAA TGGGATTGTT GATCGTGCTG TGCTGGCTAC CGCTGGCGAC GCGGATGTCG CATCAGGGGC AGCCCGTTTA A
|
Protein sequence | MIMKRHRNIN LLLMLVLLVA VGQMAQTIYI PAIADMARDL NVREGAVQSV MGAYLLTYGV SQLFYGPISD RVGRRPVILV GMSIFMLATL VAVTTSSLTV LIAASAMQGM GTGVGGVMAR TLPRDLYERT QLRHANSLLN MGILVSPLLA PLIGGLLDTM WNWRACYLFL LVLCAGVTFS MARWMPETRP VDAPRTRLLT SYKTLFGNSG FNCYLLMLIG GLAGIAAFEA CSGVLMGAVL GLSSMTVSIL FILPIPAAFF GAWFAGRPNK RFSTLMWQSV ICCLLAGLLM WIPDWFGVMN VWTLLVPAAL FFFGAGMLFP LATSGAMEPF PFLAGTAGAL VGGLQNIGSG VLASLSAMLP QTGQGSLGLL MTLMGLLIVL CWLPLATRMS HQGQPV
|
| |