Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4039 |
Symbol | emrD |
ID | 6142808 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4130723 |
End bp | 4131907 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641618864 |
Product | multidrug resistance protein D |
Protein accession | YP_001746002 |
Protein GI | 170682873 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00880] Multidrug resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.679822 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGGC ATAGAAACGT CAATTTGTTA TTGATGTTGG TATTACTCGT GGCCGTCGGT CAGATGGCGC AAACCATTTA TATTCCAGCT ATTGCCGATA TGGCGCGCGA TCTCAACGTC CGTGAAGGGG CGGTGCAGAG CGTAATGGGC GCTTATCTGC TGACTTACGG TGTCTCACAG CTGTTTTATG GCCCGATTTC CGACCGTGTG GGTCGCCGAC CGGTGATCCT CGTCGGAATG TCCATTTTTA TGCTGGCAAC GCTGGTCGCG GTCACGACCT CCAGTTTGAC AGTATTGATT GCCGCCAGCG CGATGCAGGG GATGGGCACC GGCGTTGGCG GCGTAATGGC GCGTACTTTG CCGCGCGATT TATATGAACG GACACAGTTG CGCCATGCTA ATAGCCTGTT AAACATGGGA ATTCTTGTCA GTCCGTTGCT CGCACCGCTA ATCGGCGGTC TGCTGGATAC GATGTGGAAC TGGCGCGCCT GTTATCTCTT TTTGTTGGTA CTTTGTGCCG GTGTGACCTT CAGTATGGCC CGCTGGATGC CGGAAACGCG TCCGGTCGAC GCACCGCGCA CGCGCCTGCT TACCAGTTAT AAAACGCTTT TCGGTAACAG CGGTTTTAAC TGTTATTTGC TGATGCTGAT TGGCGGTCTG GCCGGGATTG CCGCCTTTGA AGCCTGCTCC GGCGTGCTGA TGGGCGCGGT GTTAGGGCTG AGCAGTATGA CGGTCAGTAT TTTGTTTATT CTGCCGATTC CGGCAGCGTT TTTTGGCGCA TGGTTTGCCG GACGTCCCAA TAAACGCTTC TCCACGTTAA TGTGGCAGTC GGTTATCTGC TGCCTGCTGG CTGGCTTGCT AATGTGGATC CCCGACTGGT TTGGCGTGAT GAATGTCTGG ACGCTGCTCG TTCCCGCCGC GCTGTTCTTT TTCGGTGCCG GGATGCTGTT TCCGCTGGCG ACCAGCGGCG CGATGGAGCC GTTCCCTTTC CTGGCGGGCA CGGCTGGCGC GCTGGTCGGC GGTCTACAAA ACATTGGTTC CGGCGTGCTG GCGTCGCTCT CTGCGATGTT GCCGCAAACC GGTCAGGGCA GCCTGGGGTT GTTGATGACC TTAATGGGAT TGTTGATCGT GCTGTGCTGG CTACCGCTGG CGACGCGGAT GTCGCATCAG GGGCAGCCCG TTTAA
|
Protein sequence | MKRHRNVNLL LMLVLLVAVG QMAQTIYIPA IADMARDLNV REGAVQSVMG AYLLTYGVSQ LFYGPISDRV GRRPVILVGM SIFMLATLVA VTTSSLTVLI AASAMQGMGT GVGGVMARTL PRDLYERTQL RHANSLLNMG ILVSPLLAPL IGGLLDTMWN WRACYLFLLV LCAGVTFSMA RWMPETRPVD APRTRLLTSY KTLFGNSGFN CYLLMLIGGL AGIAAFEACS GVLMGAVLGL SSMTVSILFI LPIPAAFFGA WFAGRPNKRF STLMWQSVIC CLLAGLLMWI PDWFGVMNVW TLLVPAALFF FGAGMLFPLA TSGAMEPFPF LAGTAGALVG GLQNIGSGVL ASLSAMLPQT GQGSLGLLMT LMGLLIVLCW LPLATRMSHQ GQPV
|
| |