Gene ECH74115_5105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5105 
SymbolemrD 
ID6967803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4747495 
End bp4748685 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content56% 
IMG OID643388777 
Productmultidrug resistance protein D 
Protein accessionYP_002273203 
Protein GI209396441 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.336287 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATAATGA AAAGGCATAG AAACATCAAT TTGTTATTGA TGTTGGTATT ACTCGTGGCC 
GTCGGTCAGA TGGCGCAAAC CATTTATATT CCAGCTATTG CCGATATGGC GCGCGATCTC
AACGTTCGTG AAGGGGCGGT GCAGAGCGTA ATGGGCGCTT ATCTGCTGAC TTACGGTGTC
TCACAGCTGT TTTATGGCCC GATTTCCGAC CGTGTGGGTC GCCGACCGGT GATCCTCGTC
GGAATGTCCA TTTTTATGCT GGCAACGCTG GTCGCGGTCA CGACCTCCAG TTTGACAGTA
TTGATTGCCG CCAGCGCGAT GCAGGGGATG GGCACCGGCG TTGGCGGCGT AATGGCGCGT
ACTTTGCCGC GTGATTTATA TGAACGGACA CAGTTGCGCC ACGCTAACAG CCTGTTAAAC
ATGGGAATTC TTGTCAGTCC GTTGCTCGCA CCGCTAATCG GCGGTCTGCT GGATACGATG
TGGAACTGGC GCGCCTGTTA TCTCTTTTTG TTGGTACTTT GTGCCGGTGT GACCTTCAGT
ATGGCCCGCT GGATGCCGGA AACGCGTCCG GTCGACGCAC CGCGCACGCG CCTGCTTACC
AGTTATAAAA CGCTTTTCGG TAACAGCGGT TTTAACTGTT ATTTGCTGAT GCTGATTGGC
GGTCTGGCCG GGATTGCCGC CTTTGAAGCC TGCTCCGGCG TGCTGATGGG CGCGGTGTTA
GGGCTGAGCA GTATGACGGT CAGTATTTTG TTTATTCTGC CGATTCCGGC GGCATTTTTT
GGGGCATGGT TTGCCGGACG TCCTAATAAA CGCTTCTCAA CGTTGATGTG GCAGTCGGTT
ATCTGCTGCC TGCTGGCTGG CTTACTGATG TGGATCCCCG ACTGGTTTGG CGTGATGAAT
GTCTGGACGC TGCTCGTTCC CGCCGCGCTG TTCTTTTTCG GTGCCGGGAT GCTGTTTCCG
CTGGCGACCA GCGGCGCGAT GGAGCCGTTC CCTTTCCTGG CGGGCACGGC TGGCGCGCTG
GTCGGCGGTC TACAAAACAT TGGTTCCGGC GTGCTGGCGT CGCTCTCTGC GATGTTGCCG
CAAACCGGTC AGGGCAGCCT GGGGTTGTTG ATGACCTTAA TGGGATTGTT GATCGTGCTG
TGCTGGCTAC CGCTGGCGAC GCGGATGTCG CATCAGGGGC AGCCCGTTTA A
 
Protein sequence
MIMKRHRNIN LLLMLVLLVA VGQMAQTIYI PAIADMARDL NVREGAVQSV MGAYLLTYGV 
SQLFYGPISD RVGRRPVILV GMSIFMLATL VAVTTSSLTV LIAASAMQGM GTGVGGVMAR
TLPRDLYERT QLRHANSLLN MGILVSPLLA PLIGGLLDTM WNWRACYLFL LVLCAGVTFS
MARWMPETRP VDAPRTRLLT SYKTLFGNSG FNCYLLMLIG GLAGIAAFEA CSGVLMGAVL
GLSSMTVSIL FILPIPAAFF GAWFAGRPNK RFSTLMWQSV ICCLLAGLLM WIPDWFGVMN
VWTLLVPAAL FFFGAGMLFP LATSGAMEPF PFLAGTAGAL VGGLQNIGSG VLASLSAMLP
QTGQGSLGLL MTLMGLLIVL CWLPLATRMS HQGQPV