Gene ECH74115_0995 details

Gene Information

Locus tag: ECH74115_0995
Symbol: mdfA
ID: 6970133
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1012323
End bp: 1013555
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 52%
IMG OID: 643385011
Product: multidrug translocase MdfA
Protein accession: YP_002269511
Protein GI: 209400914
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
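
The Start bp, End bp, Gene Length and Protein Length values above are mutually consistent, which can be checked with a small sketch. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; both are assumptions about this record's conventions, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1012323
end_bp = 1013555

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # compare with the Gene Length field
print(protein_length, "aa")   # compare with the Protein Length field
```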


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.462654
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.0714029
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAATA AATTAGCTTC CGGTGCCAGG CTTGGACGTC AGGCGTTACT TTTCCCTCTC 
TGTCTGGTGC TTTACGAATT TTCAACCTAT ATCGGCAACG ATATGATTCA ACCCGGTATG
TTGGCCGTGG TGGAACAATA TCAGGCGGGC ATTGATTGGG TTCCTACTTC GATGACCGCG
TATCTGGCGG GCGGGATGTT TTTACAATGG CTCCTGGGGC CGTTGTCGGA TCGTATTGGT
CGCCGTCCGG TGATGCTGGC GGGTGTGGTG TGGTTTATTG TCACCTGTCT GGCAATATTA
CTGGCGCAAA ACATTGAACA ATTCACCCTG TTGCGCTTCT TGCAGGGCAT AAGCCTCTGT
TTCATTGGCG CTGTGGGATA CGCCGCAATT CAGGAATCCT TCGAAGAGGC GGTTTGTATC
AAGATCACCG CGCTGATGGC GAACGTGGCG CTGATTGCTC CGCTACTTGG TCCACTGGTG
GGCGCGGCGT GGATCCATGT GCTGCCCTGG GAAGGGATGT TTGTCTTGTT TGCCGCATTG
GCAGCGATCT CCTTTTTCGG TCTGCAACGA GCCATGCCTG AAACCGCCAC GCGTATAGGC
GAGAAACTGT CGCTGAAAGA ACTCGGTCGT GACTATAAGC TGGTGCTGAA GAACGGTCGC
TTTGTGGCGG GGGCGCTGGC GCTGGGATTC GTTAGCCTGC CGTTGCTGGC GTGGATCGCT
CAGTCGCCGA TTATCATCAT TACCGGCGAG CAGTTGAGCA GCTATGAATA TGGTTTGCTG
CAAGTGCCTA TTTTCGGGGC GTTAATTGCG GGTAACTTGC TGTTAGCGCG TCTGACCTCG
CGCCGCACCG TACGTTCGCT GATTATTATG GGCGGCTGGC CGATTATGAT TGGTCTGTTG
GTCGCTGCTG CGGCAACGGT TATCTCATCG CACGCGTATT TATGGATGAC CGCCGGGTTA
AGTATTTATG CTTTCGGTAT TGGTCTGGCG AATGCGGGAC TGGTGCGATT AACCCTGTTT
GCCAGCGATA TGAGTAAAGG TACGGTTTCT GCGGCGATGG GAATGCTGCA AATGCTGATC
TTTACCGTCG GTATTGAAAT CAGCAAACAT GCCTGGCTGA ACGGGGGCAA CGGACTGTTT
AATCTCTTCA ACCTTGTCAA CGGAATTTTG TGGCTGTCGC TGATGGTTAT CTTTTTAAAA
GATAAACAGA TGGGAAATTC TCACGAAGGG TAA
 
Protein sequence
MQNKLASGAR LGRQALLFPL CLVLYEFSTY IGNDMIQPGM LAVVEQYQAG IDWVPTSMTA 
YLAGGMFLQW LLGPLSDRIG RRPVMLAGVV WFIVTCLAIL LAQNIEQFTL LRFLQGISLC
FIGAVGYAAI QESFEEAVCI KITALMANVA LIAPLLGPLV GAAWIHVLPW EGMFVLFAAL
AAISFFGLQR AMPETATRIG EKLSLKELGR DYKLVLKNGR FVAGALALGF VSLPLLAWIA
QSPIIIITGE QLSSYEYGLL QVPIFGALIA GNLLLARLTS RRTVRSLIIM GGWPIMIGLL
VAAAATVISS HAYLWMTAGL SIYAFGIGLA NAGLVRLTLF ASDMSKGTVS AAMGMLQMLI
FTVGIEISKH AWLNGGNGLF NLFNLVNGIL WLSLMVIFLK DKQMGNSHEG
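
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal sketch. It is not the tool used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in permitted start codons), and the helper names `clean`, `gc_fraction` and `translate` are hypothetical. The record's sequences are printed in space-separated blocks of ten, so whitespace is stripped first.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
# Amino-acid assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last display lines of the gene sequence above.
first_line = "ATGCAAAATA AATTAGCTTC CGGTGCCAGG CTTGGACGTC AGGCGTTACT TTTCCCTCTC"
last_line = "GATAAACAGA TGGGAAATTC TCACGAAGGG TAA"

print(translate(first_line))  # compare with the start of the protein sequence
print(translate(last_line))   # compare with the end of the protein sequence
```

Passing the full gene sequence to `gc_fraction` and `translate` gives values that can be compared with the GC content and Protein Length fields of the record.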