Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0995 |
Symbol | mdfA |
ID | 6970133 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1012323 |
End bp | 1013555 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643385011 |
Product | multidrug translocase MdfA |
Protein accession | YP_002269511 |
Protein GI | 209400914 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.462654 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.0714029 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAATA AATTAGCTTC CGGTGCCAGG CTTGGACGTC AGGCGTTACT TTTCCCTCTC TGTCTGGTGC TTTACGAATT TTCAACCTAT ATCGGCAACG ATATGATTCA ACCCGGTATG TTGGCCGTGG TGGAACAATA TCAGGCGGGC ATTGATTGGG TTCCTACTTC GATGACCGCG TATCTGGCGG GCGGGATGTT TTTACAATGG CTCCTGGGGC CGTTGTCGGA TCGTATTGGT CGCCGTCCGG TGATGCTGGC GGGTGTGGTG TGGTTTATTG TCACCTGTCT GGCAATATTA CTGGCGCAAA ACATTGAACA ATTCACCCTG TTGCGCTTCT TGCAGGGCAT AAGCCTCTGT TTCATTGGCG CTGTGGGATA CGCCGCAATT CAGGAATCCT TCGAAGAGGC GGTTTGTATC AAGATCACCG CGCTGATGGC GAACGTGGCG CTGATTGCTC CGCTACTTGG TCCACTGGTG GGCGCGGCGT GGATCCATGT GCTGCCCTGG GAAGGGATGT TTGTCTTGTT TGCCGCATTG GCAGCGATCT CCTTTTTCGG TCTGCAACGA GCCATGCCTG AAACCGCCAC GCGTATAGGC GAGAAACTGT CGCTGAAAGA ACTCGGTCGT GACTATAAGC TGGTGCTGAA GAACGGTCGC TTTGTGGCGG GGGCGCTGGC GCTGGGATTC GTTAGCCTGC CGTTGCTGGC GTGGATCGCT CAGTCGCCGA TTATCATCAT TACCGGCGAG CAGTTGAGCA GCTATGAATA TGGTTTGCTG CAAGTGCCTA TTTTCGGGGC GTTAATTGCG GGTAACTTGC TGTTAGCGCG TCTGACCTCG CGCCGCACCG TACGTTCGCT GATTATTATG GGCGGCTGGC CGATTATGAT TGGTCTGTTG GTCGCTGCTG CGGCAACGGT TATCTCATCG CACGCGTATT TATGGATGAC CGCCGGGTTA AGTATTTATG CTTTCGGTAT TGGTCTGGCG AATGCGGGAC TGGTGCGATT AACCCTGTTT GCCAGCGATA TGAGTAAAGG TACGGTTTCT GCGGCGATGG GAATGCTGCA AATGCTGATC TTTACCGTCG GTATTGAAAT CAGCAAACAT GCCTGGCTGA ACGGGGGCAA CGGACTGTTT AATCTCTTCA ACCTTGTCAA CGGAATTTTG TGGCTGTCGC TGATGGTTAT CTTTTTAAAA GATAAACAGA TGGGAAATTC TCACGAAGGG TAA
|
Protein sequence | MQNKLASGAR LGRQALLFPL CLVLYEFSTY IGNDMIQPGM LAVVEQYQAG IDWVPTSMTA YLAGGMFLQW LLGPLSDRIG RRPVMLAGVV WFIVTCLAIL LAQNIEQFTL LRFLQGISLC FIGAVGYAAI QESFEEAVCI KITALMANVA LIAPLLGPLV GAAWIHVLPW EGMFVLFAAL AAISFFGLQR AMPETATRIG EKLSLKELGR DYKLVLKNGR FVAGALALGF VSLPLLAWIA QSPIIIITGE QLSSYEYGLL QVPIFGALIA GNLLLARLTS RRTVRSLIIM GGWPIMIGLL VAAAATVISS HAYLWMTAGL SIYAFGIGLA NAGLVRLTLF ASDMSKGTVS AAMGMLQMLI FTVGIEISKH AWLNGGNGLF NLFNLVNGIL WLSLMVIFLK DKQMGNSHEG
|
| |