Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_0914 |
Symbol | mdfA |
ID | 5588247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 933088 |
End bp | 934320 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640924624 |
Product | multidrug translocase MdfA |
Protein accession | YP_001462039 |
Protein GI | 157159279 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAATA AATTAGCTTC CGGTGCCAGG CTTGGACGTC AGGCGTTACT TTTCCCTCTC TGTCTGGTGC TTTACGAATT TTCAACCTAT ATCGGCAACG ATATGATTCA ACCCGGTATG TTGGCCGTGG TGGAACAATA TCAGGCGGTC ATTGATTGGG TTCCTACTTC AATGACCGCA TATCTGGCGG GCGGGATGTT TTTACAATGG CTCCTGGGGC CGCTGTCGGA TCGTATTGGT CGCCGTCCGG TGATGCTGGC GGGGGTGGTG TGGTTTATCA TCACCTGTCT GGCAATATTA CTGGCGCAAA ACATTGAACA ATTCACTCTG TTGCGCTTCT TGCAGGGCAT AAGCCTCTGT TTCATTGGCG CTGTGGGATA CGCCGCAATT CAGGAATCCT TCGAAGAGGC GGTTTGTATC AAGATCACCG CGCTGATGGC GAACGTGGCG CTGATTGCTC CGCTACTTGG TCCGCTGGTG GGCGCGGCGT GGATCCATGT GCTGCCCTGG GAAGGGATGT TTGTCTTGTT TGCCGCATTG GCAGCGATCT CCTTTTTCGG TCTGCAACAA GCCATGCCTG AAACCGCCAC GCGTATAGGC GAGAAACTGT CACTGAAAGA ACTCGGTCGT GACTATAAGC TGGTGCTGAA GAACGGCCGC TTTGTGGCGG GGGCGCTGGC GCTGGGATTC GTTAGCCTGC CATTGCTGGC GTGGATCGCC CAGTCGCCGA TTATCATCAT TACCGGCGAG CAGTTGAGCA GCTATGAATA TGGCTTGCTG CAAGTGCCTA TTTTCGGGGC GTTAATTGCG GGTAACTTGT TGTTAGCGCG TTTGACCTCG CGCCGCACCG TACGTTCGCT GATAATTATG GGCGGCTGGC CGATTATGAT TGGTCTGTTG GTCGCTGCTG CGGCAACGGT TATCTCATCG CACGCGTATT TATGGATGAC CGCCGGGTTA AGTATTTATG CTTTCGGCAT TGGTCTGGCG AATGCGGGAC TGGTGCGATT AACCCTGTTT GCCAGCGATA TGAGTAAAGG TACGGTTTCT GCGGCGATGG GAATGCTGCA AATGCTGATC TTTACCGTCG GTATTGAAAT CAGCAAACAT GCCTGGCTGA ACGGGGGCAA CGGACTGTTT AATCTCTTCA ACCTTGTCAA CGGAATTTTG TGGTTGTCGC TGATGGTTAT CTTTTTAAAA GATAAACAGA TGGGAAATTC GCACGAAGGG TAA
|
Protein sequence | MQNKLASGAR LGRQALLFPL CLVLYEFSTY IGNDMIQPGM LAVVEQYQAV IDWVPTSMTA YLAGGMFLQW LLGPLSDRIG RRPVMLAGVV WFIITCLAIL LAQNIEQFTL LRFLQGISLC FIGAVGYAAI QESFEEAVCI KITALMANVA LIAPLLGPLV GAAWIHVLPW EGMFVLFAAL AAISFFGLQQ AMPETATRIG EKLSLKELGR DYKLVLKNGR FVAGALALGF VSLPLLAWIA QSPIIIITGE QLSSYEYGLL QVPIFGALIA GNLLLARLTS RRTVRSLIIM GGWPIMIGLL VAAAATVISS HAYLWMTAGL SIYAFGIGLA NAGLVRLTLF ASDMSKGTVS AAMGMLQMLI FTVGIEISKH AWLNGGNGLF NLFNLVNGIL WLSLMVIFLK DKQMGNSHEG
|
| |