Gene EcSMS35_0869 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0869 
SymbolmdfA 
ID6146383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp876305 
End bp877537 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content53% 
IMG OID641615757 
Productmultidrug translocase MdfA 
Protein accessionYP_001742949 
Protein GI170681504 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.468939 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAATA AATTAGCTTC CGGTGCCAGG CTTGGACGTC AGGCGTTACT TTTCCCTCTC 
TGTCTGGTGC TTTACGAATT TTCAACCTAT ATCGGCAACG ATATGATTCA ACCCGGTATG
TTGGCGGTGG TGGAACAATA TCAGGCGGGC ATTGATTGGG TTCCTACTTC GATGACCGCG
TATCTGGCGG GCGGGATGTT TTTACAATGG CTCCTGGGGC CGCTGTCGGA TCGTATTGGC
CGCCGTCCGG TGATGCTGGC GGGAGTGGTG TGGTTTATCG TCACCTGTCT GGCAATATTA
CTGGCGCAAA ACATTGAACA ATTCACCCTG TTGCGCTTCT TGCAGGGCAT AAGCCTCTGT
TTCATTGGCG CTGTGGGATA CGCCGCAATT CAGGAATCCT TCGAAGAGGC GGTTTGTATC
AAGATCACCG CGCTGATGGC GAACGTGGCG CTGATTGCTC CGCTACTTGG TCCGCTGGTG
GGCGCGGCGT GGATCCATGT GCTGCCCTGG GAGGGGATGT TTGTCTTGTT TGCCGCATTG
GCAGCGATCT CCTTTTTCGG TCTACAACGA GCCATGCCTG AAACCGCCAC GCGTATAGGC
GAGAAACTGT CGCTGAAAGA ACTCGGTCGT GACTATAAGC TGGTGCTGAA GAACGGTCGC
TTTGTGGCGG GGGCGCTGGC GCTGGGATTC GTTAGCCTGC CATTGCTGGC GTGGATCGCC
CAGTCGCCGA TTATCATCAT TACCGGCGAG CAGTTGAGCA GCTATGAATA TGGTTTGCTG
CAAGTGCCTA TTTTCGGGGC GTTAATTGCG GGTAACTTGC TGTTAGCGCG TCTGACCTCG
CGCCGCACCG TACGTTCGCT GATTATTATG GGCGGCTGGC CGATTATGAT TGGTCTGTTG
GTCGCTGCTG CGGCAACGGT TATCTCATCG CATGCGTATT TATGGATGAC CGCCGGGTTA
AGTCTTTATG CTTTCGGTAT TGGTCTGGCG AATGCGGGAC TGGTGCGATT AACCCTGTTT
GCCAGCGATA TGAGTAAAGG TACGGTTTCT GCGGCGATGG GGATGCTGCA AATGCTGATC
TTTACCGTCG GTATTGAAAT CAGCAAACAT GCCTGGCTGA ACGGGGGCAA CGGACTGTTT
AATCTCTTCA ACCTTGTCAA CGGCATTTTG TGGCTGTTGC TGATGGTTAT CTTTTTAAAA
GATAAACAGA TGGGAAATTC GCACGAAGGG TAA
 
Protein sequence
MQNKLASGAR LGRQALLFPL CLVLYEFSTY IGNDMIQPGM LAVVEQYQAG IDWVPTSMTA 
YLAGGMFLQW LLGPLSDRIG RRPVMLAGVV WFIVTCLAIL LAQNIEQFTL LRFLQGISLC
FIGAVGYAAI QESFEEAVCI KITALMANVA LIAPLLGPLV GAAWIHVLPW EGMFVLFAAL
AAISFFGLQR AMPETATRIG EKLSLKELGR DYKLVLKNGR FVAGALALGF VSLPLLAWIA
QSPIIIITGE QLSSYEYGLL QVPIFGALIA GNLLLARLTS RRTVRSLIIM GGWPIMIGLL
VAAAATVISS HAYLWMTAGL SLYAFGIGLA NAGLVRLTLF ASDMSKGTVS AAMGMLQMLI
FTVGIEISKH AWLNGGNGLF NLFNLVNGIL WLLLMVIFLK DKQMGNSHEG