Gene EcSMS35_3817 details

Gene Information

Locus tag: EcSMS35_3817
Symbol: mdtF
ID: 6146253
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3881059
End bp: 3884172
Gene Length: 3114 bp
Protein Length: 1037 aa
Translation table: 11
GC content: 53%
IMG OID: 641618643
Product: multidrug resistance protein MdtF
Protein accession: YP_001745783
Protein GI: 170682414
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID: [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
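
The start and end coordinates, gene length, and protein length in this record agree with one another if the coordinates are read as 1-based and inclusive and the final codon of the CDS is a stop codon. The following is a minimal Python sketch of that arithmetic. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 3881059
END_BP = 3884172


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3114
print(protein_length(length_bp))  # 1037
```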


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.0885908
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAACT ATTTTATTGA TCGCCCGGTT TTTGCCTGGG TACTTGCCAT TATTATGATG 
CTTGCAGGTG GTCTGGCGAT CATGAACTTA CCGGTTGCGC AGTATCCGCA GATTGCGCCA
CCGACCATTA CCATCAGCGC TACCTATCCG GGTGCCGATG CGCAAACAGT AGAAGACTCG
GTCACTCAGG TGATTGAGCA AAATATGAAT GGGCTTGATG GCCTGATGTA CATGTCTTCA
ACCAGCGATG CGGCAGGTAA TGCCTCCATC ACTCTGACCT TCGAGACTGG GACATCGCCT
GATATCGCAC AGGTTCAGGT GCAAAATAAA CTGCAGCTCG CTATGCCTTC ATTACCTGAA
GCGGTGCAGC AGCAGGGGAT TAGCGTCGAT AAGTCGAGCA GTAATATCCT GATGGTAGCG
GCATTTATTT CTGATAACGG CAGCCTCAAC CAGTACGATA TCGCGGACTA TGTAGCGTCT
AATATCAAAG ACCCGCTAAG CCGTACCGCG GGCGTTGGTA GCGTACAGCT CTTTGGTTCC
GAGTACGCCA TGCGTATCTG GCTGGACCCG CAAAAACTCA ATAAATATAA CCTGGTACCT
TCCGATGTTA TTTCCCAGAT TAAGGTGCAA AACAACCAGA TTTCCGGTGG TCAACTGGGT
GGCATGCCAC AGGCGGCAGA CCAGCAACTG AACGCCTCGA TCATTGTGCA GACGCGTCTG
CAAACACCGG AAGAATTTGG CAAAATCCTG TTGAAAGTTC AGCAAGATGG TTCGCAAGTG
CTGCTGCGTG ATGTCGCCCG CGTCGAACTT GGGGCGGAAG ATTATTCCAC CGTGGCGCGC
TATAACGGCA AACCTGCTGC CGGGATCGCC ATCAAACTGG CTACCGGAGC AAACGCCCTG
GATACCTCGC GAGCGGTGAA AGAGGAGCTG AACCGTTTAT CGGCTTATTT CCCTGCAAGC
CTGAAGACGG TTTATCCTTA CGACACCACA CCGTTTATCA AAATTTCTAT TCAGGAAGTT
TTCAAAACGC TGGTTGAGGC TATCATCCTC GTCTTCCTGG TCATGTATCT GTTTTTGCAG
AATTTCCGCG CCACAATCAT CCCGACGATT GCCGTACCGG TGGTTATTCT CGGGACGTTT
GCGATCTTGT CGGCGGTCGG TTTCACCATC AACACGTTAA CTATGTTCGG GATGGTGCTG
GCGATAGGGT TACTGGTGGA TGACGCCATC GTGGTGGTGG AGAACGTCGA GCGTGTCATT
GCGGAAGATA AGCTACCGCC GAAGGAAGCG ACGCATAAAT CGATGAGGCA GATCCAACGT
GCGCTGGTCG GGATTGCCGT TGTTCTTTCC GCAGTGTTTA TGCCGATGGC CTTTATGAGC
GGTGCAACCG GGGAGATCTA CCGACAGTTC TCCATCACGC TGATCTCCTC CATGCTGCTT
TCAGTATTTG TGGCAATGAG CCTGACCCCT GCCCTGTGCG CCACCATTCT GAAAGCCGCG
CCGGAAGGCG GTCACAAACC TAACGCCCTG TTCGCACGCT TCAACACGCT GTTTGAAAAA
TCAACTCAGC ACTATACCGA CAGCACCCGT TCGCTGTTGC GTTGTACCGG TCGCTACATG
GTGGTCTACC TGCTGATTTG CGCCGGGATG GCGGTGCTGT TCCTGCGTAC GCCGACCTCT
TTCTTACCAG AAGAGGATCA GGGGGTATTT ATGACCACCG CTCAGTTACC TTCCGGTGCC
ACCATGGTTA ACACCACGAA AGTGCTGCAA CAGGTGACGG ATTATTATCT GACTAAAGAG
AAAGATAATG TTCAGTCGGT GTTTACCGTT GGCGGCTTTG GCTTCAGCGG CCAGGGGCAA
AACAACGGTC TGGCGTTTAT CAGCCTCAAA CCGTGGTCTG AACGTGTCGG TGAGGAAAAC
TCGGTTACCG CGATCATTCA GCGGGCAATG ATTGCGTTAA GCAGTATCAA TAAAGCCGTC
GTCTTCCCGT TCAACTTACC CGCGGTGGCT GAACTGGGTA CCGCGTCAGG TTTTGATATG
GAACTGCTGG ACAACGGCAA CCTGGGGCAC GAAAAACTGA CCCAGGCGCG AAACGAGCTG
TTATCACTGG CAGCGCAATC ACCGAATCAG GTCACCGGGG TTCGCCCGAA CGGCCTGGAA
GATACGCCGA TGTTCAAAGT GAACGTCAAC GCTGCGAAAG CCGAGGCTAT GGGCGTGGCG
CTGTCTGATA TCAACCAGAC AATTTCCACC GCCTTCGGCA GCAGCTACGT GAACGACTTC
CTCAACCAGG GGCGGGTGAA AAAAGTGTAT GTCCAGGCAG GCACGCCGTT CCGTATGTTG
CCGGATAACA TCAACCAATG GTATGTACGC AACGCCTCTG GCACGATGGC ACCGCTTTCT
GCCTACTCGT CTACCGAATG GACCTATGGT TCACCGCGAC TGGAACGCTA CAACGGCATC
CCGTCAATGG AGATTTTAGG TGAAGCGGCG GCCGGTAAAA GTACCGGTGA CGCCATGAAA
TTTATGGCAG ACCTGGTCGC TAAACTTCCG GCAGGCGTCG GCTACTCATG GACCGGACTG
TCGTATCAGG AAGCGTTATC CTCAAATCAG GCTCCCGCGC TGTATGCCAT TTCACTGGTC
GTGGTGTTCC TCGCCCTCGC CGCACTTTAT GAGAGCTGGT CAATTCCGTT CTCGGTGATG
TTGGTTGTAC CGTTAGGCGT CGTTGGCGCA TTACTGGCCA CCGATCTACG CGGCTTAAGT
AATGACGTCT ACTTCCAGGT TGGTTTGCTG ACCACCATCG GGCTTTCCGC CAAAAACGCC
ATCCTGATTG TCGAATTTGC CGTTGAGATG ATGCAGAAAG AAGGGAAAAC GCCGATAGAG
GCGATCATCG AAGCGGCGCG AATGCGTTTA CGCCCAATCC TGATGACCTC TCTGGCCTTT
ATTCTCGGCG TGCTGCCGCT GGTTATCAGT CATGGTGCCG GTTCTGGCGC GCAAAACGCG
GTAGGTACCG GCGTGATGGG CGGGATGTTT GCCGCAACAG TGCTGGCAAT TTACTTCGTT
CCGGTCTTTT TCGTTGTAGT GGAACATCTC TTTGCCCGCT TTAAAAAAGC GTAA
 
Protein sequence
MANYFIDRPV FAWVLAIIMM LAGGLAIMNL PVAQYPQIAP PTITISATYP GADAQTVEDS 
VTQVIEQNMN GLDGLMYMSS TSDAAGNASI TLTFETGTSP DIAQVQVQNK LQLAMPSLPE
AVQQQGISVD KSSSNILMVA AFISDNGSLN QYDIADYVAS NIKDPLSRTA GVGSVQLFGS
EYAMRIWLDP QKLNKYNLVP SDVISQIKVQ NNQISGGQLG GMPQAADQQL NASIIVQTRL
QTPEEFGKIL LKVQQDGSQV LLRDVARVEL GAEDYSTVAR YNGKPAAGIA IKLATGANAL
DTSRAVKEEL NRLSAYFPAS LKTVYPYDTT PFIKISIQEV FKTLVEAIIL VFLVMYLFLQ
NFRATIIPTI AVPVVILGTF AILSAVGFTI NTLTMFGMVL AIGLLVDDAI VVVENVERVI
AEDKLPPKEA THKSMRQIQR ALVGIAVVLS AVFMPMAFMS GATGEIYRQF SITLISSMLL
SVFVAMSLTP ALCATILKAA PEGGHKPNAL FARFNTLFEK STQHYTDSTR SLLRCTGRYM
VVYLLICAGM AVLFLRTPTS FLPEEDQGVF MTTAQLPSGA TMVNTTKVLQ QVTDYYLTKE
KDNVQSVFTV GGFGFSGQGQ NNGLAFISLK PWSERVGEEN SVTAIIQRAM IALSSINKAV
VFPFNLPAVA ELGTASGFDM ELLDNGNLGH EKLTQARNEL LSLAAQSPNQ VTGVRPNGLE
DTPMFKVNVN AAKAEAMGVA LSDINQTIST AFGSSYVNDF LNQGRVKKVY VQAGTPFRML
PDNINQWYVR NASGTMAPLS AYSSTEWTYG SPRLERYNGI PSMEILGEAA AGKSTGDAMK
FMADLVAKLP AGVGYSWTGL SYQEALSSNQ APALYAISLV VVFLALAALY ESWSIPFSVM
LVVPLGVVGA LLATDLRGLS NDVYFQVGLL TTIGLSAKNA ILIVEFAVEM MQKEGKTPIE
AIIEAARMRL RPILMTSLAF ILGVLPLVIS HGAGSGAQNA VGTGVMGGMF AATVLAIYFV
PVFFVVVEHL FARFKKA
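
Both sequences are printed in blocks of 10 characters, 60 per line, so the spaces and line breaks have to be removed before the lengths or the GC content can be compared with the values in the record. The following is a minimal Python sketch of that clean-up, shown on the first line of the gene sequence only. The helper names `join_blocks` and `gc_fraction` are illustrative assumptions, not part of any existing tool.

```python
def join_blocks(blocked: str) -> str:
    """Remove all whitespace from a sequence printed in spaced blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied as printed.
first_line = (
    "ATGGCTAACT ATTTTATTGA TCGCCCGGTT TTTGCCTGGG TACTTGCCAT TATTATGATG "
)

dna = join_blocks(first_line)
print(len(dna))          # 60
print(gc_fraction(dna))  # GC fraction of this 60-base line only
```

Applied to the full gene sequence and the full protein sequence, the joined strings would be expected to have 3114 and 1037 characters respectively, matching the Gene Length and Protein Length fields.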