Gene EcSMS35_0505 details


Gene Information

Locus tag: EcSMS35_0505
Symbol:
ID: 6144466
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start (bp): 509567
End (bp): 512716
Gene length: 3150 bp
Protein length: 1049 aa
Translation table: 11
GC content: 54%
IMG OID: 641615399
Product: acriflavine resistance protein B
Protein accession: YP_001742606
Protein GI: 170680127
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID: [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
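
The coordinate and length fields above are related by simple arithmetic, which the short Python sketch below checks. It assumes the start and end positions are 1-based and inclusive (the record does not state the convention) and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 509567
end_bp = 512716
gene_length_bp = 3150
protein_length_aa = 1049

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
codons = span_bp // 3

# Assumption: the final codon is a stop codon and is not part of the protein.
residues = codons - 1

print(span_bp, codons, residues)  # prints: 3150 1050 1049
```

Under these assumptions the span matches the listed gene length, and 1050 codons minus one stop codon gives the listed 1049 residues.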


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 65
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTAATT TCTTTATCGA TCGCCCGATT TTTGCGTGGG TGATCGCCAT TATCATCATG 
TTGGCAGGGG GGCTGGCGAT CCTCAAACTG CCGGTGGCGC AATATCCTAC GATTGCACCG
CCGGCAGTAA CGATCTCCGC CTCCTACCCT GGCGCTGATG CGAAAACAGT GCAGGACACG
GTGACACAGG TTATCGAACA GAATATGAAC GGTATCGATA ACCTGATGTA CATGTCCTCT
AACAGTGACT CCACGGGTAC CGTGCAGATC ACCCTGACCT TTGAGTCTGG TACTGATGCG
GATATCGCGC AGGTTCAGGT GCAGAACAAA CTGCAGCTGG CGATGCCGTT GCTGCCGCAA
GAAGTTCAGC AGCAAGGGGT GAGCGTTGAG AAATCATCCA GCAGCTTCCT GATGGTTGTC
GGCGTTATCA ACACCGATGG CACCATGACG CAGGAGGATA TCTCCGACTA CGTGGCGGCG
AATATGAAAG ATGCCATCAG CCGTACGTCG GGCGTGGGTG ACGTTCAGTT GTTCGGTTCA
CAGTACGCGA TGCGTATCTG GATGAACCCG AATGAACTGA ACAAATTCCA GCTAACGCCG
GTTGATGTCA TTACCGCCAT CAAAGCACAG AACGCCCAGG TTGCAGCTGG TCAGCTCGGT
GGTACGCCGC CGGTGAAAGG CCAACAGCTT AACGCCTCTA TTATTGCTCA GACGCGTCTG
ACCTCTACTG AAGAGTTCGG CAAAATCCTG CTGAAAGTGA ATCAGGATGG TTCCCGCGTG
CTGCTGCGTG ATGTGGCGAA AATTGAGCTG GGTGGTGAGA ACTACGACAT CATCGCAGAG
TTTAACGGCC AACCGGCTTC CGGTCTGGGG ATCAAGCTGG CGACCGGTGC AAACGCGCTG
GATACCGCTG CGGCAATCCG TGCTGAACTG GCGAAGATGG AACCGTTCTT CCCGTCGGGT
CTGAAAATTG TTTACCCGTA CGACACCACG CCGTTCGTGA AAATCTCTAT TCACGAAGTG
GTTAAAACGC TGGTCGAAGC GATCATCCTC GTGTTCCTGG TTATGTATCT GTTCCTGCAG
AACTTCCGCG CGACGTTGAT TCCGACCATT GCCGTACCGG TGGTATTGCT GGGGACCTTT
GCCGTCCTTG CCGCCTTTGG CTTCTCGATA AACACGCTAA CAATGTTCGG GATGGTGCTC
GCCATCGGCC TGTTGGTGGA TGACGCCATC GTTGTGGTAG AAAACGTTGA GCGTGTTATG
GCGGAAGAAG GTTTGCCGCC AAAAGAAGCT ACCCGTAAGT CGATGGGGCA GATTCAGGGC
GCTCTGGTCG GTATCGCGAT GGTACTGTCG GCGGTATTCG TACCGATGGC CTTCTTTGGC
GGTTCTACCG GTGCTATTTA TCGTCAGTTC TCTATTACCA TTGTTTCAGC AATGGCGCTG
TCAGTACTGG TGGCGTTGAT CCTGACTCCG GCTCTTTGTG CCACCATGCT GAAACCGATT
GCCAAAGGCG ATCACGGGGA AGGTAAAAAA GGCTTCTTCG GCTGGTTTAA CCGCATGTTC
GAAAAGAGCA CGCACCACTA CACCGACAGC GTAGGCGGTA TTCTGCGCAG TACAGGGCGT
TACCTGGTGC TGTATCTGAT CATCGTGGTC GGCATGGCCT ATCTGTTCGT GCGTCTGCCA
AGCTCCTTCT TGCCAGATGA AGACCAGGGC GTATTTATGA CCATGGTTCA GCTGCCAGCA
GGTGCAACGC AGGAACGTAC GCAGAAAGTG CTCAATGAGG TAACGCATTA CTATCTGACC
AAAGAAAAGA ACAACGTTGA GTCGGTGTTC GCCGTTAACG GCTTCGGCTT TGCGGGACGT
GGTCAGAATA CCGGTATTGC GTTCGTTTCC TTGAAGGACT GGGCCGATCG TCCGGGCGAA
GAAAACAAAG TTGAAGCGAT TACCATGCGT GCAACACGCG CTTTCTCGCA AATCAAAGAT
GCGATGGTTT TCGCCTTTAA CCTGCCCGCA ATCGTGGAAC TGGGTACTGC AACCGGCTTT
GACTTTGAGC TGATTGACCA GGCTGGCCTT GGTCACGAAA AACTGACTCA GGCGCGTAAC
CAGCTGCTTG CAGAAGCAGC GAAGCACCCT GATATGTTGA CCAGCGTACG TCCAAACGGT
CTGGAAGATA CCCCGCAGTT TAAGATTGAT ATCGACCAGG AAAAAGCGCA GGCGCTGGGT
GTTTCTATCA ACGACATTAA CACCACTCTG GGCGCTGCAT GGGGCGGTAG CTATGTGAAC
GACTTTATCG ACCGCGGTCG TGTGAAGAAA GTTTACGTCA TGTCAGAAGC GAAATACCGT
ATGCTGCCGG ATGATATCGG CGACTGGTAT GTTCGTGCTG CTGATGGTCA GATGGTGCCG
TTCTCGGCGT TCTCCTCTTC TCGTTGGGAG TACGGTTCGC CGCGTCTGGA ACGTTACAAC
GGCCTGCCAT CCATGGAAAT CTTAGGCCAG GCGGCACCGG GTAAAAGTAC CGGTGAAGCA
ATGGAGCTGA TGGAACAACT GGCGAGCAAA CTGCCTACCG GTGTTGGCTA TGACTGGACG
GGGATGTCCT ATCAGGAACG TCTCTCCGGC AACCAGGCAC CTTCACTGTA CGCGATTTCG
TTGATTGTCG TGTTCCTGTG TCTGGCGGCG CTGTACGAGA GCTGGTCGAT TCCGTTCTCC
GTTATGCTGG TCGTTCCGCT GGGGGTTATC GGTGCGTTGC TGGCTGCCAC CTTCCGTGGC
CTGACCAATG ACGTTTACTT CCAGGTAGGC CTGCTCACAA CCATTGGGTT GTCGGCGAAG
AACGCGATAC TTATCGTCGA ATTCGCCAAA GACTTGATGG ATAAAGAAGG TAAAGGTCTG
ATTGAAGCGA CGCTTGATGC GGTGCGGATG CGTTTACGCC CAATCCTGAT GACCTCGCTG
GCGTTTATCC TCGGCGTTAT GCCGCTGGTT ATCAGTACTG GTGCTGGTTC CGGCGCGCAG
AACGCAGTAG GTACCGGTGT AATGGGCGGG ATGGTGACCG CAACGGTACT GGCAATCTTC
TTCGTTCCGG TATTCTTTGT GGTGGTTCGC CGCCGCTTTA GCCGCAAGAA TGAAGATATC
GAGCACAGCC ATACTGTCGA TCATCATTGA
 
Protein sequence
MPNFFIDRPI FAWVIAIIIM LAGGLAILKL PVAQYPTIAP PAVTISASYP GADAKTVQDT 
VTQVIEQNMN GIDNLMYMSS NSDSTGTVQI TLTFESGTDA DIAQVQVQNK LQLAMPLLPQ
EVQQQGVSVE KSSSSFLMVV GVINTDGTMT QEDISDYVAA NMKDAISRTS GVGDVQLFGS
QYAMRIWMNP NELNKFQLTP VDVITAIKAQ NAQVAAGQLG GTPPVKGQQL NASIIAQTRL
TSTEEFGKIL LKVNQDGSRV LLRDVAKIEL GGENYDIIAE FNGQPASGLG IKLATGANAL
DTAAAIRAEL AKMEPFFPSG LKIVYPYDTT PFVKISIHEV VKTLVEAIIL VFLVMYLFLQ
NFRATLIPTI AVPVVLLGTF AVLAAFGFSI NTLTMFGMVL AIGLLVDDAI VVVENVERVM
AEEGLPPKEA TRKSMGQIQG ALVGIAMVLS AVFVPMAFFG GSTGAIYRQF SITIVSAMAL
SVLVALILTP ALCATMLKPI AKGDHGEGKK GFFGWFNRMF EKSTHHYTDS VGGILRSTGR
YLVLYLIIVV GMAYLFVRLP SSFLPDEDQG VFMTMVQLPA GATQERTQKV LNEVTHYYLT
KEKNNVESVF AVNGFGFAGR GQNTGIAFVS LKDWADRPGE ENKVEAITMR ATRAFSQIKD
AMVFAFNLPA IVELGTATGF DFELIDQAGL GHEKLTQARN QLLAEAAKHP DMLTSVRPNG
LEDTPQFKID IDQEKAQALG VSINDINTTL GAAWGGSYVN DFIDRGRVKK VYVMSEAKYR
MLPDDIGDWY VRAADGQMVP FSAFSSSRWE YGSPRLERYN GLPSMEILGQ AAPGKSTGEA
MELMEQLASK LPTGVGYDWT GMSYQERLSG NQAPSLYAIS LIVVFLCLAA LYESWSIPFS
VMLVVPLGVI GALLAATFRG LTNDVYFQVG LLTTIGLSAK NAILIVEFAK DLMDKEGKGL
IEATLDAVRM RLRPILMTSL AFILGVMPLV ISTGAGSGAQ NAVGTGVMGG MVTATVLAIF
FVPVFFVVVR RRFSRKNEDI EHSHTVDHH
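
The gene and protein sequences above can be cross-checked with a small script. The following is a minimal sketch in plain Python, not the method the source database used: the helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the codon table is built by hand. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons; since this gene begins with ATG, that difference does not matter here.

```python
from itertools import product

# Codon table in the usual TCAG ordering; amino acid assignments are the same
# for the standard code and for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence, copied from the listing above.
first_30 = "ATGCCTAATT TCTTTATCGA TCGCCCGATT"
last_30 = "GAGCACAGCC ATACTGTCGA TCATCATTGA"

print(translate(first_30))  # prints: MPNFFIDRPI
print(translate(last_30))   # prints: EHSHTVDHH
```

The two printed fragments match the beginning and end of the protein sequence listed above. Passing the full gene sequence to `gc_percent` and `translate` would allow the same comparison against the GC content and protein length fields.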