Gene EcHS_A3459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3459 
SymbolacrF 
ID5594803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3459376 
End bp3462480 
Gene Length3105 bp 
Protein Length1034 aa 
Translation table11 
GC content51% 
IMG OID640922577 
Productacriflavine resistance protein F 
Protein accessionYP_001460065 
Protein GI157162747 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.0766453 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAACT TTTTTATTCG ACGACCGATA TTTGCATGGG TGCTGGCCAT TATTCTGATG 
ATGGCGGGCG CACTGGCGAT CCTACAATTG CCCGTCGCTC AGTATCCAAC CATTGCACCG
CCTGCGGTTT CTGTTTCGGC AAACTATCCA GGTGCTGATG CGCAGACCGT GCAGGATACG
GTGACGCAGG TTATCGAACA GAATATGAAT GGTATCGATA ACCTGATGTA TATGTCCTCC
ACCAGCGATT CCGCCGGTAG CGTGACAATT ACCCTTACCT TCCAGTCCGG GACCGATCCT
GATATCGCGC AAGTGCAGGT GCAGAACAAA CTCCAGCTCG CCACGCCGTT GCTGCCGCAG
GAAGTTCAGC AGCAGGGGAT CAGTGTTGAA AAGTCCAGTA GCAGCTATTT GATGGTGGCG
GGCTTTGTCT CTGATAACCC AGACACCACA CAGGACGATA TCTCGGACTA TGTGGCTTCT
AACGTTAAAG ATACGCTTAG CCGTCTGAAT GGCGTCGGTG ACGTACAGCT TTTCGGCGCA
CAGTATGCGA TGCGTATCTG GCTGGACGCC GATCTGCTAA ACAAATATAA ACTGACACCG
GTTGATGTGA TTAACCAGTT GAAGGTACAG AACGATCAGA TCGCTGCCGG ACAGTTGGGC
GGAACGCCAG CGTTACCAGG GCAACAGTTG AATGCCTCGA TTATTGCTCA GACGCGGTTA
AAAAATCCGG AAGAATTCGG CAAAGTGACC CTGCGCGTAA ACAGTGACGG CTCGGTGGTG
CGTCTGAAAG ATGTCGCACG GGTTGAACTT GGCGGTGAAA ACTATAACGT TATCGCTCGC
ATCAACGGAA AACCGGCGGC GGGCCTGGGG ATTAAGCTGG CAACCGGCGC GAATGCTCTC
GATACCGCGA AAGCCATTAA GGCAAAACTG GCGGAATTAC AGCCATTCTT CCCGCAGGGA
ATGAAGGTTC TCTACCCTTA TGACACCACG CCATTCGTCC AGCTTTCTAT TCACGAAGTG
GTAAAAACGC TGTTCGAAGC CATTATGCTG GTGTTCCTGG TGATGTATCT GTTCTTGCAG
AATATGCGAG CAACGCTGAT CCCCACCATT GCGGTACCCG TGGTGTTGTT AGGGACGTTT
GCCATCCTCG CCGCTTTTGG TTACTCCATC AACACACTAA CGATGTTCGG GATGGTGCTT
GCCATCGGGC TGCTCGTCGA TGATGCGATA GTGGTGGTGG AGAACGTCGA GCGCGTGATG
ATGGAGGATA AGCTCCCGCC AAAAGAAGCG ACGGAAAAAT CAATGTCGCA AATTCAGGGC
GCACTGGTGG GTATCGCGAT GGTGCTGTCA GCGGTATTTA TTCCGATGGC ATTCTTCGGC
GGTTCTACTG GGGCAATTTA TCGCCAGTTC TCTATCACCA TCGTTTCGGC AATGGCGCTT
TCTGTTCTGG TGGCATTGAT TCTTACCCCT GCGTTATGTG CAACGCTGCT TAAACCCGTC
TCTGCTGAGC ATCACGAAAA TAAGGGCGGT TTCTTCGGCT GGTTTAATAC CACCTTCGAT
CATAGCGTTA ACCACTACAC CAACAGCGTC GGCAAAATCC TCGGTTCCAC AGGACGATAT
TTACTGATCT ATGCGCTGAT TGTTGCAGGA ATGGTGGTGT TGTTTTTACG TCTTCCGTCT
TCCTTCTTAC CTGAAGAGGA TCAGGGTGTC TTTCTGACCA TGATTCAGTT ACCCGCTGGC
GCGACGCAAG AGCGGACGCA AAAAGTGTTG GATCAAGTGA CGGATTACTA TCTGAAAAAC
GAGAAAGCGA ACGTTGAAAG TGTCTTTACG GTTAACGGCT TTAGCTTCAG CGGCCAGGCA
CAAAACGCCG GTATGGCCTT CGTCAGTCTG AAACCGTGGG AAGAGCGTAA TGGTGATGAA
AACAGTGCGG AAGCGGTAAT CCATCGTGCC AAAATGGAAT TGGGCAAGAT CCGCGACGGT
TTTGTCATTC CATTCAATAT GCCAGCCATT GTTGAACTGG GCACGGCAAC GGGTTTCGAC
TTTGAGTTAA TTGATCAGGC TGGGCTGGGT CACGATGCCC TAACCCAGGC CCGTAACCAG
TTGCTTGGTA TGGCTGCGCA ACATCCTGCC AGCTTAGTCA GCGTGCGTCC TAATGGCCTG
GAAGACACCG CGCAGTTTAA ACTGGAAGTT GACCAGGAAA AGGCGCAGGC ATTAGGTGTT
TCACTTTCTG ACATCAATCA GACCATTTCA ACGGCGCTGG GTGGGACTTA CGTTAACGAC
TTCATCGACC GTGGTCGTGT GAAAAAGGTG TATGTTCAGG CGGATGCCAA ATTCCGTATG
CTGCCAGAAG ATGTCGATAA ACTTTATGTC CGCAGCGCCA ACGGCGAAAT GGTGCCATTC
TCTGCCTTTA CCACTTCACA TTGGGTGTAT GGCTCTCCGC GACTGGAACG CTACAACGGT
CTGCCGTCAA TGGAGATTCA GGGGGAAGCC GCGCCAGGAA CCAGTTCCGG CGATGCCATG
GCGTTGATGG AAAACCTTGC GTCAAAATTA CCTGCGGGCA TTGGTTATGA CTGGACGGGT
ATGTCGTATC AGGAACGCTT ATCGGGAAAC CAGGCTCCCG CTCTGGTAGC AATTTCCTTT
GTGGTTGTTT TCCTGTGCCT TGCTGCACTC TATGAAAGCT GGTCAATTCC TGTCTCGGTT
ATGTTGGTCG TGCCGTTAGG GATTGTCGGC GTGCTGCTGG CGGCGACACT CTTTAATCAA
AAAAATGACG TCTACTTTAT GGTGGGCTTG CTAACGACAA TTGGCTTGTC GGCCAAAAAC
GCTATTTTGA TTGTTGAGTT CGCTAAAGAT CTCATGGAGA AAGAGGGTAA AGGTGTTGTT
GAAGCGACAC TGATGGCAGT ACGTATGCGT CTGCGTCCTA TCCTGATGAC CTCTCTCGCC
TTTATTCTCG GCGTATTACC GCTAGCTATC AGTAACGGTG CCGGCAGTGG CGCGCAGAAC
GCAGTGGGTA TCGGGGTAAT GGGAGGAATG GTCTCTGCAA CGTTGCTGGC AATCTTCTTC
GTACCGGTGT TCTTTGTGGT GATCCGCCGT TGCTTTAAAG GATAA
 
Protein sequence
MANFFIRRPI FAWVLAIILM MAGALAILQL PVAQYPTIAP PAVSVSANYP GADAQTVQDT 
VTQVIEQNMN GIDNLMYMSS TSDSAGSVTI TLTFQSGTDP DIAQVQVQNK LQLATPLLPQ
EVQQQGISVE KSSSSYLMVA GFVSDNPDTT QDDISDYVAS NVKDTLSRLN GVGDVQLFGA
QYAMRIWLDA DLLNKYKLTP VDVINQLKVQ NDQIAAGQLG GTPALPGQQL NASIIAQTRL
KNPEEFGKVT LRVNSDGSVV RLKDVARVEL GGENYNVIAR INGKPAAGLG IKLATGANAL
DTAKAIKAKL AELQPFFPQG MKVLYPYDTT PFVQLSIHEV VKTLFEAIML VFLVMYLFLQ
NMRATLIPTI AVPVVLLGTF AILAAFGYSI NTLTMFGMVL AIGLLVDDAI VVVENVERVM
MEDKLPPKEA TEKSMSQIQG ALVGIAMVLS AVFIPMAFFG GSTGAIYRQF SITIVSAMAL
SVLVALILTP ALCATLLKPV SAEHHENKGG FFGWFNTTFD HSVNHYTNSV GKILGSTGRY
LLIYALIVAG MVVLFLRLPS SFLPEEDQGV FLTMIQLPAG ATQERTQKVL DQVTDYYLKN
EKANVESVFT VNGFSFSGQA QNAGMAFVSL KPWEERNGDE NSAEAVIHRA KMELGKIRDG
FVIPFNMPAI VELGTATGFD FELIDQAGLG HDALTQARNQ LLGMAAQHPA SLVSVRPNGL
EDTAQFKLEV DQEKAQALGV SLSDINQTIS TALGGTYVND FIDRGRVKKV YVQADAKFRM
LPEDVDKLYV RSANGEMVPF SAFTTSHWVY GSPRLERYNG LPSMEIQGEA APGTSSGDAM
ALMENLASKL PAGIGYDWTG MSYQERLSGN QAPALVAISF VVVFLCLAAL YESWSIPVSV
MLVVPLGIVG VLLAATLFNQ KNDVYFMVGL LTTIGLSAKN AILIVEFAKD LMEKEGKGVV
EATLMAVRMR LRPILMTSLA FILGVLPLAI SNGAGSGAQN AVGIGVMGGM VSATLLAIFF
VPVFFVVIRR CFKG