Gene EcolC_0203 details


Gene Information

Locus tag: EcolC_0203
Symbol:
ID: 6065313
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 231128
End bp: 234241
Gene Length: 3114 bp
Protein Length: 1037 aa
Translation table: 11
GC content: 53%
IMG OID: 641599604
Product: hydrophobe/amphiphile efflux-1 (HAE1) family protein
Protein accession: YP_001723211
Protein GI: 170018257
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID: [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
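
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function name `lengths_from_coords` is hypothetical and not part of the record.

```python
def lengths_from_coords(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for a CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that does not contribute an amino acid.
    """
    gene_length = end_bp - start_bp + 1   # inclusive span
    codons = gene_length // 3             # complete codons in the CDS
    protein_length = codons - 1           # drop the stop codon
    return gene_length, protein_length


# Values taken from the record above.
gene_length, protein_length = lengths_from_coords(231128, 234241)
print(gene_length, protein_length)
```

With the values from this record, the result agrees with the listed Gene Length (3114 bp) and Protein Length (1037 aa).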


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.115131
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAACT ATTTTATTGA TCGCCCGGTT TTTGCCTGGG TACTTGCCAT TATTATGATG 
CTTGCAGGTG GTCTGGCGAT CATGAACTTA CCGGTTGCGC AGTATCCGCA GATTGCGCCA
CCGACCATTA CCGTCAGCGC TACCTATCCA GGTGCCGATG CGCAAACGGT AGAAGACTCG
GTCACTCAGG TGATTGAGCA AAATATGAAT GGGCTTGATG GCCTGATGTA CATGTCTTCA
ACCAGTGATG CGGCGGGCAA TGCCTCTATC ACTCTGACCT TCGAGACTGG GACATCTCCT
GATATCGCAC AGGTTCAAGT GCAAAATAAA CTGCAACTCG CTATGCCTTC ATTACCTGAA
GCAGTGCAGC AGCAGGGGAT TAGCGTCGAT AAGTCGAGCA GTAATATCCT GATGGTAGCG
GCGTTTATTT CTGATAACGG CAGCCTCAAC CAGTACGATA TCGCGGACTA TGTAGCGTCT
AATATCAAAG ACCCGCTAAG CCGTACCGCG GGCGTTGGTA GCGTACAACT CTTTGGTTCC
GAGTATGCCA TGCGTATCTG GCTGGACCCG CAAAAACTCA ATAAATATAA CCTGGTACCT
TCCGATGTTA TTTCCCAGAT TAAGGTGCAA AACAACCAGA TTTCCGGTGG TCAACTGGGT
GGCATGCCAC AGGCGGCAGA CCAGCAGCTA AACGCCTCGA TCATTGTGCA GACGCGTCTG
CAAACGCCGG AAGAATTTGG CAAAATCCTG TTGAAAGTTC AGCAAGATGG TTCGCAAGTG
CTGCTGCGTG ATGTCGCTCG CGTCGAACTT GGGGCGGAAG ATTATTCCAC CGTGGCGCGC
TATAACGGCA AACCTGCTGC CGGGATCGCC ATCAAACTGG CTGCCGGAGC AAACGCCCTG
GATACCTCGC GGGCAGTCAA AGAGGAACTG AACCGCTTAT CAGCCTATTT CCCGGCAAGT
CTGAAGACGG TTTATCCTTA CGACACCACG CCGTTTATCG AAATTTCTAT TCAGGAAGTT
TTCAAAACAC TGGTTGAGGC TATCATCCTA GTCTTCCTGG TCATGTATCT GTTTTTGCAG
AATTTCCGTG CCACAATCAT CCCGACGATT GCCGTACCGG TGGTTATTCT CGGGACGTTT
GCGATCTTGT CGGCGGTCGG TTTCACCATC AACACGTTGA CTATGTTCGG GATGGTGCTG
GCGATAGGGT TACTGGTGGA TGACGCCATC GTGGTGGTGG AGAACGTCGA GCGTGTCATT
GCGGAAGATA AGCTACCGCC GAAGGAAGCG ACGCATAAAT CGATGGGGCA GATCCAACGT
GCGCTGGTCG GTATTGCCGT TGTTCTTTCC GCAGTGTTTA TGCCGATGGC CTTTATGAGC
GGTGCAACCG GGGAGATCTA CCGCCAGTTC TCCATCACGC TGATCTCCTC CATGCTGCTT
TCAGTATTTG TGGCAATGAG CCTGACCCCT GCCCTGTGCG CCACCATTCT GAAAGCCGCG
CCGGAAGGCG GTCACAAACC TAACGCCCTG TTCGCACGCT TCAACACGCT GTTTGAAAAA
TCAACTCAAC ACTATACCGA TAGCACCCGC TCGCTGTTGC GTTGTACCGG TCGCTACATG
GTGGTCTACC TGCTGATTTG CGCCGGGATG GCGGTGCTGT TCCTGCGCAC GCCGACCTCT
TTCTTACCAG AAGAGGATCA GGGGGTATTT ATGACCACCG CGCAGTTACC TTCCGGTGCC
ACCATGGTTA ACACCACGAA AGTGCTGCAA CAGGTGACGG ATTATTATCT GACTAAAGAG
AAAGATAATG TCCAGTCGGT GTTTACCGTT GGCGGCTTTG GCTTCAGCGG TCAGGGGCAA
AACAACGGCC TGGCGTTTAT CAGTCTCAAG CCGTGGTCTG AACGTGTCGG TGAGGAAAAC
TCGGTTACCG CGATCATTCA GCGGGCAATG ATTGCGTTAA GCAGTATCAA TAAAGCCGTC
GTCTTCCCGT TCAACTTACC CGCGGTGGCT GAACTGGGTA CCGCGTCAGG TTTTGATATG
GAACTGCTGG ACAACGGTAA CCTGGGGCAC GAAAAACTAA CCCAGGCGCG AAACGAGCTG
TTATCACTGG CAGCGCAATC ACCGAATCAG GTCACCGGGG TACGCCCGAA CGGCCTGGAA
GATACGCCGA TGTTCAAAGT GAACGTCAAC GCTGCGAAAG CTGAAGCGAT GGGCGTGGCG
CTGTCTGATA TCAACCAGAC AATTTCCACC GCCTTCGGCA GCAGCTACGT GAACGACTTC
CTCAACCAGG GGCGGGTGAA AAAAGTGTAT GTCCAGGCAG GCACGCCGTT CCGTATGTTG
CCGGATAACA TCAACCAATG GTATGTACGC AACGCCTCTG GCACGATGGC ACCGCTTTCT
GCCTACTCGT CTACCGAATG GACCTATGGT TCACCGCGAC TGGAACGCTA CAACGGCATC
CCGTCAATGG AGATTTTAGG TGAAGCGGCG GCCGGGAAAA GTACCGGTGA CGCCATGAAA
TTTATGGCAG ACCTGGTCGC TAAACTTCCG GCAGGCGTCG GCTACTCATG GACCGGACTA
TCGTATCAGG AAGCGTTATC CTCAAATCAG GCTCCTGCGC TGTATGCGAT TTCACTGGTC
GTGGTGTTCC TCGCCCTCGC CGCACTCTAT GAGAGCTGGT CAATTCCGTT CTCGGTGATG
TTGGTTGTTC CGTTAGGCGT CGTTGGCGCA TTACTGGCCA CCGATCTGCG CGGCTTAAGT
AATGACGTCT ACTTCCAGGT TGGTTTGCTG ACCACCATCG GGCTTTCCGC CAAAAACGCC
ATCCTGATTG TCGAATTTGC CGTTGAGATG ATGCAGAAAG AAGGGAAAAC GCCGATAGAG
GCAATCATCG AAGCGGCGCG GATGCGTTTA CGCCCAATCC TGATGACCTC TCTGGCCTTT
ATTCTCGGCG TGCTGCCGCT GGTTATCAGT CATGGTGCCG GTTCTGGCGC GCAAAACGCG
GTAGGTACCG GCGTGATGGG CGGGATGTTT GCCGCAACAG TGCTGGCAAT TTACTTCGTT
CCGGTCTTTT TCGTTGTAGT GGAACATCTC TTTGCCCGCT TTAAAAAAGC GTAA
 
Protein sequence
MANYFIDRPV FAWVLAIIMM LAGGLAIMNL PVAQYPQIAP PTITVSATYP GADAQTVEDS 
VTQVIEQNMN GLDGLMYMSS TSDAAGNASI TLTFETGTSP DIAQVQVQNK LQLAMPSLPE
AVQQQGISVD KSSSNILMVA AFISDNGSLN QYDIADYVAS NIKDPLSRTA GVGSVQLFGS
EYAMRIWLDP QKLNKYNLVP SDVISQIKVQ NNQISGGQLG GMPQAADQQL NASIIVQTRL
QTPEEFGKIL LKVQQDGSQV LLRDVARVEL GAEDYSTVAR YNGKPAAGIA IKLAAGANAL
DTSRAVKEEL NRLSAYFPAS LKTVYPYDTT PFIEISIQEV FKTLVEAIIL VFLVMYLFLQ
NFRATIIPTI AVPVVILGTF AILSAVGFTI NTLTMFGMVL AIGLLVDDAI VVVENVERVI
AEDKLPPKEA THKSMGQIQR ALVGIAVVLS AVFMPMAFMS GATGEIYRQF SITLISSMLL
SVFVAMSLTP ALCATILKAA PEGGHKPNAL FARFNTLFEK STQHYTDSTR SLLRCTGRYM
VVYLLICAGM AVLFLRTPTS FLPEEDQGVF MTTAQLPSGA TMVNTTKVLQ QVTDYYLTKE
KDNVQSVFTV GGFGFSGQGQ NNGLAFISLK PWSERVGEEN SVTAIIQRAM IALSSINKAV
VFPFNLPAVA ELGTASGFDM ELLDNGNLGH EKLTQARNEL LSLAAQSPNQ VTGVRPNGLE
DTPMFKVNVN AAKAEAMGVA LSDINQTIST AFGSSYVNDF LNQGRVKKVY VQAGTPFRML
PDNINQWYVR NASGTMAPLS AYSSTEWTYG SPRLERYNGI PSMEILGEAA AGKSTGDAMK
FMADLVAKLP AGVGYSWTGL SYQEALSSNQ APALYAISLV VVFLALAALY ESWSIPFSVM
LVVPLGVVGA LLATDLRGLS NDVYFQVGLL TTIGLSAKNA ILIVEFAVEM MQKEGKTPIE
AIIEAARMRL RPILMTSLAF ILGVLPLVIS HGAGSGAQNA VGTGVMGGMF AATVLAIYFV
PVFFVVVEHL FARFKKA
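
The gene and protein sequences can be cross-checked by translating the nucleotide blocks codon by codon. The following is a minimal sketch, not the database's own method. It assumes that translation table 11 assigns the same amino acid to every codon as the standard genetic code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE`, `translate`, `FIRST_LINE`, and `LAST_LINE` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering
# (third base varies fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string, ignoring the spaces between 10-base blocks."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3      # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from the record above.
FIRST_LINE = "ATGGCTAACT ATTTTATTGA TCGCCCGGTT TTTGCCTGGG TACTTGCCAT TATTATGATG"
LAST_LINE = "CCGGTCTTTT TCGTTGTAGT GGAACATCTC TTTGCCCGCT TTAAAAAAGC GTAA"

print(translate(FIRST_LINE))
print(translate(LAST_LINE))
```

The first line translates to the opening 20 residues of the protein sequence (MANYFIDRPV FAWVLAIIMM), and the last line to its final 17 residues followed by the stop codon.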