Gene EcDH1_0440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_0440 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp465627 
End bp468731 
Gene Length3105 bp 
Protein Length1034 aa 
Translation table11 
GC content52% 
IMG OID 
Producttransporter, hydrophobe/amphiphile efflux-1 (HAE1) family 
Protein accessionACX38129 
Protein GI260447707 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAACT TTTTTATTCG ACGACCGATA TTTGCATGGG TGCTGGCCAT TATTCTGATG 
ATGGCGGGCG CACTGGCGAT CCTACAATTG CCCGTCGCTC AGTATCCAAC AATTGCACCG
CCTGCGGTTT CTGTTTCAGC AAACTATCCG GGCGCTGATG CGCAGACCGT GCAGGATACG
GTGACGCAGG TTATCGAACA GAATATGAAC GGTATCGATA ACCTGATGTA TATGTCCTCC
ACCAGCGATT CCGCCGGTAG CGTGACAATT ACCCTTACCT TCCAGTCCGG GACCGATCCT
GATATCGCGC AAGTGCAGGT GCAGAACAAA CTCCAGCTCG CCACGCCGTT GCTGCCGCAG
GAGGTTCAGC AGCAGGGGAT CAGTGTTGAA AAGTCCAGTA GCAGCTATTT GATGGTGGCG
GGCTTTGTCT CTGATAACCC AGGCACCACA CAGGACGATA TCTCGGACTA TGTGGCCTCT
AACGTTAAAG ATACGCTTAG CCGTCTGAAT GGCGTCGGTG ACGTACAGCT TTTCGGCGCA
CAGTATGCGA TGCGTATCTG GCTGGATGCC GATCTGCTAA ACAAATATAA ACTGACACCG
GTTGATGTGA TTAACCAGTT GAAGGTACAG AACGATCAGA TCGCTGCCGG ACAGTTGGGC
GGAACGCCAG CGTTACCAGG GCAACAATTG AACGCCTCGA TTATTGCTCA GACGCGGTTT
AAAAATCCGG AAGAATTCGG CAAAGTGACC CTGCGCGTAA ACAGTGACGG CTCGGTGGTA
CGCCTGAAAG ATGTCGCACG GGTTGAACTT GGCGGTGAAA ACTATAACGT TATCGCTCGT
ATCAACGGAA AACCGGCGGC GGGCCTGGGG ATTAAGCTGG CAACCGGCGC GAATGCTCTC
GATACCGCGA AAGCCATTAA GGCAAAACTG GCGGAATTAC AGCCATTCTT CCCGCAGGGA
ATGAAGGTTC TCTACCCTTA TGACACCACG CCATTCGTCC AGCTTTCTAT TCACGAAGTG
GTAAAAACGC TGTTCGAAGC CATTATGCTG GTGTTCCTGG TGATGTATCT GTTCTTGCAG
AATATGCGAG CAACGCTGAT CCCCACCATT GCGGTACCCG TGGTGTTGTT AGGGACGTTT
GCCATCCTCG CCGCTTTTGG TTACTCCATC AACACACTAA CGATGTTCGG GATGGTGCTT
GCCATCGGGC TGCTCGTCGA TGATGCGATA GTGGTGGTGG AGAACGTCGA GCGCGTGATG
ATGGAGGATA AGCTCCCGCC AAAAGAAGCG ACGGAAAAAT CGATGTCGCA AATTCAGGGC
GCACTGGTGG GTATCGCGAT GGTGCTGTCA GCGGTATTTA TTCCGATGGC ATTCTTCGGC
GGTTCTACTG GGGCAATTTA TCGCCAGTTC TCTATCACCA TCGTTTCGGC AATGGCGCTT
TCTGTTCTGG TGGCATTGAT TCTTACCCCT GCGTTATGTG CAACGCTGCT TAAACCCGTC
TCTGCTGAGC ATCACGAAAA TAAGGGCGGT TTCTTCGGTT GGTTTAATAC CACCTTCGAT
CATAGCGTTA ACCACTACAC CAACAGCGTC GGCAAAATCC TCGGATCCAC AGGACGATAT
TTACTGATCT ATGCGCTGAT TGTTGCAGGA ATGGTGGTGT TGTTTTTACG TCTTCCGTCT
TCCTTCTTAC CTGAAGAGGA TCAGGGTGTC TTTCTGACCA TGATTCAGTT ACCCGCTGGC
GCGACGCAAG AGCGGACGCA AAAAGTGTTG GATCAAGTTA CGGATTACTA TCTGAAGAAC
GAGAAAGCGA ACGTTGAAAG TGTCTTTACG GTTAACGGCT TTAGCTTCAG CGGCCAGGCA
CAAAACGCCG GTATGGCCTT CGTCAGTCTG AAACCGTGGG AAGAGCGTAA TGGTGACGAA
AACAGTGCGG AAGCGGTAAT CCATCGTGCC AAAATGGAAT TGGGCAAGAT CCGCGACGGT
TTTGTCATTC CATTCAATAT GCCAGCCATT GTTGAACTGG GCACGGCAAC GGGTTTCGAC
TTTGAGTTAA TTGATCAGGC TGGGCTGGGT CACGATGCCC TAACCCAGGC CCGTAACCAG
TTGCTTGGTA TGGCGGCGCA ACATCCTGCC AGCTTAGTCA GCGTGCGCCC TAATGGCCTG
GAAGACACCG CGCAGTTTAA ACTGGAAGTT GACCAGGAAA AGGCGCAGGC ATTAGGTGTT
TCACTTTCTG ACATCAATCA GACCATTTCA ACGGCGCTGG GTGGGACTTA CGTTAACGAC
TTCATCGACC GTGGCCGCGT GAAAAAGTTG TATGTTCAGG CGGATGCCAA ATTCCGTATG
CTGCCAGAAG ATGTCGATAA ACTTTATGTC CGCAGCGCCA ACGGCGAAAT GGTGCCATTC
TCGGCCTTTA CCACTTCACA TTGGGTGTAT GGCTCTCCGC GACTGGAACG CTACAACGGT
CTGCCGTCAA TGGAGATTCA GGGGGAAGCC GCGCCAGGAA CCAGTTCCGG CGATGCCATG
GCGTTGATGG AAAACCTTGC GTCAAAATTA CCTGCGGGCA TTGGTTATGA CTGGACGGGT
ATGTCGTATC AGGAACGCTT ATCGGGAAAC CAGGCTCCCG CTCTGGTAGC AATTTCCTTT
GTGGTTGTTT TCCTGTGCCT TGCTGCACTC TATGAAAGCT GGTCAATTCC TGTCTCGGTT
ATGTTGGTAG TGCCGTTAGG GATTGTCGGC GTGCTGCTGG CGGCGACACT CTTTAATCAA
AAAAATGACG TCTACTTTAT GGTGGGCTTG CTAACGACAA TTGGCTTGTC GGCCAAAAAC
GCTATTTTGA TCGTTGAGTT CGCTAAAGAT CTCATGGAGA AAGAGGGTAA AGGTGTTGTT
GAAGCGACAC TGATGGCAGT ACGTATGCGT CTGCGTCCTA TCCTGATGAC CTCTCTCGCC
TTTATTCTCG GCGTATTACC GCTAGCTATC AGTAACGGTG CCGGCAGTGG CGCGCAGAAC
GCTGTGGGTA TCGGGGTAAT GGGAGGAATG GTCTCTGCAA CGTTGCTGGC AATCTTCTTC
GTACCGGTGT TCTTTGTGGT GATCCGCCGT TGCTTTAAAG GATAA
 
Protein sequence
MANFFIRRPI FAWVLAIILM MAGALAILQL PVAQYPTIAP PAVSVSANYP GADAQTVQDT 
VTQVIEQNMN GIDNLMYMSS TSDSAGSVTI TLTFQSGTDP DIAQVQVQNK LQLATPLLPQ
EVQQQGISVE KSSSSYLMVA GFVSDNPGTT QDDISDYVAS NVKDTLSRLN GVGDVQLFGA
QYAMRIWLDA DLLNKYKLTP VDVINQLKVQ NDQIAAGQLG GTPALPGQQL NASIIAQTRF
KNPEEFGKVT LRVNSDGSVV RLKDVARVEL GGENYNVIAR INGKPAAGLG IKLATGANAL
DTAKAIKAKL AELQPFFPQG MKVLYPYDTT PFVQLSIHEV VKTLFEAIML VFLVMYLFLQ
NMRATLIPTI AVPVVLLGTF AILAAFGYSI NTLTMFGMVL AIGLLVDDAI VVVENVERVM
MEDKLPPKEA TEKSMSQIQG ALVGIAMVLS AVFIPMAFFG GSTGAIYRQF SITIVSAMAL
SVLVALILTP ALCATLLKPV SAEHHENKGG FFGWFNTTFD HSVNHYTNSV GKILGSTGRY
LLIYALIVAG MVVLFLRLPS SFLPEEDQGV FLTMIQLPAG ATQERTQKVL DQVTDYYLKN
EKANVESVFT VNGFSFSGQA QNAGMAFVSL KPWEERNGDE NSAEAVIHRA KMELGKIRDG
FVIPFNMPAI VELGTATGFD FELIDQAGLG HDALTQARNQ LLGMAAQHPA SLVSVRPNGL
EDTAQFKLEV DQEKAQALGV SLSDINQTIS TALGGTYVND FIDRGRVKKL YVQADAKFRM
LPEDVDKLYV RSANGEMVPF SAFTTSHWVY GSPRLERYNG LPSMEIQGEA APGTSSGDAM
ALMENLASKL PAGIGYDWTG MSYQERLSGN QAPALVAISF VVVFLCLAAL YESWSIPVSV
MLVVPLGIVG VLLAATLFNQ KNDVYFMVGL LTTIGLSAKN AILIVEFAKD LMEKEGKGVV
EATLMAVRMR LRPILMTSLA FILGVLPLAI SNGAGSGAQN AVGIGVMGGM VSATLLAIFF
VPVFFVVIRR CFKG