Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1725 |
Symbol | adhE |
ID | 6969701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1660985 |
End bp | 1663660 |
Gene Length | 2676 bp |
Protein Length | 891 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643385679 |
Product | bifunctional acetaldehyde-CoA/alcohol dehydrogenase |
Protein accession | YP_002270171 |
Protein GI | 209397364 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases [COG1454] Alcohol dehydrogenase, class IV |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.365908 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.626874 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGTTA CTAATGTCGC TGAACTTAAC GCACTCGTAG AGCGTGTAAA AAAAGCCCAG CGTGAATATG CCAGTTTCAC TCAAGAGCAA GTAGACAAAA TCTTCCGCGC CGCCGCTCTG GCTGCTGCAG ATGCTCGAAT CCCACTCGCG AAAATGGCCG TTGCCGAATC CGGCATGGGT ATCGTCGAAG ATAAAGTGAT CAAAAACCAC TTTGCTTCTG AATATATCTA CAACGCCTAT AAAGATGAAA AAACCTGTGG TGTTCTGTCT GAAGACGACA CTTTTGGTAC CATCACTATC GCTGAACCAA TCGGTATTAT TTGCGGTATC GTTCCGACCA CTAACCCGAC TTCAACAGCT ATCTTCAAAT CGCTGATCAG TCTGAAGACC CGTAACGCCA TTATCTTCTC CCCGCACCCG CGTGCAAAAG ATGCGACCAA CAAAGCGGCT GATATCGTTC TGCAGGCTGC TATCGCTGCC GGTGCTCCGA AAGATCTGAT CGGCTGGATC GATCAACCTT CTGTTGAACT GTCTAACGCA CTGATGCACC ACCCAGACAT CAACCTGATC CTCGCGACTG GTGGTCCGGG CATGGTTAAA GCCGCATACA GCTCCGGTAA ACCAGCTATC GGCGTAGGCG CGGGCAACAC TCCGGTTGTT ATCGATGAAA CTGCTGATAT CAAACGTGCA GTTGCATCTG TACTGATGTC CAAAACCTTC GACAACGGCG TAATCTGTGC TTCTGAACAG TCTGTTGTTG TCGTTGACTC TGTTTATGAC GCAGTACGTG AACGTTTCGC AACCCACGGC GGCTATCTGT TGCAGGGTAA AGAGCTGAAA GCTGTTCAGG ACGTTATCCT GAAAAACGGT GCGCTGAACG CGGCTATCGT TGGTCAGCCA GCCTATAAAA TTGCTGAACT GGCAGGCTTC TCTGTACCAG AAAACACCAA GATTCTGATC GGTGAAGTGA CCGTTGTTGA TGAAAGCGAA CCGTTCGCAC ATGAAAAACT GTCCCCGACT CTGGCAATGT ACCGCGCTAA AGATTTCGAA GACGCGGTAG AAAAAGCAGA GAAACTGGTT GCTATGGGTG GTATCGGTCA TACCTCTTGC CTGTACACTG ACCAGGATAA CCAACCGGCT CGCGTTTCTT ACTTCGGTCA GAAAATGAAA ACGGCTCGTA TCCTGATTAA CACCCCGGCT TCTCAGGGTG GTATCGGTGA CCTGTATAAC TTCAAACTCG CACCTTCCCT GACTCTGGGT TGTGGTTCCT GGGGTGGTAA CTCCATCTCT GAAAACGTTG GTCCGAAACA CCTGATCAAC AAGAAAACCG TTGCTAAGCG AGCTGAAAAC ATGTTGTGGC ACAAACTTCC GAAATCTATC TACTTCCGCC GTGGCTCCCT GCCAATCGCG CTGGATGAAG TGATTACTGA TGGCCACAAA CGTGCGCTCA TCGTGACTGA CCGCTTCCTG TTCAACAATG GTTATGCTGA TCAGATCACT TCCGTACTGA AAGCAGCAGG CGTTGAAACT GAAGTCTTCT TCGAAGTAGA AGCGGACCCG ACCCTGAGCA TCGTTCGTAA AGGTGCAGAA CTGGCAAACT CCTTCAAACC AGACGTGATT ATCGCGCTGG GTGGTGGTTC CCCGATGGAC GCCGCGAAGA TCATGTGGGT TATGTACGAA CATCCGGAAA CTCACTTCGA AGAGCTGGCG CTGCGCTTTA TGGATATCCG TAAACGTATC TACAAGTTCC CGAAAATGGG CGTGAAAGCG AAAATGATTG CTGTCACCAC CACTTCTGGT ACAGGTTCTG AAGTCACTCC GTTTGCGGTT GTAACTGACG ACGCTACTGG TCAGAAATAC CCGCTGGCGG ACTATGCGCT GACCCCGGAT ATGGCGATTG TCGACGCCAA CCTGGTTATG GACATGCCGA AGTCCCTGTG TGCTTTCGGT GGTCTGGACG CAGTAACTCA CGCCATGGAA GCTTATGTTT CTGTACTGGC ATCTGAGTTC TCTGATGGTC AGGCTCTGCA GGCACTGAAA CTGCTGAAAG AATATCTGCC AGCGTCCTAC CACGAAGGGT CTAAAAATCC GGTAGCGCGT GAACGTGTTC ACAGTGCAGC GACTATCGCG GGTATCGCGT TTGCGAACGC CTTCCTGGGT GTATGTCACT CAATGGCGCA CAAACTGGGT TCCCAGTTCC ATATTCCGCA CGGTCTGGCA AACGCCCTGC TGATTTGTAA CGTTATTCGC TACAATGCGA ACGACAACCC GACCAAGCAG ACTGCATTCA GCCAGTATGA CCGTCCGCAG GCTCGCCGTC GTTATGCTGA AATTGCCGAC CACCTGGGTC TGAGCGCACC GGGCGACCGT ACTGCTGCTA AGATCGAGAA ACTGCTGGCA TGGCTGGAAA CGCTGAAGGC TGAACTGGGT ATTCCGAAAT CTATCCGTGA AGCTGGCGTT CAGGAAGCAG ACTTCCTGGC GAACGTGGAT AAACTGTCTG AAGATGCATT CGATGACCAG TGCACTGGCG CTAACCCGCG TTACCCGCTG ATCTCCGAGC TGAAACAGAT TCTGCTGGAT ACCTACTACG GTCGTGATTA TGTAGAAGGT GAAACTGCAG CGAAAAAAGA AGCCGCTCCG GCTAAAGCTG AGAAAAAAGC GAAAAAATCC GCTTAA
|
Protein sequence | MAVTNVAELN ALVERVKKAQ REYASFTQEQ VDKIFRAAAL AAADARIPLA KMAVAESGMG IVEDKVIKNH FASEYIYNAY KDEKTCGVLS EDDTFGTITI AEPIGIICGI VPTTNPTSTA IFKSLISLKT RNAIIFSPHP RAKDATNKAA DIVLQAAIAA GAPKDLIGWI DQPSVELSNA LMHHPDINLI LATGGPGMVK AAYSSGKPAI GVGAGNTPVV IDETADIKRA VASVLMSKTF DNGVICASEQ SVVVVDSVYD AVRERFATHG GYLLQGKELK AVQDVILKNG ALNAAIVGQP AYKIAELAGF SVPENTKILI GEVTVVDESE PFAHEKLSPT LAMYRAKDFE DAVEKAEKLV AMGGIGHTSC LYTDQDNQPA RVSYFGQKMK TARILINTPA SQGGIGDLYN FKLAPSLTLG CGSWGGNSIS ENVGPKHLIN KKTVAKRAEN MLWHKLPKSI YFRRGSLPIA LDEVITDGHK RALIVTDRFL FNNGYADQIT SVLKAAGVET EVFFEVEADP TLSIVRKGAE LANSFKPDVI IALGGGSPMD AAKIMWVMYE HPETHFEELA LRFMDIRKRI YKFPKMGVKA KMIAVTTTSG TGSEVTPFAV VTDDATGQKY PLADYALTPD MAIVDANLVM DMPKSLCAFG GLDAVTHAME AYVSVLASEF SDGQALQALK LLKEYLPASY HEGSKNPVAR ERVHSAATIA GIAFANAFLG VCHSMAHKLG SQFHIPHGLA NALLICNVIR YNANDNPTKQ TAFSQYDRPQ ARRRYAEIAD HLGLSAPGDR TAAKIEKLLA WLETLKAELG IPKSIREAGV QEADFLANVD KLSEDAFDDQ CTGANPRYPL ISELKQILLD TYYGRDYVEG ETAAKKEAAP AKAEKKAKKS A
|
| |