Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4583 |
Symbol | hpaE |
ID | 5591184 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 4591362 |
End bp | 4592828 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640923677 |
Product | 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
Protein accession | YP_001461117 |
Protein GI | 157163799 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 69 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAG TAAATCATTG GATCAACGGT AAAAATGTTG CAGGTAACGA CTACTTCCAG ACTACCAATC CGGCAACGGG TGAAGTGCTG GCGGATGTGG CCTCCGGCGG TGAAGCGGAG ATCAATCAGG CGGTAGCGGC AGCGAAAGAG GCGTTCCCGA AATGGGCCAA TCTGCCGATG AAAGAGCGCG CGCGCCTGAT GCGCCGTCTG GGCGATCTGA TCGACCAGAA CGTGCCGGAG ATCGCCGCGA TGGAAACCGC GGACACGGGC CTGCCGATCC ATCAGACCAA AAATGTGTTG ATCCCACGCG CTTCTCACAA CTTTGAATTT TTCGCGGAAG TCTGCCAGCA GATGAACGGC AAGACTTATC CGGTCGACGA CAAGATGCTC AACTACACGC TGGTGCAGCC GGTAGGCGTT TGTGCACTGG TGTCACCGTG GAACGTGCCG TTTATGACCG CCACCTGGAA GGTCGCGCCG TGTCTGGCGC TGGGCAATAC CGCGGTACTG AAAATGTCGG AACTCTCCCC GCTGACCGCT GACCGCCTGG GTGAGCTGGC GCTGGAAGCC GGTATTCCGG CAGGCGTGCT GAACGTGGTA CAGGGCTACG GCGCAACCGC AGGGGATGCG CTGGTTCGTC ATCATGACGT ACGTGCCGTG TCGTTCACCG GCGGTACGGC CACCGGGCGC AACATCATGA AAAACGCCGG GCTGAAAAAA TACTCCATGG AACTGGGCGG TAAATCGCCG GTGCTGATTT TTGAAGATGC CGATATTGAA CGCGCGCTGG ACGCCGCCCT GTTCACCATC TTCTCGATCA ACGGCGAACG CTGCACCGCC GGTTCGCGCA TCTTTATTCA GCAAAGCATC TACCCGGAAT TCGTTAAACG CTTTGCCGAA CGCGCCAACC GTCTGCGCGT GGGCGATCCG AACGATCCGA ATACCCAGGT TGGCGCGCTT ATTAGCCAGC AGCACTGGGA AAAAGTCTCC GGCTATATCC GTCTCGGCAT TGAAGAAGGC GCAACCCTGC TGGCGGGCGG CCCGGATAAA CCGTCCGACC TGCCTGCGCA CCTGAAAGGC GGCAACTTCC TGCGCCCAAC CGTGCTGGCA GACGTTGATA ACCGTATGCG AGTCGCCCAG GAAGAGATTT TCGGGCCGGT CGCCTGCCTG CTGCCGTTTA AAGACGAAGC CGAAGGCTTA CGTCTGGCAA ACGACGTGGA GTACGGCCTC GCGTCGTACA TCTGGACACA GGATGTCAGC AAAGTGTTAC GCCTGGCGCG TGGCATTGAA GCTGGCATGG TGTTCGTCAA CACCCAGAAC GTGCGTGACC TGCGCCAGCC ATTTGGCGGC GTAAAAGCCT CCGGCACCGG GCGTGAAGGC GGTGAGTACA GCTTCGAAGT GTTCGCGGAA ATGAAGAACG TCTGCATTTC CATGGGCGAC CATCCAATTC CGAAATGGGG AGTCTGA
|
Protein sequence | MKKVNHWING KNVAGNDYFQ TTNPATGEVL ADVASGGEAE INQAVAAAKE AFPKWANLPM KERARLMRRL GDLIDQNVPE IAAMETADTG LPIHQTKNVL IPRASHNFEF FAEVCQQMNG KTYPVDDKML NYTLVQPVGV CALVSPWNVP FMTATWKVAP CLALGNTAVL KMSELSPLTA DRLGELALEA GIPAGVLNVV QGYGATAGDA LVRHHDVRAV SFTGGTATGR NIMKNAGLKK YSMELGGKSP VLIFEDADIE RALDAALFTI FSINGERCTA GSRIFIQQSI YPEFVKRFAE RANRLRVGDP NDPNTQVGAL ISQQHWEKVS GYIRLGIEEG ATLLAGGPDK PSDLPAHLKG GNFLRPTVLA DVDNRMRVAQ EEIFGPVACL LPFKDEAEGL RLANDVEYGL ASYIWTQDVS KVLRLARGIE AGMVFVNTQN VRDLRQPFGG VKASGTGREG GEYSFEVFAE MKNVCISMGD HPIPKWGV
|
| |