Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A1177 |
Symbol | hpaE |
ID | 6871178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 1168005 |
End bp | 1169471 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642784360 |
Product | 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
Protein accession | YP_002215033 |
Protein GI | 198245087 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.585771 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 75 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAA TAAATCATTG GATTAACGGC AAAAACGTTG CAGGTAACGA CTACTTCCAG ACCACTAACC CGGCGACCGG TGACGTGCTG GCGGAAGTAG CCTCCGGCGG TGAAGCAGAA GTGAACCAGG CTGTCGCGGC GGCCAAAGAG GCGTTCCCGA AATGGGCCAA TCTGCCGATG AAAGAGCGCG CGCGCCTGAT GCGTCGTCTG GGTGACCTGA TTGACCAGAA CGTACCGGAA ATCGCGGCGA TGGAAACCGC CGACACCGGT CTGCCTATTC ACCAGACTAA AAACGTGCTG ATCCCGCGCG CCTCGCATAA CTTCGAATTC TTCGCCGAAG TGTGCCAGCA GATGAACGGC AAGACCTATC CTGTTGACGA TAAAATGCTC AATTATACGC TGGTGCAGCC CGTCGGCGTC TGCGCGCTGG TGTCGCCGTG GAACGTGCCG TTTATGACCG CGACCTGGAA AGTCGCGCCG TGCCTGGCGC TGGGCAACAC CGCGGTGCTC AAAATGTCCG AGCTGTCGCC GTTGACTGCC GACAGGCTGG GCGAGCTGGC GCTGGAGGCA GGAATTCCGG CAGGGGTGCT GAACGTGGTG CAGGGCTACG GCGCGACGGC GGGCGATGCG CTGGTACGCC ACCATGACGT ACGTGCGGTG TCGTTTACCG GCGGTACCGC CACCGGTCGC AATATCATGA AAAATGCCGG CCTGAAAAAA TACTCGATGG AGCTGGGCGG CAAATCGCCG GTGCTGATTT TTGAAGATGC CGACATTGAG CGCGCGCTGG ACGCCGCGCT GTTCACCATC TTCTCGATCA ACGGCGAACG CTGCACCGCC GGGTCGCGCA TCTTTATCCA GCAGAGCATT TACCCTGAGT TCGTGAAGCG CTTTGCCGAA CGCGCGAATC GCCTGCGTGT CGGCGATCCG ACCGACCCGA ACACCCAGGT CGGCGCGCTG ATTAGCCAAC AGCACTGGGA AAAAGTCTCC GGTTATATCC GCCTCGGCAT TGAAGAGGGC GCAACGCTGC TGGCGGGCGG TGCGGAAAAA CCCACTGACC TGCCTGCGCA TCTGAAAGGC GGTAACTTCC TGCGCCCAAC CGTGCTGGCC GATGTCGACA ACCGTATGCG CGTCGCGCAG GAAGAGATCT TTGGGCCAGT CGCCTGTCTG CTGCCATTCA AAGACGAAGC GGAAGGGTTA CGTTTGGCGA ACGACGTGGA ATACGGTCTG GCCTCTTATA TCTGGACCCA GGACGTGAGC AAAGTGTTGC GCCTGGCGCG TGCGATTGAA GCCGGCATGG TCTTCGTCAA CACCCAGAAC GTCCGCGACC TGCGCCAGCC GTTCGGCGGC GTGAAAGCCT CCGGTACCGG GCGCGAAGGC GGCGAATATA GCTTCGAAGT GTTTGCGGAA ATGAAAAACG TCTGCATCTC AATGGGCGAC CATCCTATCC CAAAATGGGG AGTTTGA
|
Protein sequence | MKKINHWING KNVAGNDYFQ TTNPATGDVL AEVASGGEAE VNQAVAAAKE AFPKWANLPM KERARLMRRL GDLIDQNVPE IAAMETADTG LPIHQTKNVL IPRASHNFEF FAEVCQQMNG KTYPVDDKML NYTLVQPVGV CALVSPWNVP FMTATWKVAP CLALGNTAVL KMSELSPLTA DRLGELALEA GIPAGVLNVV QGYGATAGDA LVRHHDVRAV SFTGGTATGR NIMKNAGLKK YSMELGGKSP VLIFEDADIE RALDAALFTI FSINGERCTA GSRIFIQQSI YPEFVKRFAE RANRLRVGDP TDPNTQVGAL ISQQHWEKVS GYIRLGIEEG ATLLAGGAEK PTDLPAHLKG GNFLRPTVLA DVDNRMRVAQ EEIFGPVACL LPFKDEAEGL RLANDVEYGL ASYIWTQDVS KVLRLARAIE AGMVFVNTQN VRDLRQPFGG VKASGTGREG GEYSFEVFAE MKNVCISMGD HPIPKWGV
|
| |