Gene Information |
Locus tag | SeAg_B1061 |
Symbol | hpaE |
ID | 6797381 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 1059497 |
End bp | 1060963 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642775330 |
Product | 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
Protein accession | YP_002145971 |
Protein GI | 197250383 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
|
|
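The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is only an illustration. It assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the stop codon; the record does not state either convention. The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 1059497
end_bp = 1060963
gene_length_bp = 1467
protein_length_aa = 488

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS is a whole number of codons. Assumption: the final codon is
# the stop codon, which is not counted in the protein length.
codons, remainder = divmod(span, 3)

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop matches protein length:", codons - 1 == protein_length_aa)
```

Under these assumptions the three fields agree: the span is 1467 bp, which is 489 codons, or 488 residues plus one stop codon.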
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAA TAAATCATTG GATTAACGGC AAAAACGTTG CAGGTAACGA CTACTTCCAG ACCACTAACC CGGCGACCGG TGACGTGCTG GCGGAAGTAG CCTCCGGCGG TGAAGCAGAA GTGAACCAGG CTGTCGCGGC GGCAAAAGAG GCGTTCCCGA AATGGGCCAA CCTGCCGATG AAAGAGCGCG CGCGCCTGAT GCGCCGCCTT GGGGACCTGA TTGACCAGCA TGTGCCGGAA ATCGCGGCGA TGGAAACCGC CGACACCGGC CTGCCTATTC ACCAGACTAA AAACGTGCTG ATCCCGCGCG CCTCGCATAA CTTCGAATTC TTCGCCGAAG TGTGCCAGCA GATGAACGGC AAGACCTACC CGGTTGACGA TAAAATGCTC AATTATACGC TGGTGCAGCC CGTCGGCGTC TGCGCGCTGG TGTCGCCGTG GAACGTGCCG TTTATGACCG CGACCTGGAA AGTTGCGCCG TGCCTGGCGC TGGGTAACAC CGCGGTGCTC AAAATGTCCG AGCTGTCGCC GCTGACTGCC GACAGGCTGG GCGAGCTGGC GCTGGAGGCA GGAATTCCGG CAGGCGTGCT GAACGTGGTG CAGGGCTACG GCGCGACGGC GGGCGATGCG CTGGTACGCC ACCATGACGT GCGTGCGGTG TCGTTTACCG GCGGTACCGC CACCGGTCGC AATATCATGA AAAATGCCGG GCTGAAAAAA TACTCGATGG AGCTGGGCGG CAAATCGCCG GTGCTGATTT TTGAAGACGC CGACATTGAG CGCGCGCTGG ACGCCGCGCT GTTCACCATC TTCTCGATCA ACGGCGAACG CTGCACCGCC GGGTCGCGCA TCTTTATCCA GCAGAGCATT TACCCTGAGT TCGTGAAGCG CTTTGCCGAA CGCGCGAATC GCCTGCGTGT CGGCGATCCG ACCGACCCGA ACACCCAGGT CGGCGCGCTG ATTAGCCAAC AGCACTGGGA AAAAGTCTCC GGTTATATCC GCCTCGGCAT TGAAGAGGGC GCAACGCTGC TGGCGGGCGG TGCAGAAAAA CCCACTGACC TGCCTGCGCA TCTGAAAGGC GGTAACTTCC TGCGCCCAAC CGTGCTGGCC GATGTCGACA ACCGTATGCG CGTCGCGCAG GAAGAGATCT TTGGGCCAGT CGCCTGTCTG CTGCCATTCA AAGACGAAGC GGAAGGGTTA CGTTTGGCGA ACGACGTGGA ATACGGTCTG GCCTCTTATA TCTGGACCCA GGACGTGAGC AAAGTGTTGC GCCTGGCGCG TGGGATTGAA GCCGGCATGG TCTTCGTCAA CACCCAGAAC GTCCGCGACC TGCGCCAGCC GTTCGGCGGC GTGAAAGCCT CCGGTACCGG GCGCGAAGGC GGCGAATATA GCTTCGAAGT GTTTGCGGAA ATGAAAAACG TCTGCATCTC AATGGGCGAC CATCCTATCC CAAAATGGGG AGTTTGA
|
Protein sequence | MKKINHWING KNVAGNDYFQ TTNPATGDVL AEVASGGEAE VNQAVAAAKE AFPKWANLPM KERARLMRRL GDLIDQHVPE IAAMETADTG LPIHQTKNVL IPRASHNFEF FAEVCQQMNG KTYPVDDKML NYTLVQPVGV CALVSPWNVP FMTATWKVAP CLALGNTAVL KMSELSPLTA DRLGELALEA GIPAGVLNVV QGYGATAGDA LVRHHDVRAV SFTGGTATGR NIMKNAGLKK YSMELGGKSP VLIFEDADIE RALDAALFTI FSINGERCTA GSRIFIQQSI YPEFVKRFAE RANRLRVGDP TDPNTQVGAL ISQQHWEKVS GYIRLGIEEG ATLLAGGAEK PTDLPAHLKG GNFLRPTVLA DVDNRMRVAQ EEIFGPVACL LPFKDEAEGL RLANDVEYGL ASYIWTQDVS KVLRLARGIE AGMVFVNTQN VRDLRQPFGG VKASGTGREG GEYSFEVFAE MKNVCISMGD HPIPKWGV
|
| |
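The gene sequence above is displayed in space-separated groups of ten bases, so it needs to be joined before any computation such as a GC-content check. The following is a minimal sketch, not the method the database used to derive its GC content field; the helper names `clean_sequence` and `gc_content` are hypothetical, and the fragment shown is only the first three groups of the gene sequence.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used for 10-base display grouping."""
    return "".join(grouped.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in an ungrouped sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Illustrative fragment only: the first three groups of the gene sequence.
# Paste the full grouped sequence here to compare with the GC content field.
fragment = clean_sequence("ATGAAGAAAA TAAATCATTG GATTAACGGC")

print(len(fragment), fragment.startswith("ATG"))
print(f"GC fraction of fragment: {gc_content(fragment):.0%}")
```

Running the same helpers on the full gene sequence gives a figure that can be compared with the reported GC content. A short fragment such as this one is not expected to match the whole-gene value.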