Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeSA_A1166 |
Symbol | hpaE |
ID | 6519569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | + |
Start bp | 1148178 |
End bp | 1149644 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642746291 |
Product | 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
Protein accession | YP_002114100 |
Protein GI | 194734436 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.608265 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.856893 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAA TAAATCATTG GATTAACGGC AAAAAAGTTG CAGGTAACGA CTACTTCCAA ACCACTAACC CGGCGACTGG TGACGTGCTG GCGGAAGTGG CCTCCGGCGG TGAAGCAGAA GTGAACCAGG CTGTCGCGGC GGCCAAAGAG GCGTTCCCGA AATGGGCCAA CCTGCCGATG AAAGAGCGTG CGCGCCTGAT GCGCCGCCTT GGCGACCTGA TTGACCAGCA TGTGCCGGAA ATCGCGGCGA TGGAAACCGC CGACACCGGC CTGCCTATTC ACCAGACTAA AAACGTGCTG ATCCCGCGCG CCTCGCATAA CTTCGAATTC TTCGCCGAAG TGTGCCAGCA GATGAACGGC AAGACCTATC CTGTTGACGA TAAAATGCTC AATTATACGC TGGTGCAGCC CGTCGGCGTC TGCGCGCTGG TGTCGCCGTG GAACGTGCCG TTTATGACCG CGACCTGGAA AGTCGCGCCG TGCCTGGCGC TGGGCAACAC CGCGGTGCTC AAAATGTCCG AGCTGTCGCC GCTGACTGCC GACAGGCTGG GCGAGCTGGC GCTGGAGGCA GGAATTCCGG CAGGCGTGCT GAACGTGGTG CAGGGCTATG GCGCGACGGC GGGCGATTCG CTGGTACGCC ACCATGACGT GCGTGCGGTG TCGTTTACCG GCGGGACCGC CACCGGTCGC AATATCATGA AAAATGCTGG CCTGAAAAAA TACTCGATGG AGCTGGGCGG CAAATCGCCG GTGCTGATTT TTGAAGATGC CGACATTGAG CGCGCGCTGG ACGCCGCGCT GTTCACCATC TTCTCGATCA ACGGCGAACG CTGCACCGCC GGGTCGCGCA TCTTTATCCA GCAGAGCATT TACCCTGAGT TCGTGAAGCG CTTTGCCGAA CGCGCGAATC GCCTGCGTGT CGGCGATCCG ACCGACCCGA ATACCCAGGT CGGCGCGCTG ATTAGCCAAC AGCACTGGGA GAAAGTCTCC GGTTATATTC GCCTCGGCAT TGAAGAGGGC GCAACGCTGC TGGCGGGCGG TGCGGAAAAA CCCACTGACC TGCCTGCGCA TCTGAAAGGC GGTAACTTCC TGCGCCCAAC CGTGCTGGCC GATGTCGACA ACCGTATGCG CGTCGCGCAG GAAGAGATCT TTGGGCCGGT CGCCTGCCTG CTGCCATTCA AAGACGAAGC GGAAGGGTTA CGTTTGGCGA ACGACGTGGA ATACGGTCTG GCCTCTTATA TCTGGACCCA GGACGTGAGC AAAGTGTTGC GCCTGGCGCG TGGGATTGAA GCCGGCATGG TCTTCGTCAA CACCCAGAAC GTCCGCGACC TGCGCCAGCC GTTCGGCGGC GTGAAAGCCT CCGGTACCGG GCGCGAAGGC GGCGAATATA GCTTCGAAGT GTTTGCGGAA ATGAAAAACG TCTGCATCTC AATGGGCGAC CATCCTATCC CAAAATGGGG AGTTTGA
|
Protein sequence | MKKINHWING KKVAGNDYFQ TTNPATGDVL AEVASGGEAE VNQAVAAAKE AFPKWANLPM KERARLMRRL GDLIDQHVPE IAAMETADTG LPIHQTKNVL IPRASHNFEF FAEVCQQMNG KTYPVDDKML NYTLVQPVGV CALVSPWNVP FMTATWKVAP CLALGNTAVL KMSELSPLTA DRLGELALEA GIPAGVLNVV QGYGATAGDS LVRHHDVRAV SFTGGTATGR NIMKNAGLKK YSMELGGKSP VLIFEDADIE RALDAALFTI FSINGERCTA GSRIFIQQSI YPEFVKRFAE RANRLRVGDP TDPNTQVGAL ISQQHWEKVS GYIRLGIEEG ATLLAGGAEK PTDLPAHLKG GNFLRPTVLA DVDNRMRVAQ EEIFGPVACL LPFKDEAEGL RLANDVEYGL ASYIWTQDVS KVLRLARGIE AGMVFVNTQN VRDLRQPFGG VKASGTGREG GEYSFEVFAE MKNVCISMGD HPIPKWGV
|
| |