Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A1197 |
Symbol | hpaE |
ID | 6483568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 1191843 |
End bp | 1193309 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642736599 |
Product | 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
Protein accession | YP_002040357 |
Protein GI | 194445601 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 0.692139 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAA TAAATCATTG GATTAACGGC AAAAACGTTG CAGGTAACGA CTACTTCCAG ACCACTAACC CGGCGACCGG TGATGTGCTG GCGGAAGTAG CCTCCGGCGG TGAAGCAGAA GTGAACCAGG CTGTCGCGGC GGCAAAAGAG GCGTTCCCGA AATGGGCCAA CCTGCCGATG AAAGAGCGCG CGCGCCTGAT GCGTCGCCTT GGCGATCTGA TTGACCAGCA TGTGCCGGAA ATCGCGGCGA TGGAAACCGC CGACACCGGC TTGCCTATTC ACCAGACTAA AAACGTGCTG ATCCCGCGCG CCTCGCATAA CTTCGAATTC TTCGCCGAAG TGTGCCAGCA GATGAACGGC AAGACCTATC CTGTTGACGA TAAAATGCTC AATTATACGC TGGTGCAGCC CGTCGGCGTC TGCGCGCTGG TGTCGCCGTG GAACGTGCCG TTTATGACCG CGACCTGGAA AGTCGCGCCG TGCCTGGCGC TGGGTAACAC CGCGGTGCTC AAAATGTCCG AGCTGTCGCC GCTGACTGCT GACAGGCTGG GCGAGCTGGC GCTGGAGGCA GGAATTCCGG CAGGGGTGCT GAACGTGGTG CAGGGCTATG GCGCGACGGC GGGCGATGCG CTGGTACGCC ACCATGACGT GCGTGCGGTG TCGTTTACCG GCGGTACCGC CACCGGTCGC AATATCATGA AAAATGCCGG CCTGAAAAAA TACTCGATGG AGCTGGGCGG CAAATCGCCG GTGCTGATTT TTGAAGATGC CGACATTGAG CGCGCGCTGG ACGCCGCGCT GTTCACCATC TTCTCGATCA ACGGCGAACG CTGCACCGCC GGGTCGCGCA TCTTTATCCA GCAGAGCATT TACCCTGAGT TCGTGAAGCG CTTTGCCGAA CGTGCGAATC GCCTGCGTGT CGGCGATCCG ACCGACCCGA ACACCCAGGT CGGCGCGCTG ATTAGCCAAC AGCACTGGGA AAAAGTCTCC GGTTATATCC GCCTCGGCAT TGAAGAGGGC GCAACGCTGC TGGCGGGCGG TGCGGAAAAA CCCACTGACC TGCCTGCGCA TCTGAAAGGC GGTAACTTCC TGCGCCCAAC CGTGCTGGCC GATGTCGACA ACCGTATGCG CGTCGCGCAG GAAGAGATCT TTGGGCCAGT TGCCTGTCTG CTGCCATTCA AAGACGAAGC GGAAGGGTTA CGCCTGGCGA ACGATGTGGA ATACGGTCTG GCCTCTTATA TCTGGACCCA GGACGTGAGC AAAGTGTTGC GCCTGGCGCG TGGGATTGAA GCCGGCATGG TCTTCGTCAA CACCCAGAAC GTCCGCGACC TGCGCCAGCC GTTCGGCGGC GTGAAAGCCT CCGGTACCGG GCGCGAAGGC GGCGAATATA GCTTCGAAGT GTTTGCGGAA ATGAAAAACG TCTGCATCTC AATGGGCGAC CATCCTATCC CAAAATGGGG AGTTTGA
|
Protein sequence | MKKINHWING KNVAGNDYFQ TTNPATGDVL AEVASGGEAE VNQAVAAAKE AFPKWANLPM KERARLMRRL GDLIDQHVPE IAAMETADTG LPIHQTKNVL IPRASHNFEF FAEVCQQMNG KTYPVDDKML NYTLVQPVGV CALVSPWNVP FMTATWKVAP CLALGNTAVL KMSELSPLTA DRLGELALEA GIPAGVLNVV QGYGATAGDA LVRHHDVRAV SFTGGTATGR NIMKNAGLKK YSMELGGKSP VLIFEDADIE RALDAALFTI FSINGERCTA GSRIFIQQSI YPEFVKRFAE RANRLRVGDP TDPNTQVGAL ISQQHWEKVS GYIRLGIEEG ATLLAGGAEK PTDLPAHLKG GNFLRPTVLA DVDNRMRVAQ EEIFGPVACL LPFKDEAEGL RLANDVEYGL ASYIWTQDVS KVLRLARGIE AGMVFVNTQN VRDLRQPFGG VKASGTGREG GEYSFEVFAE MKNVCISMGD HPIPKWGV
|
| |