Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4889 |
Symbol | hpaE |
ID | 6270890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 4561112 |
End bp | 4562578 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641728621 |
Product | 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
Protein accession | YP_001883015 |
Protein GI | 187731525 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 48 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAG TAAATCATTG GATCAACGGC AAAAATGTTG CAGGTAACGA CTACTTCCAG ACCACCAATC CGGCAACGGG TGAAGTGCTG GCGGATGTGG CCTCTGGCGG TGAAGCGGAG ATCAATCAGG CGGTAGCGGC AGCGAAAGAG GCGTTCCCGA AATGGGCCAA TCTGCCGATG AAAGAGCGTG CGCGCCTGAT GCGCCGTCTG GGCGATCTGA TCGACCAGAA CGTGCCAGAG ATCGCCGCGA TGGAAACCGC GGACACCGGC CTGCCGATCC ATCAGACCAA AAATGTGTTG ATCCCACGCG CTTCCCACAA CTTTGAATTT TTCGCGGAAG TCTGCCAGCA GATGAACGGC AAGACCTATC CGGTTGACGA CAAGATGCTC AACTACACGC TGGTGCAGCC GGTGGGCGTT TGTGCGCTGG TATCGCCGTG GAACGTACCG TTTATGACCG CCACATGGAA GGTCGCGCCG TGTCTGGCGC TGGGCAATAC CGCGGTACTG AAAATGTCGG AACTCTCCCC GCTGACCGCT GACCGCCTGG GTGAGCTGGC GCTGGAAGCC GGTATTCCGG CAGGCGTGCT GAACGTGGTA CAGGGCTACG GCGCAACCGC AGGGGATGCG CTGGTTCGTC ATCATGACGT ACGTGCCGTG TCGTTCACCG GCGGTACGGC CACCGGGCGC AACATCATGA AAAACGCCGG GCTGAAAAAA TACTCCATGG AACTGGGCGG TAAATCGCCG GTGCTGATTT TTGAAGATGC CGATATTGAA CGCGCGCTGG ACGCCGCCCT GTTCACCATC TTCTCGATCA ACGGCGAACG CTGCACCGCC GGTTCACGCA TCTTTATTCA GCAAAGCATC TACCCGGAAT TCGTTAAACG CTTTGCCGAA CGCGCCAACC GTCTGCGCGT GGGCGATCCG AACGATCCGA ATACCCAGGT TGGCGCGCTT ATCAGCCAGC AGCACTGGGA AAAAGTCTCC GGCTATATCC GTCTCGGCAT TGAAGAAGGC GCCACCCTGC TGGCGGGCGG CCCGGATAAA CCGTCTGACC TGCCTGCACA CCTGAAAGGC GGCAACTTCC TGCGCCCAAC GGTGCTGGCA GACGTTGATA ACCGTATGCG AGTCGCCCAA GAAGAGATTT TCGGGCCGGT CGCCTGCCTG CTGCCGTTTA AAGACGAAGC TGAAGGCTTA CGCCTGGCAA ACGACGTGGA GTACGGCCTC GCGTCGTACA TCTGGACACA GGATGTCAGC AAAGTGTTAC GCCTGGCGCG TGGCATTGAA GCTGGCATGG TGTTCGTCAA CACCCAGAAC GTGCGTGACC TGCGCCAGCC ATTTGGCGGC GTAAAAGCCT CCGGCACCGG GCGTGAAGGC GGTGAGTACA GCTTCGAAGT GTTCGCGGAA ATGAAGAACG TCTGCATTTC CATGGGCGAC CATCCAATTC CGAAATGGGG AGTCTGA
|
Protein sequence | MKKVNHWING KNVAGNDYFQ TTNPATGEVL ADVASGGEAE INQAVAAAKE AFPKWANLPM KERARLMRRL GDLIDQNVPE IAAMETADTG LPIHQTKNVL IPRASHNFEF FAEVCQQMNG KTYPVDDKML NYTLVQPVGV CALVSPWNVP FMTATWKVAP CLALGNTAVL KMSELSPLTA DRLGELALEA GIPAGVLNVV QGYGATAGDA LVRHHDVRAV SFTGGTATGR NIMKNAGLKK YSMELGGKSP VLIFEDADIE RALDAALFTI FSINGERCTA GSRIFIQQSI YPEFVKRFAE RANRLRVGDP NDPNTQVGAL ISQQHWEKVS GYIRLGIEEG ATLLAGGPDK PSDLPAHLKG GNFLRPTVLA DVDNRMRVAQ EEIFGPVACL LPFKDEAEGL RLANDVEYGL ASYIWTQDVS KVLRLARGIE AGMVFVNTQN VRDLRQPFGG VKASGTGREG GEYSFEVFAE MKNVCISMGD HPIPKWGV
|
| |