Gene Information |
Locus tag | YpAngola_A1646 |
Symbol | hpaE |
ID | 5800117 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | - |
Start bp | 1699703 |
End bp | 1701100 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641339592 |
Product | 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
Protein accession | YP_001606149 |
Protein GI | 162418764 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
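
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS is unspliced and ends in a single stop codon; the function name `feature_lengths` is hypothetical and not part of any database API.

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
gene_len, prot_len = feature_lengths(1699703, 1701100)
print(gene_len, prot_len)  # 1398 465
```

Under these assumptions the result agrees with the Gene Length (1398 bp) and Protein Length (465 aa) fields.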
| |
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATTG TTAACCATTG GATTGATGGG AAAAATATTA CCAGCAATGA CTATTTCACA ACAATCAATC CGGCCACTGG CGAGGTGCTG GCTGACGTGG CAAGCGGGGG GATAAAAGAG ATCAACCAAG CCGTTGCTGC CGCGAAAAGT GCTTTCCCTC ACTGGGCTAA CCTGCCGATG AAAGTGCGTG CCCGTCTAAT GCGCCGTCTG GGGGAGTTGA TTGAGCAACA GATCCCAGAA ATAGCGCAGA TGGAAACGCA GGATACCGGC CTGCCCATTT ATCAAACGCA AAATGCCTTG ATCCCGCGGG CAGCACATAA CTTCGAATTT TTTGCCGAAA TTTGTCAGCA AATGAATGGC CAGACGTATC CGGTTGACGA TCAAATGTTG AATTACACCT TGGTGCAACC CGTGGGAGTG TGTGCGTTGG TCTCCCCTTG GAATGTCCCT TTTATGACGG CGACCTGGAA GGTCGCGCCT TGTTTGGCGC TGGGTAACAC CGCGATATTG AAAATGTCGG AGCTATCGCC ACTGACCGCA GACAAACTGG GTGAACTGGC CTTAGAGGCG GGTATACCGG CGGGGGTTCT CAACGTGGTA CAAGGATATG GGGCCACTGT CGGTGATGCA TTGGTATGTC ATCAGGATGT CCGAGCTATC TCTTTTACCG GCGGCACCGC GACGGGAAAC CGCATCATGC AACGTGCCGG GTTGAAAAAA TACTCCATGG AACTCGGTGG TAAATCCCCG GTACTTATCT TCGACGATGC TGATATCGAA CGGGCTATGG ATGCGGCGCT ATTTTCCATC TTCTCTCTCA ATGGTGAACG TTGCACGGCG GGTTCGCGCA TTTTTATTCA AGAGAGTCTC TATTCGGCAT TTATTCAACG TTTTGCTGAG CGGGCCAGCC GTTTACGTGT GGGGGACCCA CAAGATCTCG ACACTCAAGT TGGCGCATTG ATCAGTAAAC CGCATTGGGA CAAAGTTTCC GGCTATATCC AGTTGGGGAT AGAGGAGGGG GCCACGTTGT TGGCAGGGGG GCCGGATAAA CCCATCGACC TACCTGCTCA TCTGCGCGGA GGGCACTTCC TGCGTCCAAC GGTGTTGGCC GATGTTGATA ACCGAATGCG GGTTGCTCAG GAAGAGATTT TTGGACCGGT CGCTTGCCTG ATCCCCTTTA AGAATGAAGA CGCCGGACTG CGTTTGGCCC GAAGTATTGA AGCCGGCATG GTGTTCGTGA ATACCCAGAA TGTGCGGGAT CTCCGCCAGC CATTTGGCGG CATCAAGGCA TCGGGAACCG GGCGTGAAGG GGGAAAGTAC AGTTTTGATG TTTTTGCTGA AGTGAAAAAC GTCTGTATTT CCATGGGGGA GCATCCGATC CCCCGTTGGG GGATGTAA
|
Protein sequence | MKIVNHWIDG KNITSNDYFT TINPATGEVL ADVASGGIKE INQAVAAAKS AFPHWANLPM KVRARLMRRL GELIEQQIPE IAQMETQDTG LPIYQTQNAL IPRAAHNFEF FAEICQQMNG QTYPVDDQML NYTLVQPVGV CALVSPWNVP FMTATWKVAP CLALGNTAIL KMSELSPLTA DKLGELALEA GIPAGVLNVV QGYGATVGDA LVCHQDVRAI SFTGGTATGN RIMQRAGLKK YSMELGGKSP VLIFDDADIE RAMDAALFSI FSLNGERCTA GSRIFIQESL YSAFIQRFAE RASRLRVGDP QDLDTQVGAL ISKPHWDKVS GYIQLGIEEG ATLLAGGPDK PIDLPAHLRG GHFLRPTVLA DVDNRMRVAQ EEIFGPVACL IPFKNEDAGL RLARSIEAGM VFVNTQNVRD LRQPFGGIKA SGTGREGGKY SFDVFAEVKN VCISMGEHPI PRWGM
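
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a sketch only, assuming the amino-acid assignments of translation table 11, which match those of the standard genetic code; the helper names `build_codon_table` and `translate` are hypothetical. The spaces that group the sequence into blocks of ten are stripped before translation.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11
# uses the same assignments and differs only in permitted start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_codon_table():
    """Map each three-letter codon to its one-letter amino acid ('*' = stop)."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    return dict(zip(codons, AMINO_ACIDS))


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    table = build_codon_table()
    seq = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGAAGATTG TTAACCATTG G"))  # MKIVNHW
print(translate("CCCCGTTGGG GGATGTAA"))       # PRWGM
```

The two fragments reproduce the first seven and last five residues of the protein sequence listed above.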