Gene Information |
Locus tag | BMA10247_A1511 |
Symbol | hpaE |
ID | 4889949 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10247 |
Kingdom | Bacteria |
Replicon accession | NC_009079 |
Strand | - |
Start bp | 1464753 |
End bp | 1466216 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640147777 |
Product | 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
Protein accession | YP_001078695 |
Protein GI | 126446976 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
|
|
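The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: the function name `check_gene_record` is hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in Protein Length (the gene sequence in this record ends in TGA, which is consistent with that assumption).

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if the coordinates and lengths agree with each other.

    Assumptions (not stated in the record itself):
    - start_bp and end_bp are 1-based and inclusive;
    - the CDS includes one terminal stop codon that is not part of the protein.
    """
    span = end_bp - start_bp + 1                # inclusive span on the replicon
    codons, remainder = divmod(gene_length_bp, 3)
    return (
        span == gene_length_bp                  # coordinates match Gene Length
        and remainder == 0                      # CDS is a whole number of codons
        and codons - 1 == protein_length_aa     # drop the stop codon
    )


# Values copied from the Gene Information fields of this record.
consistent = check_gene_record(1464753, 1466216, 1464, 487)
print(consistent)
```

Under these assumptions the fields agree: 1466216 - 1464753 + 1 = 1464 bp, and 1464 / 3 = 488 codons, one of which is the stop codon, leaving 487 aa.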
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0625314 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCATCA AGCACTGGAT CGGCGGCCGC GAGGTCGACA GCCCGGAAAC GTTCACGACG TTCAATCCGG CGACGGGCCA GCCGATCGCC GAGGTCGCCT CGGGCGGCGC GCAGGAAATC GACGCGGCGG TGCGCGCCGC GAAGGACGCG TTTCCGAAAT GGGCAGGCAC GCCCGCGAAG GAGCGCGCGA AGCTGATGCG CCGGCTGGGA GAGCTGATCG AGCGCAACGT GCCGGCGCTC GCCGACCTGG AGACCCGCGA CACCGGCCTG CCGATCTCGC AGACGAGAAA GCAACTGATT CCGCGCGCAT CGGAGAACTT CCATTTCTTC GCCGAGGTGT GCACGCGGAT GAACGGGCGC AGCTATCCGG TCGACGACCA GATGCTGAAC TACACGCTAT ATCAGCCGGT GGGCGTGTGC GCGCTCGTGT CGCCGTGGAA CGTGCCGTTC ATGACCGCGA CGTGGAAGAC CGCGCCGTGC CTCGCGCTCG GCAACACCGC GGTGCTGAAG ATGTCGGAGC TCTCGCCGCT CACGGCCGAC CAGCTCGGCC GCCTCGCGCT CGAGGCGGGC ATCCCGGCCG GCGTGCTCAA CGTCGTGCAG GGCTACGGCG CGACGGCGGG CGACGCGCTC GTGCGGCATC CGGACGTGCG CGCGGTGTCG TTTACGGGCG GCACGGTGAC GGGCAAGCGG ATCATGGAGC GCGCGGGCCT GAAGAAATAC TCGATGGAGC TGGGCGGCAA GTCGCCCGTG CTGATCTTCG ACGACGCCGA TTTCGACCGC GCGCTCGACG CGTCGCTCTT CACGATCTTC TCGATCAACG GCGAGCGCTG CACCGCGGGC TCGCGGATCT TCGTGCAGCG CACGATCTAC GACAGGTTCG TCGCGGAGTT CGCGCGGCGC GCGAACAACC TGATCGTCGG CGATCCGGCC GACGAGAGCA CGCAGGTGGG CTCGATGATC ACGCGCGCGC ACTGGGAAAA AGTGACGGGC TATGTCCGGC TCGGCGTCGA GGAGGGCGCG CGGCTCGTGG CCGGCGGCCC GGACAAGCCG GCGAATCTCC CCGCGCATCT CGCGAACGGC AATTTCGTGC GGCCGAGCGT GTTCGCCGAC GTCGACAACC GGATGCGGAT CGCGCAGGAA GAGATCTTCG GGCCGGTCGC GTGCCTGATT CCGTTCGACG GCGAGGAAGA CGGGCTGCGT CTTGCCAACG ACACGGCCTA CGGTCTCGCG TCGTACCTGT GGACGCGCGA CGTCGGCCGT GCGCACCGGC TCGCGCGCGG CATCGAGGCG GGCATGGTGT TCGTCAACAG CCAGAACGTG CGCGATCTGC GCCAGCCGTT CGGCGGCGTG AAGGAATCGG GCACCGGCCG CGAGGGCGGC GAATACAGCT TCGAGGTGTT CGCCGAGATC AAGAACGTGT GCCTCTCGAT GGGCAGCCAT CACATTCCCC GCTGGGGCGT GTGA
|
Protein sequence | MGIKHWIGGR EVDSPETFTT FNPATGQPIA EVASGGAQEI DAAVRAAKDA FPKWAGTPAK ERAKLMRRLG ELIERNVPAL ADLETRDTGL PISQTRKQLI PRASENFHFF AEVCTRMNGR SYPVDDQMLN YTLYQPVGVC ALVSPWNVPF MTATWKTAPC LALGNTAVLK MSELSPLTAD QLGRLALEAG IPAGVLNVVQ GYGATAGDAL VRHPDVRAVS FTGGTVTGKR IMERAGLKKY SMELGGKSPV LIFDDADFDR ALDASLFTIF SINGERCTAG SRIFVQRTIY DRFVAEFARR ANNLIVGDPA DESTQVGSMI TRAHWEKVTG YVRLGVEEGA RLVAGGPDKP ANLPAHLANG NFVRPSVFAD VDNRMRIAQE EIFGPVACLI PFDGEEDGLR LANDTAYGLA SYLWTRDVGR AHRLARGIEA GMVFVNSQNV RDLRQPFGGV KESGTGREGG EYSFEVFAEI KNVCLSMGSH HIPRWGV
|
| |
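The sequences above are printed in space-separated blocks of ten residues. A minimal sketch of how such text might be normalized and summarized follows; the helper names `parse_grouped_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def parse_grouped_sequence(text):
    """Join whitespace-separated blocks into one uppercase sequence string."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two ten-base blocks of the gene sequence in this record.
first_blocks = "ATGGGCATCA AGCACTGGAT"
seq = parse_grouped_sequence(first_blocks)
print(len(seq), gc_fraction(seq))
```

Applied to the complete gene sequence, `len(seq)` could be compared with the Gene Length field and `gc_fraction(seq)` with the GC content field.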