Gene Information |
Locus tag | BTH_II1733 |
Symbol | hpaE |
ID | 3844731 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 2095849 |
End bp | 2097312 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637839034 |
Product | 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
Protein accession | YP_439927 |
Protein GI | 83717784 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
|
|
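The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the terminal stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2095849, 2097312)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the results agree with the Gene Length (1464 bp) and Protein Length (487 aa) fields listed above.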
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.080834 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCATCA AGCACTGGAT CGGCGGGCGC GAAGTCGACA GCCGGGAAAC GTTCACGACG TTCAATCCGG CGACGGGCGA GCCGATCGCC GAGGTCGCGT CGGGCGGCGC GCAGGAAATC GACGCGGCGG TGCGCGCCGC GAAGGACGCG TTTCCGAAAT GGGCGAACAC GCCGGCGAAG GAGCGCGCGA AGCTGATGCG CAAGCTCGGC GAGCTGATCG AGCGCAACGT GCCGGCGCTC GCCGATCTGG AGACGCAGGA CACCGGTCTG CCGATCTCGC AGACGAAGAA GCAGTTGATT CCGCGCGCGT CGGAGAACTT CCATTTCTTC GCCGAAGTGT GCACGCAGAT GAACGGCCGC AGCTATCCGG TCGACGACCA GATGCTGAAC TACACGCTGT ACCAGCCGGT CGGCGTGTGC GCGCTCGTGT CGCCGTGGAA CGTGCCGTTC ATGACCGCGA CGTGGAAGAC CGCGCCGTGT CTCGCGCTCG GCAACACTGC GGTGCTGAAG ATGTCCGAAC TCTCGCCGCT CACGGCCGAC CAGCTCGGCC GCCTCGCGCT CGAGGCGGGC ATTCCGGCGG GCGTGCTGAA CGTCGTGCAG GGCTACGGCG CGACGGCGGG CGACGCGCTC GTGCGGCATC CGGACGTGCG CGCGGTGTCG TTTACGGGCG GCACCGTGAC GGGCAAGCGG ATCATGGAGC GCGCGGGCCT GAAGAAATAT TCGATGGAGC TGGGCGGCAA GTCGCCCGTG CTGATCTTCG ACGATGCGGA TTTCGACCGA GCGCTCGACG CGTCGCTCTT CACGATCTTC TCGATCAACG GCGAGCGCTG CACCGCGGGC TCGCGGATCT TCGTGCAGCG GACGATCTAC GACAAGTTCG TCGCCGAGTT CGCGCGCCGC GCGAACAACC TGATCGTCGG CGATCCGGCC GACGAGAGCA CGCAGGTAGG CTCGATGATC ACGCGCGCGC ATTGGGAGAA GGTGACGGGC TACATCCGGC TCGGCATCGA GGAGGGCGCG CGGCTCGTGG CGGGCGGCCC GGACAAGCCG GCGAATCTTC CCGCGCATCT CGCGAACGGC AACTTCGTGC GGCCGACCGC GTTCGCCGAC GTCGACAACC GGATGCGGAT CGCGCAGGAG GAGATCTTCG GGCCGGTGGC GTGCCTGATT CCATTCGACG GCGAGGAGGA CGGGCTGCGT CTCGCGAACG ACACGTCGTA CGGCCTCGCG TCGTACCTGT GGACGCAGGA CGTCCGCCGC GCGCACCGGC TCGCGCGCGG GATCGAGGCG GGCATGGTGT TCGTCAACAG CCACAACGTG CGTGACCTGC GCCAGCCGTT CGGCGGCGTG AAGGAATCGG GCACCGGCCG CGAAGGCGGC GAATACAGCT TCGAGGTGTT CGCCGAGATC AAGAACGTGT GCATCTCGAT GGGTGGCCAT CACATTCCCC GCTGGGGCGT GTGA
|
Protein sequence | MGIKHWIGGR EVDSRETFTT FNPATGEPIA EVASGGAQEI DAAVRAAKDA FPKWANTPAK ERAKLMRKLG ELIERNVPAL ADLETQDTGL PISQTKKQLI PRASENFHFF AEVCTQMNGR SYPVDDQMLN YTLYQPVGVC ALVSPWNVPF MTATWKTAPC LALGNTAVLK MSELSPLTAD QLGRLALEAG IPAGVLNVVQ GYGATAGDAL VRHPDVRAVS FTGGTVTGKR IMERAGLKKY SMELGGKSPV LIFDDADFDR ALDASLFTIF SINGERCTAG SRIFVQRTIY DKFVAEFARR ANNLIVGDPA DESTQVGSMI TRAHWEKVTG YIRLGIEEGA RLVAGGPDKP ANLPAHLANG NFVRPTAFAD VDNRMRIAQE EIFGPVACLI PFDGEEDGLR LANDTSYGLA SYLWTQDVRR AHRLARGIEA GMVFVNSHNV RDLRQPFGGV KESGTGREGG EYSFEVFAEI KNVCISMGGH HIPRWGV