Gene BURPS1710b_A2265 details

Gene Information

Locus tag: BURPS1710b_A2265
Symbol: hpaE
ID: 3694311
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007435
Strand:
Start bp: 2754537
End bp: 2756000
Gene Length: 1464 bp
Protein Length: 487 aa
Translation table: 11
GC content: 68%
IMG OID: 637732519
Product: 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
Protein accession: YP_337416
Protein GI: 76817682
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
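
The start, end, gene length and protein length fields listed above are mutually consistent, and a short Python sketch can check this. It assumes the coordinates are 1-based and inclusive and that the gene length includes one stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2754537, 2756000)
print(length)                     # 1464
print(protein_length_aa(length))  # 487
```

Under these assumptions the results match the listed Gene Length (1464 bp) and Protein Length (487 aa).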


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCATCA AGCACTGGAT CGGCGGCCGC GAGGTCGACA GCCCGGAAAC GTTCACGACG 
TTCAATCCGG CGACGGGCGA GCCGATCGCC GAGGTCGCCT CGGGCGGCGC GCAGGAAATC
GACGCGGCGG TGCGCGCCGC GAAGGACGCG TTTCCGAAAT GGGCAGGCAC GCCCGCGAAG
GAGCGCGCGA AGCTGATGCG CCGGCTGGGA GAGCTGATCG AGCGCAACGT GCCGGCGCTC
GCCGACCTGG AGACGCGCGA CACCGGCCTG CCGATCTCGC AGACGAGAAA GCAACTGATT
CCGCGCGCAT CGGAGAACTT CCATTTCTTC GCCGAGGTGT GCACGCGGAT GAACGGGCGC
AGCTATCCGG TCGACGACCA GATGCTGAAC TACACGCTGT ATCAGCCGGT GGGCGTGTGC
GCGCTCGTGT CGCCGTGGAA CGTGCCGTTC ATGACCGCGA CGTGGAAGAC CGCGCCGTGC
CTCGCGCTCG GCAACACCGC GGTGCTGAAG ATGTCGGAGC TCTCGCCGCT CACGGCCGAC
CAGCTCGGCC GCCTCGCGCT CGAGGCGGGC ATCCCGGCCG GCGTGCTCAA CGTCGTGCAG
GGCTACGGCG CGACGGCGGG CGACGCGCTC GTGCGGCATC CGGACGTGCG CGCGGTGTCG
TTTACGGGCG GCACGGTGAC GGGCAAGCGG ATCATGGAGC GCGCGGGCCT GAAGAAATAC
TCGATGGAGC TGGGCGGCAA GTCGCCCGTG CTGATCTTCG ACGACGCCGA TTTCGACCGC
GCGCTCGACG CGTCGCTCTT CACGATCTTC TCGATCAACG GCGAGCGCTG CACCGCGGGC
TCGCGGATCT TCGTGCAGCG CACGATCTAC GACAGGTTCG TCGCGGAGTT CGCGCGGCGC
GCGAACAACC TGATCGTCGG CGATCCGGCC GACGAGAGCA CGCAGGTGGG CTCGATGATC
ACGCGCGCGC ACTGGGAAAA AGTGACGGGC TATGTCCGGC TCGGCGTCGA GGAGGGCGCG
CGGCTCGTGG CCGGCGGCCC GGACAAGCCG GCGAATCTCC CCGCGCATCT CGCGAACGGC
AATTTCGTGC GGCCGAGCGT GTTCGCCGAC GTCGACAACC GGATGCGGAT CGCGCAGGAA
GAGATCTTCG GGCCGGTCGT GTGCCTGATT CCGTTCGACG GCGAGGAACA CGGGCTGCGT
CTTGCCAACG ACACGGCCTA CGGTCTCGCG TCGTACCTGT GGACGCGCGA CGTCGGCCGC
GCGCACCGGC TCGCGCGCGG CATCGAGGCG GGCATGGTGT TCGTCAACAG CCAGAACGTG
CGCGATCTGC GCCAGCCGTT CGGCGGCGTG AAGGAATCGG GCACCGGCCG CGAGGGCGGC
GAATACAGCT TCGAGGTGTT CGCCGAGATC AAGAACGTGT GCCTCTCGAT GGGCAGCCAT
CACATTCCCC GCTGGGGCGT GTGA
 
Protein sequence
MGIKHWIGGR EVDSPETFTT FNPATGEPIA EVASGGAQEI DAAVRAAKDA FPKWAGTPAK 
ERAKLMRRLG ELIERNVPAL ADLETRDTGL PISQTRKQLI PRASENFHFF AEVCTRMNGR
SYPVDDQMLN YTLYQPVGVC ALVSPWNVPF MTATWKTAPC LALGNTAVLK MSELSPLTAD
QLGRLALEAG IPAGVLNVVQ GYGATAGDAL VRHPDVRAVS FTGGTVTGKR IMERAGLKKY
SMELGGKSPV LIFDDADFDR ALDASLFTIF SINGERCTAG SRIFVQRTIY DRFVAEFARR
ANNLIVGDPA DESTQVGSMI TRAHWEKVTG YVRLGVEEGA RLVAGGPDKP ANLPAHLANG
NFVRPSVFAD VDNRMRIAQE EIFGPVVCLI PFDGEEHGLR LANDTAYGLA SYLWTRDVGR
AHRLARGIEA GMVFVNSQNV RDLRQPFGGV KESGTGREGG EYSFEVFAEI KNVCLSMGSH
HIPRWGV
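
The relationship between the gene sequence and the protein sequence can be reproduced with a small Python sketch. It is a minimal illustration, not the tool used to build this record: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to internal codons as the standard code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG).

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 24 bases of the gene sequence in this record.
print(translate("ATGGGCATCA AGCACTGGAT CGGCGGCCGC"))  # MGIKHWIGGR
print(translate("CACATTCCCC GCTGGGGCGT GTGA"))        # HIPRWGV
```

The two outputs correspond to the first ten and last seven residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.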