Gene Bcep18194_B1781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_B1781 
Symbol 
ID3753546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007511 
Strand
Start bp2023407 
End bp2024870 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content67% 
IMG OID637766630 
Product5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase 
Protein accessionYP_372539 
Protein GI78062631 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.137819 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATCA AGCACTGGAT CGGCGGCCGG GAAGTCGAGA GCCGCGAGAC GTTCACGACG 
CTGAATCCCG CGACGGGCGA TGTCATCACC GACGTCGCAT CGGGCGGCGA GGCCGAAGTC
GACGCGGCCG TGCGCGCCGC GAAGGAAGCG TTCCCGAAAT GGGCGAGCAC GCCCGCCAAG
GAGCGCGCGA AGCTGATGCG CAAGCTCGGC GAGCTGATCG AGAAGAACGT GCCGAATCTC
GCGGCGCTGG AGACGCAGGA CACCGGCCTG CCGATCGCGC AGACGAGCAA GCAGCTGATT
CCGCGCGCAT CCGAGAACTT CAACTTCTTC GCGGAAGTGT GCGTGCAGAT GAACGGCCGC
ACCTATCCGG TCGACGACCA GATGCTGAAC TACACGCTGT ACCAGCCGGT CGGCGTGTGC
GCGCTGGTGT CGCCGTGGAA CGTGCCGTTC ATGACCGCGA CGTGGAAGAC GGCGCCGTGT
CTCGCGCTCG GCAACACGGC CGTGCTGAAG ATGTCGGAGC TGTCGCCGCT GACGGCCGAT
CAACTCGGCC GCCTCGCGCT CGAAGCCGGC ATCCCGCCGG GCGTGCTCAA CGTCGTGCAG
GGCTATGGCG CGACGGCCGG CGATGCGCTG GTGCGCCATC CGGACGTGCG TGCGGTGTCG
TTCACCGGCG GCACCGTCAC CGGCAAGCGG ATCATGGAGC GTGCCGGTCT CAAGAAGTAT
TCGATGGAAC TCGGCGGCAA GTCGCCGGTG CTGATCTTCG ACGACGCCGA TTTCGACCGC
GCGCTCGATG CATCGCTGTT CACGATCTTC TCGATCAACG GCGAACGCTG CACGGCCGGC
TCGCGGATCT TCGTGCAGCG CACGATCTAC GACCGCTTCG TGCAGGAATT CGCGCGCCGC
GCGAACAACC TCATCGTTGG CGATCCATCC GATCCGGCCA CGAATCTCGG CGCGATGATC
ACGCGCCAGC ACTGGGAGAA GGTGACGGGC TATATCCGCA TCGGCGAGCA GGAAGGCGCA
CGCGTGGTCG CGGGCGGCGC GGACAAGCCG GCGGGCCTGC CCGACCATCT GCGCAACGGC
AACTTCGTGC GGCCGACCGT GCTGGCCGAC GTCGACAACC GGATGCGCGT CGCGCAGGAA
GAGATCTTCG GGCCGGTCGC GTGCATCATC CCGTTCGAGG ACGAAGACGA CGGGCTGCGG
CTCGCGAACG ACACCGCGTA CGGCCTTGCG TCGTACATCT GGACGCAGGA CGTCGGCAAG
GTGCATCGCC TCGCGCGCGG CATCGAGGCC GGGATGGTGT TCGTGAACAG CCAGAACGTG
CGCGACCTGC GCCAGCCGTT CGGCGGCGTG AAGGAATCGG GTACCGGGCG CGAGGGCGGC
GAGTACAGTT TCGAGGTGTT CGCGGAGATC AAGAACGTGT GCATCTCGAT GGGTTCGCAC
CACATTCCGC GCTGGGGCGT GTAA
 
Protein sequence
MTIKHWIGGR EVESRETFTT LNPATGDVIT DVASGGEAEV DAAVRAAKEA FPKWASTPAK 
ERAKLMRKLG ELIEKNVPNL AALETQDTGL PIAQTSKQLI PRASENFNFF AEVCVQMNGR
TYPVDDQMLN YTLYQPVGVC ALVSPWNVPF MTATWKTAPC LALGNTAVLK MSELSPLTAD
QLGRLALEAG IPPGVLNVVQ GYGATAGDAL VRHPDVRAVS FTGGTVTGKR IMERAGLKKY
SMELGGKSPV LIFDDADFDR ALDASLFTIF SINGERCTAG SRIFVQRTIY DRFVQEFARR
ANNLIVGDPS DPATNLGAMI TRQHWEKVTG YIRIGEQEGA RVVAGGADKP AGLPDHLRNG
NFVRPTVLAD VDNRMRVAQE EIFGPVACII PFEDEDDGLR LANDTAYGLA SYIWTQDVGK
VHRLARGIEA GMVFVNSQNV RDLRQPFGGV KESGTGREGG EYSFEVFAEI KNVCISMGSH
HIPRWGV