Gene BURPS668_3936 details

Gene Information

Locus tag: BURPS668_3936
Symbol: eutE
ID: 4883892
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 3836179
End bp: 3837891
Gene Length: 1713 bp
Protein Length: 570 aa
Translation table: 11
GC content: 67%
IMG OID: 640129864
Product: acetaldehyde dehydrogenase (acetylating)
Protein accession: YP_001060929
Protein GI: 126441324
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
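
The coordinate and length fields in this table can be cross-checked with a little arithmetic. The sketch below is illustrative only; it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with one untranslated stop codon. The variable names are made up for this example.

```python
# Values copied from the Gene Information table.
start_bp = 3836179
end_bp = 3837891

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1713 0 570
```

Under these assumptions the result agrees with the listed gene length (1713 bp) and protein length (570 aa).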


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCCGCATT TGAACAACCG GCGCGCGACG AGTGTTCAGC AATGGCGCAG TCCGCCGCGC 
CGAGCGCGCG CCGAGCCCCG CGCCGGCGCC CCGATGCACC GCGGCGCGGC GTGCTGGCAC
GGTTTTGGCG TAACGAACGG CGTCGCGCAC GTCGCGCGGC GCATCACCGA TACCGACATG
CGAGGAGCAA CGATGAATCA CGCGGACATG CAACATCTGA ACATCGAATT CCCGTACCGC
AAGCAGTACG GGAATTTCAT CGGCGGCGAA TGGGTCGCCC CGGTCGGCGG CGAGTATTTC
GACAACGTCT CGCCCGTCAC CGGCCGGCCG TTCACCGCGA TCCCTCGCTC GCGCGAAGCC
GACATCGAGC TCGCGCTCGA CGCCGCTCAC GCGGCCAAGG CGGGCTGGGC CGCGAAGGGC
GCGGCCGAGC GCGCGAACGT GCTGCTGAGG ATCGCCGACC GGATGGAGGC GAACCTCACG
CGCCTCGCCG TCGCCGAGAC GATCGACAAC GGCAAGCCGC TGCGCGAAAC CACCGCGGCC
GACGTGCCGC TCGCGATCGA CCACTTCCGC TACTTCGCGG GCTGCATCCG CGCGCAGGAA
GGCTCGATCG CCGATATCGG CGGCGACATG GTGGCCTACC ACTTCCACGA GCCGCTCGGC
GTCGTCGGCC AGATCATCCC GTGGAACTTC CCGCTGCTGA TGGCCGCGTG GAAGCTCGCG
CCGGCGCTCG CGGCCGGCAA CTGCGTCGTG CTCAAGCCGG CCGAGCAGAC GCCCGCGTCG
ATCCTCGTGT TCGCCGAGCT GATCCAGGAT CTGCTGCCGC CCGGCGTGCT CAACATCGTC
AACGGCTTCG GCCTCGAGGC CGGCAAGCCG CTCGCGTCGA GCAAGCGGAT CGCGAAGATC
GCGTTCACGG GCGAGACGTC GACGGGCCGC CTCATCATGC AGTACGCGAG CGAGAACCTG
ATTCCCGTCA CGCTCGAGCT GGGCGGCAAG AGCCCGAATA TTTTCTTCGC CGACGTGATG
GATCGCGACG ACAGCTACTT CGACAAGGCG CTCGAAGGCT TCGCGATGTT CGCGCTGAAC
CAGGGCGAAG TCTGCACGTG CCCATCGCGC GCGCTCGTCG AGGAGAGCAT CTACGATCGC
TTCATCGAAC GCGCGCTCAA GCGCGTCGAG GCGATCAAGC AGGGCCATCC GCTCGATTCG
CAGACGATGA TCGGCGCGCA GGCGTCGGCC GAGCAGCTCG AGAAGATCCT GTCGTACATC
GACATCGGCC GCGGCGAAGG CGCGCAATGC CTGACGGGCG GCGAGCGCAA CGTGCTCGGC
GGCGAGCTCG CCGAAGGCTA TTACGTGAAG CCGACCGTGT TCCGCGGCCA CAACAAGATG
CGCATCTTCC AGGAAGAAAT CTTCGGGCCG GTGCTCGCGG TGACGACGTT CAAGACCGAG
GAGGAAGCGC TCGAGATCGC GAACGACACG CTGTACGGCC TGGGCGCCGG CGTCTGGACG
CGCGACGGCA ACCGCGCGTA CCGCTTCGGC CGCGGCATCC AGGCGGGCCG CGTGTGGACG
AACTGCTATC ACGCGTATCC GGCGCACGCG GCGTTCGGCG GCTACAAGCA ATCCGGCATC
GGCCGCGAGA CGCACAAGAT GATGCTCGAC CACTACCAGC AGACGAAGAA CCTGCTCGTC
AGCTACAGCG AAAAGCCGCT CGGGTTCTTC TGA
 
Protein sequence
MPHLNNRRAT SVQQWRSPPR RARAEPRAGA PMHRGAACWH GFGVTNGVAH VARRITDTDM 
RGATMNHADM QHLNIEFPYR KQYGNFIGGE WVAPVGGEYF DNVSPVTGRP FTAIPRSREA
DIELALDAAH AAKAGWAAKG AAERANVLLR IADRMEANLT RLAVAETIDN GKPLRETTAA
DVPLAIDHFR YFAGCIRAQE GSIADIGGDM VAYHFHEPLG VVGQIIPWNF PLLMAAWKLA
PALAAGNCVV LKPAEQTPAS ILVFAELIQD LLPPGVLNIV NGFGLEAGKP LASSKRIAKI
AFTGETSTGR LIMQYASENL IPVTLELGGK SPNIFFADVM DRDDSYFDKA LEGFAMFALN
QGEVCTCPSR ALVEESIYDR FIERALKRVE AIKQGHPLDS QTMIGAQASA EQLEKILSYI
DIGRGEGAQC LTGGERNVLG GELAEGYYVK PTVFRGHNKM RIFQEEIFGP VLAVTTFKTE
EEALEIANDT LYGLGAGVWT RDGNRAYRFG RGIQAGRVWT NCYHAYPAHA AFGGYKQSGI
GRETHKMMLD HYQQTKNLLV SYSEKPLGFF
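
The protein sequence begins with M although the gene sequence begins with GTG, which would normally encode valine. This is consistent with the record's translation table 11, where GTG can act as an initiation codon. The sketch below is a minimal illustration, not the tool used to generate this record: `translate`, `CODON_TABLE`, and `START_CODONS` are hypothetical names, and the codon assignments and start-codon set are written out from the NCBI table 11 definition as recalled here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiation codon is read as methionine regardless of its
            # usual meaning.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "GTGCCGCATT TGAACAACCG GCGCGCGACG AGTGTTCAGC AATGGCGCAG TCCGCCGCGC"
print(translate(first_line))  # MPHLNNRRATSVQQWRSPPR
```

The output matches the first 20 residues of the protein sequence listed above.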