Gene BURPS1106A_2666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2666 
SymbolpdhB 
ID4899693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp2626342 
End bp2627988 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content70% 
IMG OID640135893 
Productdihydrolipoamide acetyltransferase 
Protein accessionYP_001066919 
Protein GI126452707 
COG category[C] Energy production and conversion
[I] Lipid transport and metabolism 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes
[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR01348] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCAAG CGATCGAAGT CAAGGTGCCG GATATCGGCG ATTACAAGGA CGTGCCCGTC 
ATCGAAGTGC TCGTGAAGCC GGGCGATGCG GTCGAGCCCG AGCAGTCGCT CGTCACGCTC
GAATCGGACA AGGCGACGAT GGACGTGCCG AGCCCGTCGG CGGGCACGGT CAAGGAAGTG
AAGGTGAAGG TCGGCGACGC GGTGTCGCAA GGCTCGCTGA TCGTGCTGCT CGACGGCGCG
CAGGCGGCGG CCCAGCCCGC GCAGGCGAAC GGCGCCGCGA CGAGCGCCGC GCAGCCGGCG
GCGGCGCCCG CTGCCGCGCC TGCGCCGGCG GCGGCCGCGG GCGGCGGCAC GGTCGACGTG
AAGGTGCCGG ACATCGGCGA CTACAAGGAC GTGCCCGTCA TCGAGATCGC CGTGAAGATC
GGCGACACGG TCGAGAAGGA GCAGTCGCTC GTCACGCTCG AATCGGACAA GGCGACGATG
GACGTGCCGA GCCCGGCCGC GGGCGTCGTC AAGGACATCA AGGTGAAGGT CGGCGATGCG
GTGTCGGAAG GTTCGCTGAT CGTCGTGCTC GAAGCATCGG GCGGCGCCGC CGCGAGCGCG
CCGCAGGCGG CCGCGCCCGC CCCCGCGCCG GCGCCCGCGC CCGCGCCCGC GCCCGCGCCG
CAGGCCGCAC CCGCGGCTGC GCCGGCCCCC GCGCAGGCAC CGGCACCCGC CGCGAGCGGC
GAGTACCGCG CGAGCCACGC GTCGCCGTCG GTGCGCAAGT TCGCGCGCGA GCTCGGCGTC
GACGTGTCGC GCGTCACGGG CACGGGGCCG AAGAGCCGCA TCACGAAGGA CGACGTCACC
GCGTTCGTGA AGGGCGTGAT GACGGGACAG CGCGCGGCGC CCGGCGCCGC GGCCGCGCCC
GCGGGCGGCG GCGAGCTGAA CCTGCTGCCG TGGCCGAAGG TCGACTTCTC GAAGTTCGGC
CCGTTCGAGG CGAAGCCGCT GTCGCGCATC AAGAAGATCT CGGGCGCGAA CCTGCATCGC
AACTGGGTGA TGATCCCGCA CGTCACGAAC AACGACGAGG CGGACATCAC CGAGCTCGAA
GCGCTGCGCG TGCAACTGAA CAAGGAGCAC GAGAAGGCGG GCGTGAAGTT CACGATGCTC
GCGTTCGTGA TCAAGGCGGT CGTCGCCGCG CTGAAGAAGT TCCCGACCTT CAACGCGAGC
CTCGATGGCG ACAACCTCGT GTTCAAGCAG TACTACCACA TCGGTTTCGC CGCCGACACG
CCGAACGGCC TCGTCGTGCC GGTGATCCGC GACGCGGACA AGAAGGGGCT CGTCGACATC
GCGAAGGAAA TGGCCGAGCT GTCGAAGGCC GCGCGCGAAG GCAAGCTCAA GCCGGACCAG
ATGCAGGGCG GCTGCTTCTC GATCTCGTCG CTCGGCGGGA TCGGCGGCAC GCACTTCACG
CCGATCATCA ATGCGCCGGA AGTGGCGATC CTCGGGCTGT CGCGCGGCCA GATGAAGCCG
GTGTGGGACG GCAAGCAGTT TGTGCCGCGC CTCACGCTGC CGCTGTCGCT GTCGTATGAC
CATCGCGTGA TCGATGGCGC GGAAGCCGCG CGGTTCAATG CGTATCTCGG CGCGTTGCTT
GCCGATTTCC GTCGCATCAT TCTTTGA
 
Protein sequence
MSQAIEVKVP DIGDYKDVPV IEVLVKPGDA VEPEQSLVTL ESDKATMDVP SPSAGTVKEV 
KVKVGDAVSQ GSLIVLLDGA QAAAQPAQAN GAATSAAQPA AAPAAAPAPA AAAGGGTVDV
KVPDIGDYKD VPVIEIAVKI GDTVEKEQSL VTLESDKATM DVPSPAAGVV KDIKVKVGDA
VSEGSLIVVL EASGGAAASA PQAAAPAPAP APAPAPAPAP QAAPAAAPAP AQAPAPAASG
EYRASHASPS VRKFARELGV DVSRVTGTGP KSRITKDDVT AFVKGVMTGQ RAAPGAAAAP
AGGGELNLLP WPKVDFSKFG PFEAKPLSRI KKISGANLHR NWVMIPHVTN NDEADITELE
ALRVQLNKEH EKAGVKFTML AFVIKAVVAA LKKFPTFNAS LDGDNLVFKQ YYHIGFAADT
PNGLVVPVIR DADKKGLVDI AKEMAELSKA AREGKLKPDQ MQGGCFSISS LGGIGGTHFT
PIINAPEVAI LGLSRGQMKP VWDGKQFVPR LTLPLSLSYD HRVIDGAEAA RFNAYLGALL
ADFRRIIL