Gene BURPS1710b_A0783 details


Gene Information

Locus tag: BURPS1710b_A0783
Symbol: aceE
ID: 3693283
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007435
Strand:
Start bp: 1030282
End bp: 1033005
Gene Length: 2724 bp
Protein Length: 907 aa
Translation table: 11
GC content: 70%
IMG OID: 637731037
Product: 2-oxoacid dehydrogenase subunit E1
Protein accession: YP_335942
Protein GI: 76818199
COG category: [C] Energy production and conversion
COG ID: [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component
TIGRFAM ID: [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type
            [TIGR03186] alpha-ketoglutarate dehydrogenase
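
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the database. Under those assumptions the record's values (2724 bp, 907 aa) are reproduced.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Start bp and End bp copied from the record above.
length = gene_length_bp(1030282, 1033005)
print(length, protein_length_aa(length))
```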


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000178151
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGATT TATCGAACGG AATTCGCCCG GTCCTCGCCA CCGCGCGCCG CCACGACGAT 
ACCGATCCGC AGGAAACCGC CGAATGGCTG GACGCGCTCG ACGGCGTCGT CGCGCACGCG
GGCGCCGAAC GCGCGCAGTA CCTGCTCGCG CAGCTCGCCG AGCACGCGGC GCGACGCCGG
CTCGCGCCGC CGCACGCGGG CGCGACGCCG TACGTGAACA CGATCCCCGT CGACGAGGAG
CCGCCTTACC CGGGCGACCT GCAGCTCGAG GAGCGCCTCG CGGCGGTGCT GCGCTGGAAC
GCGCTCGCGA TGGTGGTCCG CGCGAACCGC GCATACGGCG AGCTGGGCGG CCACATCGCG
AGCTACGCGT CGGCCGCCGA TCTGTTCGAA GTCGGCTTCA ACCATTTCTT CCGCGCGGCG
GGCGACGCGT CGGGCGGCGA TCTCGTCTAC TTCCAGCCGC ATTCGTCGCC GGGCGTCTAT
GCGCGCGCGT TCCTCGAAGG CTTCCTGAGC GACGCGCAGC TCGAGCACTA CCGGCGCGAG
ATCGCCGGGC CGGGCCTGTG CTCGTACCCG CATCCGTGGC TGATGCCCGA CTTCTGGCAA
TTTCCGACGG GCTCGATGGG CATCGGCCCG ATCAACGCGA TCTACCAGGC GCGCTTCATG
CGCTACCTGC AGAACCGCGG CCTGTTGCGC ACCGAGGGCC GCAAGGTGTG GGGCTTCTTC
GGCGACGGCG AGATGGACGA GCCCGAATCG ATCGGCGCGC TGTCGCTCGC CGCGCGCGAG
GGGCTCGACA ATCTCGTGTT CGTGATCAAC TGCAACCTGC AGCGCCTCGA CGGCCCGGTG
CGCGGCAACG GCCGCATCGT CGACGAGCTC GAATCGCAGT TCGCGGGCGC CGGCTGGAAC
GTGATCAAGG TGCTGTGGGG CTCGGACTGG GACGCGCTGT TCGCGCGCGA CGCGACGGGC
GCGCTCGCGC GCGCGTTCGC GCAAACGGTC GACGGCCAGT TCCAGACGTT CTCGGCGAAC
GACGGCGCGT ACAACCGCGC GCGCTTCTTC GGCCAGGACG ACGCGCTCGC GGCGCTCGTC
GCGCACCTGC GCGACGAGGA CATCGACCGG CTGCGCCGCG GCGGCCACGA CGCGCGCAAG
CTGTACGCCG CGTACGATCG CGCGGTCCGG CACGAAGGCC GGCCGACGGT GATCCTCGCG
AAGACGATGA AGGGCTTCGG CCTGGGCGCG ATCGGCCAGG GCCGGATGAC GACCCACCAG
CAAAAGAAGC TCGACGTCGA GCACCTGCTC GCGTTCCGCG ACCGCTTCCG ACTGCCGCTC
ACCGACGAAG ACGTCGCGCA GCTGCGTTTC TACCGACCCG AGAAAGACAG CCCGGAGATG
CGATACCTGC ATGCGCGCCG GGCTGCCCTG GGAGGCTGTC TGCCCAGAAG GCGGGCAGCG
GCGACGGCGG CATTGACCGT GCCGTCGTTG CCGTCCTGGG GGCAATTCGC GCTCGACGGC
GGCCGCGAGA TGTCGACGAC GATGGCGATC GTGCGGATGC TCGGCGGTCT GCTGAAGGAT
GCGGCGCTCG GGCCGCGCAT CGTGCCCATC GTCGCCGACG AGGCGCGCAC GTTCGGGATG
GCGAACCTGT TCCGGCAGGT CGGCATCTAC TCGCCGCTCG GCCAGCGCTA CGAGCCCGAG
GATCTCGGCT CGATGCTCTA CTACCGCGAG GACACGCGCG GGCAGATTCT CGAGGAAGGG
ATCTCGGAAG CGGGCGCGAT ATCGTCGTGG ATCGCCGCGG CGACATCGTA CAGCGTGCAC
GATCTGCCGA TGCTGCCGTT CTACATCTAC TACTCGATGT TCGGCTTCCA GCGGATCGGC
GATCTGATCT GGGCCGCGGC CGATCAGCGC GCGCGCGGCT TCCTGATCGG CGCGACGTCG
GGCAAGACGA CGCTCGGCGG CGAAGGGCTC CAGCACCAGG ACGGCACGAG CCATCTCGCG
GCGTCGACGG TGCCGAACTG CCGCGCGTGG GATCCGGCGT TCGCCTACGA AATCGCGGTG
ATCGTCGACG AAGGAATGCG CGAGATGGTC GAGCGGCAGC GCGACACGTT CTACTACCTG
ACCGTCACGA ACGAGAACCA CGCGCAGCCG ACGCTGCCCG CGGACCGGCT CGACGCGGTG
CGCGGCGGCA TCCTGAAAGG CATGTATCCG CTCGATGCGG CCGCGCTGCC CGCCGCGCGC
GTGCAGTTGC TCGGCTCGGG CGCGATTCTC GGCGAAGTGC AGGCGGCCGC GCGCCTGCTG
CGCGACGATT GGCGAATCGA CGCGGCCGTC TGGAGCGTGA CGAGCTTCAC CGAGCTGCAA
CGCGACGGCA TCGCGGCCGA GCGCGCGCAG CGCCTCTTCG ATGCCGACGA CGCGAACGAA
CGCTCGTCGA CGCCCCACGT CACGTCCGCG CTCGCCGCGA CGCAAGGGCC CGTGATCGCC
GCGACCGACT ATGCGCGCGC GCTGCCCGAG CTGATCCGCG CGTACGTGCC GCGCCGCTAT
GTGACGCTCG GCACCGACGG CTTCGGCCGC AGCGACACGC GCGAAGCGCT GCGCGCGTTC
TTCGAGGTCG ATCGCGGCTC GATCGTAATC GCCGCGCTGC GCGCGCTCGC CGATGACGGG
GAGGTGACGC GCACCGTCGT GCGCGACGCG ATCGCGCGAT ACGGCAAGCG GGACGCGTCG
CGCACGCCGC CGTGGGAACG GTGA
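
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. As a rough sketch (the helper names are hypothetical, and how the database itself computes its rounded GC content is not stated here), the blocks can be merged and the GC fraction computed as follows; only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above; paste the full listing to
# compute the value for the whole gene.
first_line = "ATGAACGATT TATCGAACGG AATTCGCCCG GTCCTCGCCA CCGCGCGCCG CCACGACGAT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```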
 
Protein sequence
MNDLSNGIRP VLATARRHDD TDPQETAEWL DALDGVVAHA GAERAQYLLA QLAEHAARRR 
LAPPHAGATP YVNTIPVDEE PPYPGDLQLE ERLAAVLRWN ALAMVVRANR AYGELGGHIA
SYASAADLFE VGFNHFFRAA GDASGGDLVY FQPHSSPGVY ARAFLEGFLS DAQLEHYRRE
IAGPGLCSYP HPWLMPDFWQ FPTGSMGIGP INAIYQARFM RYLQNRGLLR TEGRKVWGFF
GDGEMDEPES IGALSLAARE GLDNLVFVIN CNLQRLDGPV RGNGRIVDEL ESQFAGAGWN
VIKVLWGSDW DALFARDATG ALARAFAQTV DGQFQTFSAN DGAYNRARFF GQDDALAALV
AHLRDEDIDR LRRGGHDARK LYAAYDRAVR HEGRPTVILA KTMKGFGLGA IGQGRMTTHQ
QKKLDVEHLL AFRDRFRLPL TDEDVAQLRF YRPEKDSPEM RYLHARRAAL GGCLPRRRAA
ATAALTVPSL PSWGQFALDG GREMSTTMAI VRMLGGLLKD AALGPRIVPI VADEARTFGM
ANLFRQVGIY SPLGQRYEPE DLGSMLYYRE DTRGQILEEG ISEAGAISSW IAAATSYSVH
DLPMLPFYIY YSMFGFQRIG DLIWAAADQR ARGFLIGATS GKTTLGGEGL QHQDGTSHLA
ASTVPNCRAW DPAFAYEIAV IVDEGMREMV ERQRDTFYYL TVTNENHAQP TLPADRLDAV
RGGILKGMYP LDAAALPAAR VQLLGSGAIL GEVQAAARLL RDDWRIDAAV WSVTSFTELQ
RDGIAAERAQ RLFDADDANE RSSTPHVTSA LAATQGPVIA ATDYARALPE LIRAYVPRRY
VTLGTDGFGR SDTREALRAF FEVDRGSIVI AALRALADDG EVTRTVVRDA IARYGKRDAS
RTPPWER
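
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration under stated assumptions: translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in its permitted start codons, which this sketch ignores (the gene here begins with ATG). The `translate` function and `CODON_TABLE` name are hypothetical helpers, not a database API.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        # Codons containing unexpected characters are reported as "X".
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence above.
print(translate("ATGAACGATT TATCGAACGG AATTCGCCCG GTCCTCGCCA CCGCGCGCCG CCACGACGAT"))
print(translate("CGCACGCCGC CGTGGGAACG GTGA"))
```

The two printed fragments correspond to the start (MNDLSNGIRP VLATARRHDD) and end (RTPPWER) of the protein sequence listed above.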