Gene Bpro_2672 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_2672 
SymbolaceE 
ID4014665 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp2808568 
End bp2811309 
Gene Length2742 bp 
Protein Length913 aa 
Translation table11 
GC content61% 
IMG OID637942334 
Productpyruvate dehydrogenase subunit E1 
Protein accessionYP_549486 
Protein GI91788534 
COG category[C] Energy production and conversion 
COG ID[COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component 
TIGRFAM ID[TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000588589 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCCA ATCCAAATGA AGCCGGTGCC GACAACCTAC GCGCCGCCGC GAACGATGCC 
TTGGGTGAGA ACACCCTGGA CAAAGACAAG CAGGAGACCC GGGAGTGGAT GGATGCCCTC
TCGGCGGTCA TTGAAAGCGA GGGCCCGGAA CGTGCCCACT TCCTGCTTGA GCAACTGCTC
GAACATGCCC GCCAGAAAAG CATTGACATG CCTTTCTCGG CCAACACCGC TTACGTCAAC
ACCATAGAAA CCGACCAGGA AGAGCGCTCC CCCGGCAACC TCGAAATTGA AGAACGCCTG
CGCGCCTACA TGCGCTGGAA TGCCATGGCC ATGGTGGTCA AGGCCAACCG CCTGCATCCC
GCGGATGGCG GCGATCTGGG TGGACATATC GGCTCGTTTG CCTCGCTGGC CAGTCTCTTT
GGTGCGGGCT TCAACCACTT CTGGCATGCC GAGAGCGAGA ACCATGGCGG CGATTGCCTC
TACATCCAGG GGCATGTGTC GCCTGGCGTC TACGCACGCG CTTACCTTGA AGGCCGCCTG
ACGGAAGAGC AACTGCTCAA TTTCCGCCAG GAAGTTGCTG GCAAGGGGCT TTCCAGCTAC
CCGCACCCCA AGCTGATGCC CGAATTCTGG CAGTTCCCCA CGGTCTCCAT GGGCCTGGGC
CCCCTGATGG CGATTTACCA GGCCAGGTTC CTCAAGTACC TGCACGCCCG CGGTATTGCC
AACACCGAAA ACCGCAAGGT CTGGGTGTTC TGTGGCGACG GTGAGATGGA TGAAGTCGAA
TCGATGGGCG CCATCGGGCT GGCCGCGCGC GAGAAGCTCG ACAACCTGGT GTTCGTCATC
AACTGCAACC TGCAGCGTCT GGACGGCCCG GTCCGCGGCA ACGGCAAGAT CATCCAGGAA
CTCGAAGGCG AGTTCCGCGG CGCCGGCTGG AATGTCATCA AGCTGATCTG GGGCAGCAAC
TGGGATCCAT TGCTGGCGCG CGACAAGGAC GGCGCCTTGC GCAAAGTCAT GATTGACACG
CTGGACGGCG ACTACCAGGC CATGAAGGCC AACGACGGCG CCTACGTGCG CAAGCATTTC
TTTGGCCAGA ACCCCAAGAC GCTGGAGATG GTCTCCAAGA TGAGCGACGA CGACATCTGG
AACCTGCGCC GTGGCGGCCA CGACTCGCAA AAGGTGTATG CGGCATTCCA TGCGGCCGTC
AATCACACCG GCCAGCCCAC AGTGCTGCTG ATCAAGACCG TCAAGGGTTT TGGCATGGGC
AAGATTGGCG AGGGCAAAAA CACGGTGCAT CAGACCAAGA AACTGACGGA CGATGACATC
AAGATCTTCC GCGACCGCTT CAACATCCCG ATTCCCGACA GCCAGCTGGC CGACCTGCCG
TTTTACAAGC CGGCCGATGA CACCCCGGAA ATGCGCTATC TGCACGAGCG GCGCAAGGCC
CTGGGCGGTT ACCTGCCGCA CCGCCGCGTC AAGGCCGACG AGAGCTTTAC CGTGCCTGCG
CTCGAAACCT TCAAGGCCGT GCTCGAGCCG ACCGCCGAAG GCCGTGAAAT TTCCACCACG
CAGGCCTACG TGCGCTTCCT CACGCAGCTG CTGCGCGACC AGGCACTCGG CCCACGCGTG
GTGCCCATCC TGGTCGATGA GGCCCGCACC TTCGGCATGG AAGGCCTGTT CCGCCAGGTG
GGCATCTACA ACCCTGATGG CCAGAAGTAC ACGCCGGTCG ATAAAGACCA GGTGATGTAT
TACAAGGAAG ACGCCAAGGG CCAGATCCTG CAAGAGGGCA TCAACGAGGC TGGTGGCATG
AGCAGCTGGA TCGCAGCGGC CACTTCGTAC AGCACCAATA ACAGGATCAT GGTCCCGTTC
TACGTGTACT ACTCGATGTT CGGTTTCCAG CGCATCGGCG ACCTGGCCTG GGCGGCTGGC
GACATGCAGG CACGTGGCTT CCTGTTGGGC GGCACCTCTG GCCGGACCAC ACTGAATGGC
GAAGGCCTGC AGCACGAAGA CGGCCACAGC CACATCCTGG CCGGCACGAT TCCCAACTGC
ATCAGCTACG ACCCGACCTT TGCCCATGAG GTGGGTGTCA TCCTGCACCA TGGCTTGAAG
CGTATGGTGG AAAAGCAGGA CAACGTGTAT TTTTACCTCA CGCTGCTCAA TGAAAACTAT
CCGATGCCAG GCCTCCAGCC CGGTACCGAA GAACAGATCA TCAAGGGCAT GTACCTCTGC
AAGGAAGGCG CCAAGCTCAC GCCGCGCGTA CAACTGCTGG GCTCCGGCAC CATCCTGCGT
GAATCGATTG CCGCGCAGGA ACTGCTTGAG AAAGAGTGGG GTGTTGCCGC CAACGTGTGG
AGCTGCCCGA GCTTCAATGA GTTGGCCCGC GATGGCCAGA ACGCCGAACG CTGGAACCTG
CTGCACCCGA CCGACAAACC CCGCGTGCCT TTTGTGGGCG AGCAGCTCGA CAAGCACGCC
GGCCCGGTCG TTGCATCGAC CGACTACATG AAGGCTTACG CCGAGCAGAT CCGGCCGTTT
ATCCCCAAGG GGCGCACCTA CAAGGTCCTG GGCACCGATG GATTTGGCCG CAGCGACTTC
CGCAGCAAGC TGCGCGAGCA CTTTGAAATC AACCGCCACT ACATCGTGAT CGCGGCCCTG
AAGGCGCTCA GCGAAGAGGG TACGGTTCCG GTTGCCAAGG TTGCCGAAGC CATCAAGAAG
TACGGTATCA ATGCCGACAA GATCAATCCC CTTTATGCCT GA
 
Protein sequence
MAANPNEAGA DNLRAAANDA LGENTLDKDK QETREWMDAL SAVIESEGPE RAHFLLEQLL 
EHARQKSIDM PFSANTAYVN TIETDQEERS PGNLEIEERL RAYMRWNAMA MVVKANRLHP
ADGGDLGGHI GSFASLASLF GAGFNHFWHA ESENHGGDCL YIQGHVSPGV YARAYLEGRL
TEEQLLNFRQ EVAGKGLSSY PHPKLMPEFW QFPTVSMGLG PLMAIYQARF LKYLHARGIA
NTENRKVWVF CGDGEMDEVE SMGAIGLAAR EKLDNLVFVI NCNLQRLDGP VRGNGKIIQE
LEGEFRGAGW NVIKLIWGSN WDPLLARDKD GALRKVMIDT LDGDYQAMKA NDGAYVRKHF
FGQNPKTLEM VSKMSDDDIW NLRRGGHDSQ KVYAAFHAAV NHTGQPTVLL IKTVKGFGMG
KIGEGKNTVH QTKKLTDDDI KIFRDRFNIP IPDSQLADLP FYKPADDTPE MRYLHERRKA
LGGYLPHRRV KADESFTVPA LETFKAVLEP TAEGREISTT QAYVRFLTQL LRDQALGPRV
VPILVDEART FGMEGLFRQV GIYNPDGQKY TPVDKDQVMY YKEDAKGQIL QEGINEAGGM
SSWIAAATSY STNNRIMVPF YVYYSMFGFQ RIGDLAWAAG DMQARGFLLG GTSGRTTLNG
EGLQHEDGHS HILAGTIPNC ISYDPTFAHE VGVILHHGLK RMVEKQDNVY FYLTLLNENY
PMPGLQPGTE EQIIKGMYLC KEGAKLTPRV QLLGSGTILR ESIAAQELLE KEWGVAANVW
SCPSFNELAR DGQNAERWNL LHPTDKPRVP FVGEQLDKHA GPVVASTDYM KAYAEQIRPF
IPKGRTYKVL GTDGFGRSDF RSKLREHFEI NRHYIVIAAL KALSEEGTVP VAKVAEAIKK
YGINADKINP LYA