Gene P9211_04031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_04031 
SymbolodhB 
ID5731356 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp378671 
End bp380041 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content45% 
IMG OID641284760 
Productbranched-chain alpha-keto acid dehydrogenase subunit E2 
Protein accessionYP_001550288 
Protein GI159902944 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID[TIGR01349] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.261973 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.760936 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCAACAC ACGACATTTT TATGCCCGCT CTTAGTTCCA CTATGACGGA GGGGAAAATA 
GTTGAATGGC TTAAGAATCC CGGCGAAAAA GTTGCCCGAG GAGAGGCTGT TCTTGTAGTG
GAGTCTGATA AGGCTGACAT GGAAGTCGAG TCATTTCAGG ATGGATACCT CGCTGCAGTG
CTTATGCCAG CAGGCAGTAC AGCCCCGGTT GGTGAAATAA TTGGTTTGAT TGTTGAAACA
GAAGATCAGA TTGCTGAAGT AAAAGCTAAG AATCCGACAA AAGATCAGGC CTCAAAAGAA
GTCAGCTCAA GCGACTCTGA ATCTTCTAAG CAGACACTTG AAGTAGCCTC TCAAGATCAA
GGATCTGTTT TAGAAGTTCA AGCATCAAAA AAAGCTGAAT CTTTGCCTCC TCGAGCCGTT
GTGAATGATG GTCGTATCAT TGCCACCCCT AGAGCCAGAA AGCTTGCCTC GCAATTAGGC
GTAGACTTGG CAACTGTGCT TGGGACAGGA CCGCATGGAC GAATTCAAGC TGAAGATGTT
CAAACTGCCC AAGGACAACC AATTACTGTC CCATGGGTGG CAGAAAGTGA TGCACCAGCA
CGATTAGAGG TCTTCAACTC TCAAGCAGCT AATACAGGCG CTCCTCAAGA AGAGACTAAG
GTGAATGAAG CTCCCAAGGG TAATAGTTTT GGGGCCCCTG GGGAGACAGT CTCATTCAAT
ACTCTTCAGC AAGCAGTCAA TAGAAATATG GAGGCAAGCT TATCTATTCC TTGTTTTAGG
GTGGGTTATT CAATAAATAC AGACAAGCTT GATATTTTTT ATAAGCAAGT AAAACCTAAT
GGAGTCACTA TGACTGCTTT ATTGGCTAAA GCAGTTGGGA AGACGCTTGC TCGACATCCT
CAATTGAATG CAGCGTGCAG TAATGAAGGC ATGTCTTATC CAGAGCAAGT TAATGTAGCA
GTTGCAGTTG CAATGGAAGA AGGGGGGCTA ATCACACCGG TACTTCAGAA TGCAGATACT
ACGGATCTAT TTGAGTTGTC ACGTCAATGG GCAGATTTAG TTAAACGTTC AAGATCAAAA
CAACTTCAAC CTAATGAATA CAGTAGTGGC ACTTTTACCA TTTCTAATTT AGGAATGTTT
GGCGTAGACC GTTTTGATGC CATACTCCCA CCTGGGACTG GTGCAATCTT GGCAATTGCT
GCTTCAATTC CTCAGGTCGT GGCAGCTAAG GATGGGTCGA TGGCTGTAAA ACGCCAAATG
CAAGTGAATC TTACTGCCGA TCACCGAGTT ATTTATGGTG CGGATGGCGC AGCATTTTTG
AAAGACCTTT CAAGGTTGAT TGAAAACAAC CCTGAACAAC TTGCTACTTA A
 
Protein sequence
MATHDIFMPA LSSTMTEGKI VEWLKNPGEK VARGEAVLVV ESDKADMEVE SFQDGYLAAV 
LMPAGSTAPV GEIIGLIVET EDQIAEVKAK NPTKDQASKE VSSSDSESSK QTLEVASQDQ
GSVLEVQASK KAESLPPRAV VNDGRIIATP RARKLASQLG VDLATVLGTG PHGRIQAEDV
QTAQGQPITV PWVAESDAPA RLEVFNSQAA NTGAPQEETK VNEAPKGNSF GAPGETVSFN
TLQQAVNRNM EASLSIPCFR VGYSINTDKL DIFYKQVKPN GVTMTALLAK AVGKTLARHP
QLNAACSNEG MSYPEQVNVA VAVAMEEGGL ITPVLQNADT TDLFELSRQW ADLVKRSRSK
QLQPNEYSSG TFTISNLGMF GVDRFDAILP PGTGAILAIA ASIPQVVAAK DGSMAVKRQM
QVNLTADHRV IYGADGAAFL KDLSRLIENN PEQLAT