Gene Information |
Locus tag | P9211_04031 |
Symbol | odhB |
ID | 5731356 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 378671 |
End bp | 380041 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641284760 |
Product | branched-chain alpha-keto acid dehydrogenase subunit E2 |
Protein accession | YP_001550288 |
Protein GI | 159902944 |
COG category | [C] Energy production and conversion |
COG ID | [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes |
TIGRFAM ID | [TIGR01349] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form |
|
|
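The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are both inclusive and that the unspliced CDS ends with one stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming both coordinates are inclusive."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon is 3 bp; the final stop codon adds no residue.
    return cds_length_bp // 3 - 1


length_bp = gene_length(378671, 380041)
print(length_bp)                  # 1371
print(protein_length(length_bp))  # 456
```

Both results agree with the Gene Length (1371 bp) and Protein Length (456 aa) fields in the record.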
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.261973 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.760936 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCAACAC ACGACATTTT TATGCCCGCT CTTAGTTCCA CTATGACGGA GGGGAAAATA GTTGAATGGC TTAAGAATCC CGGCGAAAAA GTTGCCCGAG GAGAGGCTGT TCTTGTAGTG GAGTCTGATA AGGCTGACAT GGAAGTCGAG TCATTTCAGG ATGGATACCT CGCTGCAGTG CTTATGCCAG CAGGCAGTAC AGCCCCGGTT GGTGAAATAA TTGGTTTGAT TGTTGAAACA GAAGATCAGA TTGCTGAAGT AAAAGCTAAG AATCCGACAA AAGATCAGGC CTCAAAAGAA GTCAGCTCAA GCGACTCTGA ATCTTCTAAG CAGACACTTG AAGTAGCCTC TCAAGATCAA GGATCTGTTT TAGAAGTTCA AGCATCAAAA AAAGCTGAAT CTTTGCCTCC TCGAGCCGTT GTGAATGATG GTCGTATCAT TGCCACCCCT AGAGCCAGAA AGCTTGCCTC GCAATTAGGC GTAGACTTGG CAACTGTGCT TGGGACAGGA CCGCATGGAC GAATTCAAGC TGAAGATGTT CAAACTGCCC AAGGACAACC AATTACTGTC CCATGGGTGG CAGAAAGTGA TGCACCAGCA CGATTAGAGG TCTTCAACTC TCAAGCAGCT AATACAGGCG CTCCTCAAGA AGAGACTAAG GTGAATGAAG CTCCCAAGGG TAATAGTTTT GGGGCCCCTG GGGAGACAGT CTCATTCAAT ACTCTTCAGC AAGCAGTCAA TAGAAATATG GAGGCAAGCT TATCTATTCC TTGTTTTAGG GTGGGTTATT CAATAAATAC AGACAAGCTT GATATTTTTT ATAAGCAAGT AAAACCTAAT GGAGTCACTA TGACTGCTTT ATTGGCTAAA GCAGTTGGGA AGACGCTTGC TCGACATCCT CAATTGAATG CAGCGTGCAG TAATGAAGGC ATGTCTTATC CAGAGCAAGT TAATGTAGCA GTTGCAGTTG CAATGGAAGA AGGGGGGCTA ATCACACCGG TACTTCAGAA TGCAGATACT ACGGATCTAT TTGAGTTGTC ACGTCAATGG GCAGATTTAG TTAAACGTTC AAGATCAAAA CAACTTCAAC CTAATGAATA CAGTAGTGGC ACTTTTACCA TTTCTAATTT AGGAATGTTT GGCGTAGACC GTTTTGATGC CATACTCCCA CCTGGGACTG GTGCAATCTT GGCAATTGCT GCTTCAATTC CTCAGGTCGT GGCAGCTAAG GATGGGTCGA TGGCTGTAAA ACGCCAAATG CAAGTGAATC TTACTGCCGA TCACCGAGTT ATTTATGGTG CGGATGGCGC AGCATTTTTG AAAGACCTTT CAAGGTTGAT TGAAAACAAC CCTGAACAAC TTGCTACTTA A
|
Protein sequence | MATHDIFMPA LSSTMTEGKI VEWLKNPGEK VARGEAVLVV ESDKADMEVE SFQDGYLAAV LMPAGSTAPV GEIIGLIVET EDQIAEVKAK NPTKDQASKE VSSSDSESSK QTLEVASQDQ GSVLEVQASK KAESLPPRAV VNDGRIIATP RARKLASQLG VDLATVLGTG PHGRIQAEDV QTAQGQPITV PWVAESDAPA RLEVFNSQAA NTGAPQEETK VNEAPKGNSF GAPGETVSFN TLQQAVNRNM EASLSIPCFR VGYSINTDKL DIFYKQVKPN GVTMTALLAK AVGKTLARHP QLNAACSNEG MSYPEQVNVA VAVAMEEGGL ITPVLQNADT TDLFELSRQW ADLVKRSRSK QLQPNEYSSG TFTISNLGMF GVDRFDAILP PGTGAILAIA ASIPQVVAAK DGSMAVKRQM QVNLTADHRV IYGADGAAFL KDLSRLIENN PEQLAT
|
| |
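The gene sequence begins with TTG while the protein begins with M, which is consistent with translation table 11, where TTG is one of the permitted alternative start codons. The following is a minimal sketch of that translation in plain Python, not the database's own procedure. The names `CODON_TABLE` and `translate_cds` are hypothetical, and the code assumes the first codon is the start codon (always read as Met) and that translation stops at the first stop codon.

```python
from itertools import product

# Amino-acid assignments of NCBI translation table 11, codons ordered
# with bases T, C, A, G (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace between the 10-base groups."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any incomplete trailing codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    if not codons:
        return ""
    # Assumption: the first codon is the start codon and is read as Met,
    # even when it is an alternative start such as TTG.
    protein = ["M"]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]  # raises KeyError on non-ACGT symbols
        if aa == "*":            # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
print(translate_cds("TTGGCAACAC ACGACATTTT TATGCCCGCT"))  # MATHDIFMPA
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence should, under the same assumptions, yield a protein ending at the TAA stop codon.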