Gene P9303_21291 details

Gene Information

Locus tag: P9303_21291
Symbol: pdhC
ID: 4777128
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1893868
End bp: 1895187
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 57%
IMG OID: 640087637
Product: branched-chain alpha-keto acid dehydrogenase subunit E2
Protein accession: YP_001018129
Protein GI: 124023822
COG category: [C] Energy production and conversion
COG ID: [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes
TIGRFAM ID:
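
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1893868
END_BP = 1895187
REPORTED_GENE_LENGTH = 1320    # bp
REPORTED_PROTEIN_LENGTH = 439  # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    # Compare the computed values with the reported ones.
    print(length, length == REPORTED_GENE_LENGTH)
    print(protein_length(length), protein_length(length) == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions, 1895187 - 1893868 + 1 gives 1320 bp, and 1320 / 3 - 1 gives 439 residues, which agrees with the reported values.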


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.372749
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGCTC TGAGCTCAAC GATGACGGAG GGCAAGATCG TTGAGTGGCT TAAGCAACCT 
GGCGACAAGG TTGGGCGTGG TGAGTCGGTG CTTGTGGTGG AGTCAGATAA AGCGGATATG
GATGTGGAGT CATTTCAAGA TGGCTACTTG GCCGCGGTCT TGATGCCTGC TGGTCGTTCG
GCTCCAGTGG GTGAAACGAT TGGTTTGATC GTTGAAAGTG AGGCCGAAAT CGCGGCTGTT
CAGGCCAATG CCCCTGCTGC GCCAGCGTCT GATCCTGCCC CTCTCAAGGC CGCTGCAAAA
GTTGTCGATG ACCATGCCCC AGCATCTACT CCGGCGCCCG TCGTGGAGAG TCCCCCTGTT
GCTGCGCCGC CGCCTGTTAC CAGCCAAGCA GTAGAGAGTG ACAAACGCAT CGTTGCTTCC
CCGCGGGCTA AAAAACTTGC TGCGCAGATG GGTGTTGATC TGGCCAAGTT GAGAGGTAGC
GGACCCCATG GCCGTATCCA GGCTGAAGAC GTGCAGCTGG CTGCAGGTCA GCCGATCAGT
GTGCCTCAGG TTGCTGAAGG AAACGCTTCT TTCGCAACGA CGCATGCAAC TTCTGCAGGC
GTTGCTCATG CAGTGTCATC TCCTGTAGGT CAGAGCTTTG GGGCCCCGGG AGAAACCGCA
GCCTTCAACA ACCTCCAACA AGCGGTCAAC CGCAATATGG AGGCCAGTTT GGCCTTCCCC
TGCTTCAGGG TTGGCTACAC GATCACGACT GATCAGTTGG ATGCTTTTTA CAAGCAGGTG
AAGCCTAAGG GCGTCACGAT GACAGCCCTT CTGGCCAAAG CCGTGGCCTT GACGCTTGTG
CGTCATCCCC AGGTGAATGC TGCCTACAGC ACTGCTGGGA TGGTTTATCC AGAGCAGGTG
AATGTTGCTA TTGCAGTGGC GATGGACGAT GGCGGTCTGA TTACACCGGT TTTGCAGAAT
GCTGATCGCA CTGATCTCTA TGAGATGTCG CGGCAGTGGG CCGATCTTGT GAAGCGTTCA
CGCAGCAAGC AGCTGCAACC CGAGGAATAC AGCACTGGTA CTTTCACACT CTCCAATCTG
GGCATGTTTG GTGTGGATCG CTTTGATGCA ATCTTGCCCC CTGGCACTGG CGCAATTTTG
GCGGTAGCTG CATCGCGGCC TGCTGTGGTG GCAGGAAAGG ATGGCTCGAT TGGGGTCAAG
CGCCAGATGC AGGTGAACCT CACTGCCGAC CATCGCGTGA TTTATGGCGC CGATGGGGCG
GCCTTCCTTA AGGACCTGGC AGAGCTGATT GAGACGCGGG TAGAGAGTTT GGCGCTCTGA
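
The gene sequence above is printed in blocks of 10 bases, so it has to be cleaned before any computation such as the GC content reported in the gene table. The following is a minimal sketch, assuming GC content means the fraction of G and C bases over the full sequence. The helper names `clean_sequence` and `gc_content` are hypothetical, and the demo uses only the first line of the sequence rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence; upper-case it."""
    return "".join(text.split()).upper()


def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return gc / len(sequence)


# First line of the gene sequence shown above (six blocks of 10 bases).
first_line = "ATGCCTGCTC TGAGCTCAAC GATGACGGAG GGCAAGATCG TTGAGTGGCT TAAGCAACCT"
seq = clean_sequence(first_line)

# Print the cleaned length and the GC content as a rounded percentage.
print(len(seq), f"{gc_content(seq):.0%}")
```

Passing the full 1320-base sequence to `gc_content` instead of the single line would give a figure to compare against the reported GC content.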
 
Protein sequence
MPALSSTMTE GKIVEWLKQP GDKVGRGESV LVVESDKADM DVESFQDGYL AAVLMPAGRS 
APVGETIGLI VESEAEIAAV QANAPAAPAS DPAPLKAAAK VVDDHAPAST PAPVVESPPV
AAPPPVTSQA VESDKRIVAS PRAKKLAAQM GVDLAKLRGS GPHGRIQAED VQLAAGQPIS
VPQVAEGNAS FATTHATSAG VAHAVSSPVG QSFGAPGETA AFNNLQQAVN RNMEASLAFP
CFRVGYTITT DQLDAFYKQV KPKGVTMTAL LAKAVALTLV RHPQVNAAYS TAGMVYPEQV
NVAIAVAMDD GGLITPVLQN ADRTDLYEMS RQWADLVKRS RSKQLQPEEY STGTFTLSNL
GMFGVDRFDA ILPPGTGAIL AVAASRPAVV AGKDGSIGVK RQMQVNLTAD HRVIYGADGA
AFLKDLAELI ETRVESLAL
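
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under stated assumptions, not the pipeline used to produce the record: it uses the standard codon assignments, which translation table 11 shares for elongation codons, and it ignores the alternative start codons that table 11 allows (this gene starts with ATG, so that simplification should not matter here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed first, so block-formatted input is accepted.
    Codons containing unexpected characters are translated as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
print(translate("ATGCCTGCTC TGAGCTCAAC GATGACGGAG"))  # MPALSSTMTE
```

The printed residues match the first block of the protein sequence, and translating the full gene sequence the same way can be used to check the remaining 429 residues.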