Gene P9301_04241 details

Gene Information

Locus tag: P9301_04241
Symbol: pdhC
ID: 4911603
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9301
Kingdom: Bacteria
Replicon accession: NC_009091
Strand:
Start bp: 369312
End bp: 370679
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 37%
IMG OID: 640160002
Product: branched-chain alpha-keto acid dehydrogenase subunit E2
Protein accession: YP_001090648
Protein GI: 126695762
COG category: [C] Energy production and conversion
COG ID: [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes
TIGRFAM ID: [TIGR01349] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form
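
The start, end, and length fields above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the record: the function names are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(369312, 370679)
print(length)                  # 1368
print(protein_length(length))  # 455
```

Both values agree with the Gene Length and Protein Length fields of this record.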


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTCACG AAATATTCAT GCCTGCCTTG AGTTCTACTA TGACGGAGGG CAAGATTGTG 
GAATGGTTGA AAAATCCAGG AGATAAAGTT GAAAGAGGTG AATCTGTCTT GGTTGTTGAA
TCTGACAAGG CAGATATGGA TGTTGAATCT TTTCAAGATG GATATCTTGC GGCTGTTTTA
ATGCCTGCTG GCAGCACTGC ACCAGTGGGA GAGACTATTG GCCTTATCGT AGAAAATGAG
GATGAGATAG CTTCTGTTCA AGAACAAAAT AAAGGAAATC AACCCGAAGT TTCTAGTTCG
GATCAACTTG AATTGGTAAG CAATAAAACT GAAGAAAAAC CTGTAGTTCA AAGTGAAATT
GTTGAAAAAC AAGAAAAAGA AGTCGTATTA ATGAATGAAA AGGCAGCATC ATCTTTTAAC
AGTGATCAAA TAAATGCTGC TACGAGTAAT GTTTCTTCGA GGGTGATTGC ATCTCCAAGA
GCTAAAAAAC TTGCCTCTCA AATGGGTGTT GATTTAGCAA AGGTTCATGG ATCCGGACCT
CACGGAAGGA TTCAAGCCGA TGATATTTTA AAAGCTAATG GCCAACCAGT CTCTATACCA
TGGATAGGTG AAGGTGGTTC TCCTGCAAGT ATACCTGGTG CAAATTTGGG AGTTGAAAGT
AAACCAGAAA CTTCAGGAAA TAGTTTTGGT AATCCCGGAG AAATAGTTCA ATTTAATACT
CTTCAAAAAG CGGTAAATAA AAATATGGAA TCTAGTTTAG ATGTGCCATG TTTTAGGGTG
GGTTATTCCA TTAACACAGA TAAATTAGAT AATTTCTACA AAAAAGTAAA ACAGAACGGA
GTTACTATGA CTGCTTTACT AGTAAAGGCA GTTGCAAAGA CACTAAAGAA ACATCCTCAA
GTTAACTCAA GTTTTTCAGA GAATGGAATT TCTTATCCAG AAAATATAAA TATTGCTGTT
GCTGTTGCGA TGGAAGATGG TGGACTAATA ACTCCAGTTT TAAAAGAACC ATGCAATACT
GATTTATTTG AATTGTCTAG GGAATGGAAA GATCTCGTAA AAAGATCAAG ATCAAAACAA
TTAGAACCTG ATGAGTACTC TACGGGAACC TTCACTTTAT CTAACCTTGG GATGTTTGGA
GTTGATAGAT TTGACGCAAT TCTTCCTCCA GGTACCGGTG CAATTTTAGC CATAGCTTCA
TCGAAACCAA CCGTTGTTGC TAATAGTGAT GGTTCAATAT CTGTTAAAAA AATAATGCAA
GTAAATCTAA CAGCTGATCA TAGAGTGATC TATGGAGCTG ATGGAGCTTC ATTCTTAAAA
GACTTGGCTT CCCTAATTGA AGATGAGCCA GAGACTCTTG TCTCCTAA
 
Protein sequence
MSHEIFMPAL SSTMTEGKIV EWLKNPGDKV ERGESVLVVE SDKADMDVES FQDGYLAAVL 
MPAGSTAPVG ETIGLIVENE DEIASVQEQN KGNQPEVSSS DQLELVSNKT EEKPVVQSEI
VEKQEKEVVL MNEKAASSFN SDQINAATSN VSSRVIASPR AKKLASQMGV DLAKVHGSGP
HGRIQADDIL KANGQPVSIP WIGEGGSPAS IPGANLGVES KPETSGNSFG NPGEIVQFNT
LQKAVNKNME SSLDVPCFRV GYSINTDKLD NFYKKVKQNG VTMTALLVKA VAKTLKKHPQ
VNSSFSENGI SYPENINIAV AVAMEDGGLI TPVLKEPCNT DLFELSREWK DLVKRSRSKQ
LEPDEYSTGT FTLSNLGMFG VDRFDAILPP GTGAILAIAS SKPTVVANSD GSISVKKIMQ
VNLTADHRVI YGADGASFLK DLASLIEDEP ETLVS
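
The sequences above are printed in blocks of ten characters. A minimal sketch for joining the blocks into one string and computing a GC fraction, the kind of value reported in the GC content field, might look like the following. The function names are hypothetical, and only the first line of the gene sequence is used here for brevity.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for display; return uppercase sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First display line of the gene sequence, copied from the record.
first_line = "ATGTCTCACG AAATATTCAT GCCTGCCTTG AGTTCTACTA TGACGGAGGG CAAGATTGTG"
seq = join_blocks(first_line)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.0%}")   # GC content of this first line only
```

Applied to the full gene sequence, `join_blocks` should yield a string of 1368 bases, matching the Gene Length field.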