Gene A9601_04551 details

Gene Information

Locus tag: A9601_04551
Symbol: pdhC
ID: 4717153
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 394884
End bp: 396251
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 38%
IMG OID: 640078167
Product: branched-chain alpha-keto acid dehydrogenase subunit E2
Protein accession: YP_001008850
Protein GI: 123967992
COG category: [C] Energy production and conversion
COG ID: [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes
TIGRFAM ID: [TIGR01349] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.230997
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTCACG AAATATTCAT GCCTGCCTTG AGTTCTACCA TGACGGAGGG CAAGATTGTG 
GAATGGTTGA AAAATCCAGG AGATAAAGTT GCAAGAGGTG AATCTGTCTT GGTTGTTGAA
TCTGACAAGG CAGATATGGA TGTTGAATCT TTTCAAGATG GATACCTTGC AGCTGTTTTA
ATGCCTGCTG GCAGCACTGC ACCAGTAGGA GAGACTATCG GTTTAATTGT AGAAAATGAG
GATGAGATAG CTTCTGTTCA AGAACAAAAT AAAGGAAATC AACCCGAAGT TTCTAGTTCG
GATCAACTTG AATTGGTAAG CAATAAAACT GAAGAAAAAC CTTTGGTTCA AACTGAAATT
GTTGAAAAAC AAGAAAAAGA AGTTGTATTA ATGAGTGAAA AGGCAGCCCC ATCTTCTAAT
AGTGATCAAA TAAATGCTGC TACGAGTAAT GTTTCTTCGA GGGTGATTGC ATCTCCAAGA
GCTAAAAAAC TTGCCTCTCA AATGGGTGTT GATTTAGCAA AGGTTCATGG ATCAGGACCT
CATGGAAGGA TTCAAGCCGA TGATATTTTA AAAGCTAATG GCCAACCAGT CTCTATACCA
TGGATAGGCG AAGGTGGTTC TCCTGCAAGT ATCCCTGGTG TAAATTTGGG GGTTGAAAGT
AAACCAGAAG CTTCAGGAAA TAGTTTTGGT AATCCCGGAG AAACAGTTCA ATTTAATACT
CTTCAAAAAG CGGTAAATAA AAATATGGAA TCTAGTTTAG ATGTGCCATG TTTTAGAGTG
GGTTACTCCA TCAACACAGA TAAATTAGAT AATTTTTATA AAAAAGTAAA ACAGAATGGA
GTGACTATGA CTGCTTTACT AGTTAAAGCA GTTGCTAAGA CACTAAAGAA ACACCCTCAA
GTTAACTCAA GTTTTTCAGA AAATGGAATT TCTTATCCAG AAAATATAAA TATTGCTGTT
GCTGTAGCGA TGGAAGATGG TGGACTAATA ACTCCAGTTT TAAAAGAACC ATGCAATACT
GATTTATTTG AATTGTCTAG GGAATGGAAA GATCTCGTAA AAAGATCAAG ATCAAAACAA
TTAGAACCCG ATGAATACTC AACGGGAACC TTCACTTTAT CTAACCTTGG GATGTTTGGA
GTTGATAGAT TTGACGCAAT TCTTCCTCCA GGTACCGGTG CTATTTTAGC CATAGCATCA
TCGAAACCAA CCGTTGTTGC TAATAGTGAT GGTTCAATAT CTGTTAAAAA AATAATGCAA
GTAAATCTAA CGGCTGATCA TAGAGTGATC TATGGAGCTG ATGGAGCTTC ATTCTTAAAA
GACTTGGCTT CCCTAATTCA AGATGAGCCA GAGACTCTTG TCTCCTAA
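
The gene length, protein length, and GC content fields of this record can be cross-checked against the sequence itself. The following is a minimal sketch, not the database's own procedure: the helper names (`clean_sequence`, `gc_fraction`, `expected_protein_length`) are hypothetical, and it assumes the sequence is pasted in the 10-base blocks shown above and that the CDS ends with a single stop codon.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon (assumed present)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Only the first line of the gene sequence is used here as a short demo input;
    # paste the full sequence to compare against the GC content field.
    first_line = "ATGTCTCACG AAATATTCAT GCCTGCCTTG AGTTCTACCA TGACGGAGGG CAAGATTGTG"
    seq = clean_sequence(first_line)
    print(len(seq))                       # six blocks of ten bases
    print(round(gc_fraction(seq), 3))     # GC fraction of this line only
    print(expected_protein_length(1368))  # 1368 / 3 - 1 = 455, as in the record
```

With the record's values, 1368 bp corresponds to 456 codons, and dropping the stop codon gives the listed 455 aa.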
 
Protein sequence
MSHEIFMPAL SSTMTEGKIV EWLKNPGDKV ARGESVLVVE SDKADMDVES FQDGYLAAVL 
MPAGSTAPVG ETIGLIVENE DEIASVQEQN KGNQPEVSSS DQLELVSNKT EEKPLVQTEI
VEKQEKEVVL MSEKAAPSSN SDQINAATSN VSSRVIASPR AKKLASQMGV DLAKVHGSGP
HGRIQADDIL KANGQPVSIP WIGEGGSPAS IPGVNLGVES KPEASGNSFG NPGETVQFNT
LQKAVNKNME SSLDVPCFRV GYSINTDKLD NFYKKVKQNG VTMTALLVKA VAKTLKKHPQ
VNSSFSENGI SYPENINIAV AVAMEDGGLI TPVLKEPCNT DLFELSREWK DLVKRSRSKQ
LEPDEYSTGT FTLSNLGMFG VDRFDAILPP GTGAILAIAS SKPTVVANSD GSISVKKIMQ
VNLTADHRVI YGADGASFLK DLASLIQDEP ETLVS
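
The protein sequence above can be reproduced from the gene sequence by codon translation. The sketch below is an illustration under stated assumptions, not the pipeline used by the database: `translate` and `CODON_TABLE` are hypothetical names, and the table is built from the standard codon assignments, which translation table 11 shares for elongation (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that case is not handled here).

```python
from itertools import product

# Codon assignments in NCBI order: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace between sequence blocks is ignored; codons containing
    characters other than A, C, G, T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("ATGTCTCACG AAATATTCAT GCCTGCCTTG"))  # MSHEIFMPAL
```

The first ten residues produced this way match the start of the protein sequence listed in the record.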