Gene Information |
Locus tag | A9601_04551 |
Symbol | pdhC |
ID | 4717153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 394884 |
End bp | 396251 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640078167 |
Product | branched-chain alpha-keto acid dehydrogenase subunit E2 |
Protein accession | YP_001008850 |
Protein GI | 123967992 |
COG category | [C] Energy production and conversion |
COG ID | [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes |
TIGRFAM ID | [TIGR01349] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form |
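The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(394884, 396251)
print(length)                     # 1368
print(protein_length_aa(length))  # 455
```

Under these assumptions the computed values agree with the Gene Length (1368 bp) and Protein Length (455 aa) fields.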
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.230997 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCACG AAATATTCAT GCCTGCCTTG AGTTCTACCA TGACGGAGGG CAAGATTGTG GAATGGTTGA AAAATCCAGG AGATAAAGTT GCAAGAGGTG AATCTGTCTT GGTTGTTGAA TCTGACAAGG CAGATATGGA TGTTGAATCT TTTCAAGATG GATACCTTGC AGCTGTTTTA ATGCCTGCTG GCAGCACTGC ACCAGTAGGA GAGACTATCG GTTTAATTGT AGAAAATGAG GATGAGATAG CTTCTGTTCA AGAACAAAAT AAAGGAAATC AACCCGAAGT TTCTAGTTCG GATCAACTTG AATTGGTAAG CAATAAAACT GAAGAAAAAC CTTTGGTTCA AACTGAAATT GTTGAAAAAC AAGAAAAAGA AGTTGTATTA ATGAGTGAAA AGGCAGCCCC ATCTTCTAAT AGTGATCAAA TAAATGCTGC TACGAGTAAT GTTTCTTCGA GGGTGATTGC ATCTCCAAGA GCTAAAAAAC TTGCCTCTCA AATGGGTGTT GATTTAGCAA AGGTTCATGG ATCAGGACCT CATGGAAGGA TTCAAGCCGA TGATATTTTA AAAGCTAATG GCCAACCAGT CTCTATACCA TGGATAGGCG AAGGTGGTTC TCCTGCAAGT ATCCCTGGTG TAAATTTGGG GGTTGAAAGT AAACCAGAAG CTTCAGGAAA TAGTTTTGGT AATCCCGGAG AAACAGTTCA ATTTAATACT CTTCAAAAAG CGGTAAATAA AAATATGGAA TCTAGTTTAG ATGTGCCATG TTTTAGAGTG GGTTACTCCA TCAACACAGA TAAATTAGAT AATTTTTATA AAAAAGTAAA ACAGAATGGA GTGACTATGA CTGCTTTACT AGTTAAAGCA GTTGCTAAGA CACTAAAGAA ACACCCTCAA GTTAACTCAA GTTTTTCAGA AAATGGAATT TCTTATCCAG AAAATATAAA TATTGCTGTT GCTGTAGCGA TGGAAGATGG TGGACTAATA ACTCCAGTTT TAAAAGAACC ATGCAATACT GATTTATTTG AATTGTCTAG GGAATGGAAA GATCTCGTAA AAAGATCAAG ATCAAAACAA TTAGAACCCG ATGAATACTC AACGGGAACC TTCACTTTAT CTAACCTTGG GATGTTTGGA GTTGATAGAT TTGACGCAAT TCTTCCTCCA GGTACCGGTG CTATTTTAGC CATAGCATCA TCGAAACCAA CCGTTGTTGC TAATAGTGAT GGTTCAATAT CTGTTAAAAA AATAATGCAA GTAAATCTAA CGGCTGATCA TAGAGTGATC TATGGAGCTG ATGGAGCTTC ATTCTTAAAA GACTTGGCTT CCCTAATTCA AGATGAGCCA GAGACTCTTG TCTCCTAA
|
Protein sequence | MSHEIFMPAL SSTMTEGKIV EWLKNPGDKV ARGESVLVVE SDKADMDVES FQDGYLAAVL MPAGSTAPVG ETIGLIVENE DEIASVQEQN KGNQPEVSSS DQLELVSNKT EEKPLVQTEI VEKQEKEVVL MSEKAAPSSN SDQINAATSN VSSRVIASPR AKKLASQMGV DLAKVHGSGP HGRIQADDIL KANGQPVSIP WIGEGGSPAS IPGVNLGVES KPEASGNSFG NPGETVQFNT LQKAVNKNME SSLDVPCFRV GYSINTDKLD NFYKKVKQNG VTMTALLVKA VAKTLKKHPQ VNSSFSENGI SYPENINIAV AVAMEDGGLI TPVLKEPCNT DLFELSREWK DLVKRSRSKQ LEPDEYSTGT FTLSNLGMFG VDRFDAILPP GTGAILAIAS SKPTVVANSD GSISVKKIMQ VNLTADHRVI YGADGASFLK DLASLIQDEP ETLVS
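The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a field might be joined into a contiguous string and its GC fraction computed is shown below. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence field, as displayed in the record.
fragment = clean_sequence("ATGTCTCACG AAATATTCAT")
print(fragment)               # ATGTCTCACGAAATATTCAT
print(gc_fraction(fragment))  # 0.3
```

Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content field of the record.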
|
| |