Gene Information |
Locus tag | NATL1_04561 |
Symbol | pdhC |
ID | 4779657 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 417220 |
End bp | 418590 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640083733 |
Product | branched-chain alpha-keto acid dehydrogenase subunit E2 |
Protein accession | YP_001014285 |
Protein GI | 124025169 |
COG category | [C] Energy production and conversion |
COG ID | [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes |
TIGRFAM ID | [TIGR01349] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form |
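
The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes 1-based inclusive coordinates (the convention this record appears to follow) and a CDS that ends in a single stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS that ends in exactly one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
print(gene_length(417220, 418590))  # 1371
print(protein_length(1371))         # 456
```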
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.852087 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACTC ATGACATTTT TATGCCTGCT TTAAGTTCAA CGATGACTGA GGGCAAAATA GTTGAGTGGC TAAAAAAACC TGGAGATAAA GTTGAAAGAG GTGAGTCAGT TTTAGTTGTT GAGTCAGATA AAGCTGATAT GGATGTTGAG TCTTTCCAAG ATGGCTTTCT TGCTTCGATT GTGATGCCTG CAGGCAGCTC TGCACCTGTT GGAGAGACAA TCGGATTGAT CGTTGAGACA GAAGATGAGA TCGCTGCGGC TCAAGCAAAT TCGCCTTCTC CTTCCCCTCA ATCAGGAAGT CAAGAAAAAG ATAGTTCATC GCCTCAAGTT CAAGAAAAAC AAGCCTCTGT CGACTCTCCT AAAGCAACAG TAGTTACAAA GGCATCTCCT GCACCTCTTG TTTCAGAATC CTCTGTAAAT CAGGATCAGT TTTTAAATGA TGGTCGAATA GTCGCTTCCC CAAGAGCTAA AAAACTTGCT TCTCAAATGG GAGTGGATTT GGCAACTGTT AGGGGCTCAG GACCTCATGG ACGAATACAA GCTGAGGATG TTCAAAGTGC AAAAGGGCAA CCCATAAGCG TTCCTTGGAT TGCAGAAAGT AATGCTCCGG CGAAAATAGT TTCTGATGTG CCTCGCGTAG AAAAAAAATC TGTTGACGCT GGTAAGCCAC CTGCTCCAGG GAAAAGTTTT GGATCTAGAG GGGAAACAAT TGCATTTAAT ACTCTTCAAC AAGCTGTAAA TCGGAACATG GAGGAAAGTT TAAATACTCC TTGTTTCAGA GTCGGATATT CAATTCTTAC TGATGAATTG GATGATCTTT ATAAACAAGT TAAACCTGAT GGAGTAACTA TGACTGCTTT ACTTGCTAAA GCAGTTGGCT TAACGCTGGC TAGACACCCC CAGGTGAATG CAGCTTTTAG TTCTGAGGGG ATTGCCTATC CTTCACAAAT AAATGTAGCC GTTGCAGTTG CGATGGAGGA CGGAGGGTTG ATAACTCCAG TGCTGCAAAA TGCTGATAAG ACGAGCCTTA CTGATTTATC CCTACAATGG GCTGATCTTG TTAAGCGAGC TAGGAATAAG CAATTAGAAC CGCAAGAATA TAGCAGTGGA ACGTTTACAC TCTCGAATCT AGGTATGTTT GGAGTGGATC GTTTTGATGC AATTCTGCCC CCAGGGACTG GAGCAATTTT AGCGGTAGGA GCTTCATTGT CTAAAGTTGT TGCTTCTAAA GATGGTTCGA TTTCAATCAA AAAACAAATG CAAGTAAATC TTACCGCTGA TCACAGAGTG ATATATGGGG CTGATGGAGC ACTATTCCTC AAGGATTTGG CATACTTAAT TGAAAAGAAC CCTTATAGCC TCTCGTCTTG A
|
Protein sequence | MATHDIFMPA LSSTMTEGKI VEWLKKPGDK VERGESVLVV ESDKADMDVE SFQDGFLASI VMPAGSSAPV GETIGLIVET EDEIAAAQAN SPSPSPQSGS QEKDSSSPQV QEKQASVDSP KATVVTKASP APLVSESSVN QDQFLNDGRI VASPRAKKLA SQMGVDLATV RGSGPHGRIQ AEDVQSAKGQ PISVPWIAES NAPAKIVSDV PRVEKKSVDA GKPPAPGKSF GSRGETIAFN TLQQAVNRNM EESLNTPCFR VGYSILTDEL DDLYKQVKPD GVTMTALLAK AVGLTLARHP QVNAAFSSEG IAYPSQINVA VAVAMEDGGL ITPVLQNADK TSLTDLSLQW ADLVKRARNK QLEPQEYSSG TFTLSNLGMF GVDRFDAILP PGTGAILAVG ASLSKVVASK DGSISIKKQM QVNLTADHRV IYGADGALFL KDLAYLIEKN PYSLSS
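
The relationship between the two sequence rows can be sketched in plain Python: the gene sequence is read in codons and translated, and GC content is the share of G and C bases. This is a minimal sketch, not the database's own pipeline. It assumes the sequences are pasted in with their 10-character spacing, uses the codon assignments of NCBI translation table 11 (identical to the standard table for elongation), and does not special-case alternative start codons. All names are hypothetical.

```python
from itertools import product

# Codon assignments for NCBI translation table 11, listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First two 10-base groups and the final groups of the gene sequence above.
print(translate("ATGGCCACTC ATGACATTTT"))    # MATHDI
print(translate("CCTTATAGCC TCTCGTCTTG A"))  # PYSLSS
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared against the Protein sequence and GC content fields of the record.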