Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2771 |
Symbol | |
ID | 3910564 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3159282 |
End bp | 3160316 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637884671 |
Product | pyruvate dehydrogenase alpha subunit |
Protein accession | YP_486384 |
Protein GI | 86749888 |
COG category | [C] Energy production and conversion |
COG ID | [COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit |
TIGRFAM ID | [TIGR03182] pyruvate dehydrogenase E1 component, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGCAC CCAAGAAGAG CGCCGCGAAA GAGGCGGGGC AGGACAAGGA CCCTGCACCG AACAAGCCGC GGGTCCCGGA TTTCTCGAAG GAACAGGAGC TGCGCGCGTT TCGCGACATG CTGCTGATCC GCCGCTTCGA AGAGAAAGCC GGCCAGCTCT ACGGCATGGG TGCGATCGGC GGATTCTGCC ATCTCTATAT CGGCCAGGAA GCCGTCGTCG TCGGCATGCA GATGGCGCTG CGCGAGGGCG ATCAGGTCAT CACTGGCTAT CGCGACCACG GCCACATGCT CGCCTGCGAC ATGGACGCCA AGGGCGTGAT GGCCGAGCTG ACCGGCCGCC GCGGGGGCTA CTCCAAGGGC AAGGGCGGCT CCATGCATAT GTTCAGCATG GAGAAGCACT TCTACGGCGG CCACGGCATC GTCGGCGCGC AGGTCTCGCT CGGCACCGGC ATCGCCTTCG CCAACCGCTA TCGCGACAAT GGCAGCGTCT GCCTGGCCTA TTTCGGCGAC GGCGCCTCCA ATCAGGGGCA GGTCTACGAG AGCTTCAACA TGGCGGAGCT GTGGAAGCTC CCCGTGGTCT ACGTCATCGA GAACAACCGC TACGCGATGG GCACGTCGGT GACGCGTTCG TCGGCGCAGA CCGACTTCTC CAAGCGCGGC ATCTCGTTCA ACATTCCCGG CGAGCAGGTC GACGGCATGG ACGTCCGCGC GGTCAAGGCC GCGGGCGACA AGGCGGTGGC GCATTGCCGC GCCGGCAACG GCCCCTACAT CCTGGAGATG CAGACCTATC GCTATCGTGG CCACTCGATG TCGGACCCGG CGAAGTACCG GACCCGCGAG GAGGTCGACA AGATCCGCAA CGATCAGGAC CCGATCGAGC AGGTGCGGCA GCGTCTGCTC GGGCAGGACA TGACCGAGGA CGATCTGAAG AAGATCGACG CCGAGATCCG CAAGATCGTC AACGAGGCGG CCGATTTCGC GCAGAACGAT CCCGAGCCAG ATCCCGCCGA ACTCTACACC GATGTGTATC GCTGA
|
Protein sequence | MAAPKKSAAK EAGQDKDPAP NKPRVPDFSK EQELRAFRDM LLIRRFEEKA GQLYGMGAIG GFCHLYIGQE AVVVGMQMAL REGDQVITGY RDHGHMLACD MDAKGVMAEL TGRRGGYSKG KGGSMHMFSM EKHFYGGHGI VGAQVSLGTG IAFANRYRDN GSVCLAYFGD GASNQGQVYE SFNMAELWKL PVVYVIENNR YAMGTSVTRS SAQTDFSKRG ISFNIPGEQV DGMDVRAVKA AGDKAVAHCR AGNGPYILEM QTYRYRGHSM SDPAKYRTRE EVDKIRNDQD PIEQVRQRLL GQDMTEDDLK KIDAEIRKIV NEAADFAQND PEPDPAELYT DVYR
|
| |