Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0203 |
Symbol | |
ID | 3909444 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 226211 |
End bp | 227941 |
Gene Length | 1731 bp |
Protein Length | 576 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637882084 |
Product | pyruvate dehydrogenase |
Protein accession | YP_483825 |
Protein GI | 86747329 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0234842 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAATG TTGCTGATCG AATTGTCGAA ACCCTGCATC AAGCCGGCGT CGAGCGGATC TTCGGCGTGG TCGGAGACAG CCTCAACGGA CTGACGGAAG CGTTGCGCCA GCACGGCAGC ATCGAATGGG TGCACGTTCG CCACGAAGAA GTCGCTGCTT TCGCGGCCGC CGGGGAATCG CAGATCACCG GCAAGCTCGC GGTTTGCGCC GGATCCTGCG GTCCGGGAAA CCTGCATCTG ATCAATGGCC TGTATGATGC ACAGCGGACC CGGACTCCGG TGCTGGCCAT CGCGGCGCAG ATTCCATCGG CGGAGATCGG CGGCGGCTAC TTCCAGGAAA CGCACCCGCA GAATCTGTTT CGCGAATGCA GCGTCTATTG CGAGCTGGTG TCCGATCCGC ATCAGTTCGA CTACGTCCTC GAAAACGCGA TCCGGGCCGC GGTCGGTCAG CGGGGTGTCG CCGTCGTCGT GATTCCTGGC GATGTGGCGC TGCGCGAGGC TTCGACACGT GGGGTGACGC CCGTTGCCGG CCTGCTGCCG CCGACGCCGA TCGTGACGCC GGCCGAGCCG CAGCTCGACG CGCTGGCGGC GCTGCTGAAC GGCGCCGGGC GGGTCACGCT GTTCTGCGGC CGCGGCTGTG CAGGTGCCCA CGCGCCGTTG ATGGCGCTGG CCGAAGCTCT GAAGAGCCCG ATCGTCCACG CGCTGGGCGG CAAGGAACAC GTCGAATACG ACAATCCCTA CGACGTCGGG ATGACCGGCT TCATCGGCTA CGCGTCGGGC TACGAGGCGA TGCATGCCTG CGATGTGCTG TTGATGCTCG GCACCGACTT TCCCTACAAG CAGTTCCTGC CGACCGGCGC GCGAATCGCG CAGGTCGACA TTCGCGCCGA GAATCTCGGT CGCCGTTGCA AGCTCGCGCT TGGCCTGGTC GGCGGTGTCC ACGAAACCAT CGAGGCCTTG CTGCCGAAAT TGACGACCAA GACCGACCGC GGGCACCTCG ATCACAGCGT GGCACGCTAC GTCGCATCCC GGCAGGGCCT CGACGATCTG GCGAAGGGCA CGCCCGGCCG CAAGCCGATC CATCCGCAGT ATCTCGCCAA GCTGATCAGC GACGGCGCCG CCGACGATGC GGTGTTCAGC TTCGACGTCG GAACACCGAC GATCTGGGCC GCCCGCTATC TGAAAATGAA CGGAAGCCGG CGTCTGGTCG GCTCTCTGGT GCACGGTTCG ATGGCCAACG CGCTGCCGCA TGCCATCGGC GTGCAGGCCG CCCAGCCGAG CCGGCAGGTG ATCTCGCTGT CGGGGGATGG TGGCTTCACC ATGCTGATGG GAGACCTCAT CACGCTCACG CAGATGAAAT TGCCGGTCAA GGTCGTCATT TTCAACAATG GCGTACTCGG CTTCGTGGCG CTCGAGATGA AGGCGGCGGG ATTTGTCGAG TTGGGCACCG ATCTACAGAA TCCCGATTTC GCCGCCATGG CGCGTGCGAT GGGCATCCAT GGCGTGCGGG TTGAGGATCC CGGCGATCTG CCGGCGGCGG TGGCCGACGT GCTGGCTCAT GATGGCCCAG CCGTGCTCGA CGTCGTCACC GCGACCCAGG AGCTGTCGAT GCCGCCCACC ATCGGCGCCG AACAGGTCAA GGGCTTCAGT CTCTGGCTGC TCCGCGCGGT GATGAGCGGC CGCGGTGACG AAGTGATTGA TCTCGCGAAG CAGAACCTGC TGCCCCGGTA G
|
Protein sequence | MSNVADRIVE TLHQAGVERI FGVVGDSLNG LTEALRQHGS IEWVHVRHEE VAAFAAAGES QITGKLAVCA GSCGPGNLHL INGLYDAQRT RTPVLAIAAQ IPSAEIGGGY FQETHPQNLF RECSVYCELV SDPHQFDYVL ENAIRAAVGQ RGVAVVVIPG DVALREASTR GVTPVAGLLP PTPIVTPAEP QLDALAALLN GAGRVTLFCG RGCAGAHAPL MALAEALKSP IVHALGGKEH VEYDNPYDVG MTGFIGYASG YEAMHACDVL LMLGTDFPYK QFLPTGARIA QVDIRAENLG RRCKLALGLV GGVHETIEAL LPKLTTKTDR GHLDHSVARY VASRQGLDDL AKGTPGRKPI HPQYLAKLIS DGAADDAVFS FDVGTPTIWA ARYLKMNGSR RLVGSLVHGS MANALPHAIG VQAAQPSRQV ISLSGDGGFT MLMGDLITLT QMKLPVKVVI FNNGVLGFVA LEMKAAGFVE LGTDLQNPDF AAMARAMGIH GVRVEDPGDL PAAVADVLAH DGPAVLDVVT ATQELSMPPT IGAEQVKGFS LWLLRAVMSG RGDEVIDLAK QNLLPR
|
| |