Gene RPB_0203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0203 
Symbol 
ID3909444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp226211 
End bp227941 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content65% 
IMG OID637882084 
Productpyruvate dehydrogenase 
Protein accessionYP_483825 
Protein GI86747329 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0234842 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAATG TTGCTGATCG AATTGTCGAA ACCCTGCATC AAGCCGGCGT CGAGCGGATC 
TTCGGCGTGG TCGGAGACAG CCTCAACGGA CTGACGGAAG CGTTGCGCCA GCACGGCAGC
ATCGAATGGG TGCACGTTCG CCACGAAGAA GTCGCTGCTT TCGCGGCCGC CGGGGAATCG
CAGATCACCG GCAAGCTCGC GGTTTGCGCC GGATCCTGCG GTCCGGGAAA CCTGCATCTG
ATCAATGGCC TGTATGATGC ACAGCGGACC CGGACTCCGG TGCTGGCCAT CGCGGCGCAG
ATTCCATCGG CGGAGATCGG CGGCGGCTAC TTCCAGGAAA CGCACCCGCA GAATCTGTTT
CGCGAATGCA GCGTCTATTG CGAGCTGGTG TCCGATCCGC ATCAGTTCGA CTACGTCCTC
GAAAACGCGA TCCGGGCCGC GGTCGGTCAG CGGGGTGTCG CCGTCGTCGT GATTCCTGGC
GATGTGGCGC TGCGCGAGGC TTCGACACGT GGGGTGACGC CCGTTGCCGG CCTGCTGCCG
CCGACGCCGA TCGTGACGCC GGCCGAGCCG CAGCTCGACG CGCTGGCGGC GCTGCTGAAC
GGCGCCGGGC GGGTCACGCT GTTCTGCGGC CGCGGCTGTG CAGGTGCCCA CGCGCCGTTG
ATGGCGCTGG CCGAAGCTCT GAAGAGCCCG ATCGTCCACG CGCTGGGCGG CAAGGAACAC
GTCGAATACG ACAATCCCTA CGACGTCGGG ATGACCGGCT TCATCGGCTA CGCGTCGGGC
TACGAGGCGA TGCATGCCTG CGATGTGCTG TTGATGCTCG GCACCGACTT TCCCTACAAG
CAGTTCCTGC CGACCGGCGC GCGAATCGCG CAGGTCGACA TTCGCGCCGA GAATCTCGGT
CGCCGTTGCA AGCTCGCGCT TGGCCTGGTC GGCGGTGTCC ACGAAACCAT CGAGGCCTTG
CTGCCGAAAT TGACGACCAA GACCGACCGC GGGCACCTCG ATCACAGCGT GGCACGCTAC
GTCGCATCCC GGCAGGGCCT CGACGATCTG GCGAAGGGCA CGCCCGGCCG CAAGCCGATC
CATCCGCAGT ATCTCGCCAA GCTGATCAGC GACGGCGCCG CCGACGATGC GGTGTTCAGC
TTCGACGTCG GAACACCGAC GATCTGGGCC GCCCGCTATC TGAAAATGAA CGGAAGCCGG
CGTCTGGTCG GCTCTCTGGT GCACGGTTCG ATGGCCAACG CGCTGCCGCA TGCCATCGGC
GTGCAGGCCG CCCAGCCGAG CCGGCAGGTG ATCTCGCTGT CGGGGGATGG TGGCTTCACC
ATGCTGATGG GAGACCTCAT CACGCTCACG CAGATGAAAT TGCCGGTCAA GGTCGTCATT
TTCAACAATG GCGTACTCGG CTTCGTGGCG CTCGAGATGA AGGCGGCGGG ATTTGTCGAG
TTGGGCACCG ATCTACAGAA TCCCGATTTC GCCGCCATGG CGCGTGCGAT GGGCATCCAT
GGCGTGCGGG TTGAGGATCC CGGCGATCTG CCGGCGGCGG TGGCCGACGT GCTGGCTCAT
GATGGCCCAG CCGTGCTCGA CGTCGTCACC GCGACCCAGG AGCTGTCGAT GCCGCCCACC
ATCGGCGCCG AACAGGTCAA GGGCTTCAGT CTCTGGCTGC TCCGCGCGGT GATGAGCGGC
CGCGGTGACG AAGTGATTGA TCTCGCGAAG CAGAACCTGC TGCCCCGGTA G
 
Protein sequence
MSNVADRIVE TLHQAGVERI FGVVGDSLNG LTEALRQHGS IEWVHVRHEE VAAFAAAGES 
QITGKLAVCA GSCGPGNLHL INGLYDAQRT RTPVLAIAAQ IPSAEIGGGY FQETHPQNLF
RECSVYCELV SDPHQFDYVL ENAIRAAVGQ RGVAVVVIPG DVALREASTR GVTPVAGLLP
PTPIVTPAEP QLDALAALLN GAGRVTLFCG RGCAGAHAPL MALAEALKSP IVHALGGKEH
VEYDNPYDVG MTGFIGYASG YEAMHACDVL LMLGTDFPYK QFLPTGARIA QVDIRAENLG
RRCKLALGLV GGVHETIEAL LPKLTTKTDR GHLDHSVARY VASRQGLDDL AKGTPGRKPI
HPQYLAKLIS DGAADDAVFS FDVGTPTIWA ARYLKMNGSR RLVGSLVHGS MANALPHAIG
VQAAQPSRQV ISLSGDGGFT MLMGDLITLT QMKLPVKVVI FNNGVLGFVA LEMKAAGFVE
LGTDLQNPDF AAMARAMGIH GVRVEDPGDL PAAVADVLAH DGPAVLDVVT ATQELSMPPT
IGAEQVKGFS LWLLRAVMSG RGDEVIDLAK QNLLPR