Gene RPD_0788 details

Gene Information

Locus tag: RPD_0788
Symbol:
ID: 4021262
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 884649
End bp: 885743
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 69%
IMG OID: 637960978
Product: dihydroorotate dehydrogenase 2
Protein accession: YP_567927
Protein GI: 91975268
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0167] Dihydroorotate dehydrogenase
TIGRFAM ID: [TIGR01036] dihydroorotate dehydrogenase, subfamily 2
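
The start, end, and length values in this record agree with one another under a simple convention. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon (neither is stated on this page). The function names are illustrative, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of residues for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(884649, 885743)
print(length)                     # 1095
print(protein_length_aa(length))  # 364
```

Both printed values match the Gene Length and Protein Length fields of the record.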


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.564423
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATCCGCG CCTTCGACGC CTTCTCGCTG CCGCTGCTGC GCCTGTTCGA CGCCGAGGAC 
GCGCACCGGC TCGCGATCCA GGGATTGCGG CTGCTGCCGC AGGTGAAGCC GCGCCCGGAC
GATCCCAAGC TCGCGGTGCG CGCTTTCGGG CTGAATTTTC CCAACCCGGT CGGTATCGCG
GCCGGCTTCG ACAAGAACGC CGAGGCGCCG GATGCGCTGA TGCGGCTCGG CTTCGGCTTC
GTCGAAATCG GCACCGTGAC GCCGAAACCG CAGGCCGGCA ATCCGCGGCC GCGACTGTTC
CGGCTGGAGC GCGACGAGGC CGTCATCAAC CGGATGGGCT TCAACAATGA CGGCAGCGAA
GCGGTGCTGC GGCGGCTCGC GGCGCGGGCG CAGCAAGGCG GAATCCTCGG CGTCAATGTC
GGCGCCAACA AGGACAGCTC GGACCGCGTC GCCGACTACG TCGCGCTGAT CGAGACCTTC
GCTCCGGTGG CGAGCTACTT CACCGTCAAC GTCTCGTCGC CGAACACGCC GGGCTTGCGC
AATCTGCAGC AGGCGGCGGC GCTCGACGAT CTGCTGGCGC GGGTGATCGA AGCCCGCGAG
CGTGTACGCC CCAGCGCTGG AGATACTCCG GTGCTGCTGA AGATCGCGCC CGATCTCACG
CTCGGCGAAC TCGACGACGT CGTGCACATC GCCCGTTCGC GAAAGGTCGA CGGCATGATC
GTCGCCAACA CCACGCTGTC GCGCTCGCCG CTGCTGCGCG AGCGAACGCG GATGAACGAG
CAGGGCGGCC TCAGCGGCCG GCCGCTATTC CGGCTGTCGA CGAGGATGGT GGCGGAGACC
TATGTCCGCG CCGAGGGCGC ATTCCCGCTG ATCGGGGTCG GCGGCATCGA CTCCGGCGGC
GCCGCGCTGA CCAAGATCCG CGCCGGCGCC AGCCTCGTGC AGCTTTACTC GGCGCTGATC
TACAAGGGCC TCGGCCTGGT CGAGAGCATC AAGACCGATC TCGCCTCGAC GCTGCTACGC
ACCGGCCGGG ATTCGCTGGC CGAAATCGTG GGCGCCGATG CGCCGACCAT CACCGCCGAA
GAGTGGCCGG TGTGA
 
Protein sequence
MIRAFDAFSL PLLRLFDAED AHRLAIQGLR LLPQVKPRPD DPKLAVRAFG LNFPNPVGIA 
AGFDKNAEAP DALMRLGFGF VEIGTVTPKP QAGNPRPRLF RLERDEAVIN RMGFNNDGSE
AVLRRLAARA QQGGILGVNV GANKDSSDRV ADYVALIETF APVASYFTVN VSSPNTPGLR
NLQQAAALDD LLARVIEARE RVRPSAGDTP VLLKIAPDLT LGELDDVVHI ARSRKVDGMI
VANTTLSRSP LLRERTRMNE QGGLSGRPLF RLSTRMVAET YVRAEGAFPL IGVGGIDSGG
AALTKIRAGA SLVQLYSALI YKGLGLVESI KTDLASTLLR TGRDSLAEIV GADAPTITAE
EWPV
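
The gene sequence begins with GTG while the protein begins with M. A short Python sketch of how the first ten codons map to the first ten residues is given below. It assumes that the initiator codon is read as methionine regardless of its usual meaning (GTG normally encodes valine), which is the usual treatment of bacterial start codons under translation table 11. The codon table is deliberately partial and covers only the codons in this example; the helper names are hypothetical.

```python
# Standard-code amino acids for only the codons that occur in this example.
CODON_TO_AA = {
    "GTG": "V", "ATC": "I", "CGC": "R", "GCC": "A",
    "TTC": "F", "GAC": "D", "TCG": "S", "CTG": "L",
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(text.split()).upper()


def translate_prefix(dna: str, n_codons: int) -> str:
    """Translate the first n_codons codons, reading the first one as Met (assumption)."""
    if len(dna) < 3 * n_codons:
        raise ValueError("sequence is shorter than the requested number of codons")
    codons = [dna[i:i + 3] for i in range(0, 3 * n_codons, 3)]
    residues = [CODON_TO_AA[codon] for codon in codons]
    residues[0] = "M"  # initiator codon is translated as methionine
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "GTGATCCGCG CCTTCGACGC CTTCTCGCTG"
print(translate_prefix(clean_sequence(first_blocks), 10))  # MIRAFDAFSL
```

The result matches the first block of the protein sequence above.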