Gene RPB_4622 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4622 
Symbol 
ID3912439 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp5223342 
End bp5224436 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content69% 
IMG OID637886526 
Productdihydroorotate dehydrogenase 2 
Protein accessionYP_488216 
Protein GI86751720 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01036] dihydroorotate dehydrogenase, subfamily 2 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCCGCG CCTTCGACGC GTTCTCGCTC CCGCTGCTGC GTCTGCTCGA CGCCGAAGAC 
GCCCACCGCC TCGCCATCCA GGGGTTGCGG CTGCTGCCGC AGGTGAAGCC GCGCCCGGAC
GATTCCAAGC TCGCGGTGCG CGCCTTCGGG CTGAACTTCC CCAATCCGGT CGGCATCGCC
GCCGGTTTCG ACAAGAATGC CGAAGCGCCG GATGCGCTGC TGCGGCTCGG CTTCGGCTTC
GTCGAGATCG GCACGGTGAC GCCGAAGCCG CAGGCCGGCA ATCCGCGGCC GCGGTTGTTC
CGGCTGGAGC GCGACGAGGC TATCATCAAC CGGATGGGCT TCAACAATGA CGGCGCCGAG
GCCGTGCTAC GCCGGCTTGC GGCGCGGGCG CAGCAGGGCG GCATCGTCGG CGTCAATGTC
GGCGCCAACA AGGACAGCAC CGATCGCGTC GCCGACTACG TGTCGCTGAT CGAGACCTTT
GCGCCGGTGG CGAGCTATTT CACCGTCAAC GTGTCGTCGC CGAATACGCC GGGCCTGCGC
AATCTGCAGC AGGCGGCGGC GCTCGACGAT CTGCTGGCGC GGGTGATCGA GGCCCGCGAA
CGGGTCCGCG CCAGCGCCGG CGACACTCCT GTGCTGCTGA AGATTGCGCC CGACCTCACG
CTCAGTGAAC TCGACGACGT GGTGCACATC GCCCGCTCGC GCCGGGTCGA CGGCATGATC
GTCGCCAACA CCACGCTGTC GCGCTCCCCG ATGCTGCGCG AACGGACGCG GCTGAACGAG
CAGGGCGGCC TCAGCGGCCG GCCGCTGTTC CGGCTGTCGA CCCGGATGGT GGCGGAGACC
TATGTCCGGG CCGAGGGCGC ATTTCCGCTG ATCGGCGTCG GCGGCATCGA TTCCGGCGGC
GCGGCGCTGA CCAAGATCCG CGCCGGCGCC AGCCTGGTGC AGCTGTATTC GGCGCTGATC
TACAAGGGCC TCGGCCTCGT CGACAGCATC AAGGCCGATC TCGCCTCGAC GCTGCTGCGC
ACCGGGCGTG ACTCGCTTTC CGAAATCGTC GGTGCCGACG CGCCGACCAT CACCGCGGAA
GAGTGGCCGG TGTAA
 
Protein sequence
MIRAFDAFSL PLLRLLDAED AHRLAIQGLR LLPQVKPRPD DSKLAVRAFG LNFPNPVGIA 
AGFDKNAEAP DALLRLGFGF VEIGTVTPKP QAGNPRPRLF RLERDEAIIN RMGFNNDGAE
AVLRRLAARA QQGGIVGVNV GANKDSTDRV ADYVSLIETF APVASYFTVN VSSPNTPGLR
NLQQAAALDD LLARVIEARE RVRASAGDTP VLLKIAPDLT LSELDDVVHI ARSRRVDGMI
VANTTLSRSP MLRERTRLNE QGGLSGRPLF RLSTRMVAET YVRAEGAFPL IGVGGIDSGG
AALTKIRAGA SLVQLYSALI YKGLGLVDSI KADLASTLLR TGRDSLSEIV GADAPTITAE
EWPV