Gene RPD_0359 details

Gene Information

Locus tag: RPD_0359
Symbol:
ID: 4020825
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 424666
End bp: 425946
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 69%
IMG OID: 637960544
Product: pseudouridine synthase RluD
Protein accession: YP_567498
Protein GI: 91974839
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0564] Pseudouridylate synthases, 23S RNA-specific
TIGRFAM ID: [TIGR00005] pseudouridine synthase, RluA family
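
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a gene length that includes one terminal stop codon; neither convention is stated in the record. The function names are illustrative, not part of any database API.

```python
# Values copied from the record above.
START_BP = 424666
END_BP = 425946


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1281, matching "Gene Length"
print(protein_length_aa(length))  # 426, matching "Protein Length"
```

Under these assumptions, both derived values agree with the gene length and protein length reported in the record.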


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000104546
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGTCGCCG GCGGTGGCGG TGGCGACGAC GCGCGCCCCC TTCGGCAGCC GCAGCGTGGT 
CTCGGCGGCG ACCGGAACGC TTCCCTCGGA GCGGAAAAGC TTATAGCCGA TGGCGATGAG
AACCGCCGCG AGCGCCAGCG CCGTGGTCAG GCCCGCGATC ATCATCATCC GCCGCACCCG
CGCAAACAGC GCGGCCTGTT CGGGGGTCGG TTCGGGCAGG GCGGTTTCGG TCATGCAATG
CTCGGTTTTG GCTCGGTTTT TTTGGAAGGC TCGATCGTGA ACGATTCGTC GGGACATCTG
CTGGAAGTGG TCGTTGCCGG CGACGAGGGG TCGCCGCGGC TCGACCGGGT ACTGGCGACG
CGGTGCCCGG CTCTGTCGCG GTCGCGGCTG AAGGCCCTGA TCCTCGACGG TCGCGTCGCG
ATTCGCGGCG CCCCGGTCCG CGACCCCGCT TATCACGCCG CCTCGGGGGA GACGATCACA
ATCGACGTGC CGCCGCCGGT GGCGCCGGAG CCGGCCGGCG AGGCGATCGC GCTCGAGATC
GTCCACGAGG ACGACGACAT CATCGTCATC GATAAGCCGC GCGGCCTCGT GGTGCATCCC
GCCGCCGGCC ACGAGACCGG CACCCTGGTC AACGCCTTGA TCGCGCATTG CGGCGAATCG
CTGTCCGGGA TCGGCGGGGT GCGGCGGCCG GGGATCGTCC ACCGGCTCGA CAAGGACACC
ACCGGGCTGA TGGTCGCGGC AAAGAATGAT CGAGCCCACC AATCCTTGAG CGCGCAATTC
GCCGACCATG GCCGCACCGG AGAGCTGCGC CGCGGCTATT ACGCCTTTGT CTGGGGGGCG
CCGAACCGCA TCCGCGGCAC CATCGACGCG CCGATCGACC GGCATCCCCA TGCCCGCGAA
AAGATGGCGG TGCGTGACGG CGGACGCGAG GCGATCACCC ATTGGGAGGT GCTGGAGACC
TTCACGGGCC GCAGCGGCGG CGAGATCGTG TCGCTGATCG CCTGCCAGCT CGAGACCGGC
CGCACCCACC AGATCCGGGT GCATCTCGCC CATATCGGCC ACCCGCTGCT CGGCGACGAC
GTCTATGGCC CGCATTTCAA AACCAAGGCC AGCCAGCTCC GCCCGGACGC CCGCGCCGCG
CTGACGGATC TGGGCCGGCA GGCGCTGCAT GCCTATCTGC TGGTGCTCGA GCACCCCTCC
ACCGGGGAAG TCGTCGCGTG GGAATCCGGC CTGCCGGCCG ATCTGAAGCG CCTGAAAGCC
GCCCTGACGG CGACGGAATG A
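
The GC content reported for this gene can be recomputed from the sequence listing. The following is a minimal sketch; `gc_percent` is a hypothetical helper name, and it assumes the sequence contains only A, C, G and T once the spaces and line breaks of the 10-base blocks are removed.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring the whitespace used to block the sequence."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Small illustrative inputs (not the full gene): 2 of 4 bases are G or C.
print(gc_percent("AT GC"))  # 50.0
```

Passing the full gene sequence as one string, with or without the block spacing, gives the value to compare against the rounded percentage in the record.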
 
Protein sequence
MVAGGGGGDD ARPLRQPQRG LGGDRNASLG AEKLIADGDE NRRERQRRGQ ARDHHHPPHP 
RKQRGLFGGR FGQGGFGHAM LGFGSVFLEG SIVNDSSGHL LEVVVAGDEG SPRLDRVLAT
RCPALSRSRL KALILDGRVA IRGAPVRDPA YHAASGETIT IDVPPPVAPE PAGEAIALEI
VHEDDDIIVI DKPRGLVVHP AAGHETGTLV NALIAHCGES LSGIGGVRRP GIVHRLDKDT
TGLMVAAKND RAHQSLSAQF ADHGRTGELR RGYYAFVWGA PNRIRGTIDA PIDRHPHARE
KMAVRDGGRE AITHWEVLET FTGRSGGEIV SLIACQLETG RTHQIRVHLA HIGHPLLGDD
VYGPHFKTKA SQLRPDARAA LTDLGRQALH AYLLVLEHPS TGEVVAWESG LPADLKRLKA
ALTATE
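
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand instead of relying on a library; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons. `translate_cds` is a hypothetical helper name, and the input is assumed to be in frame and to contain only A, C, G and T.

```python
from itertools import product

# Amino acids in NCBI order for codons enumerated over T, C, A, G
# (third base varies fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spacing between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence listed above.
print(translate_cds("ATGGTCGCCG GCGGTGGCGG TGGCGACGAC"))  # MVAGGGGGDD
print(translate_cds("GCCCTGACGG CGACGGAATG A"))           # ALTATE
```

Both fragments match the corresponding ends of the listed protein sequence, and the final TGA codon is the stop that is not counted in the protein length.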