Gene RPB_2335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2335 
Symbol 
ID3908966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2683716 
End bp2685077 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content71% 
IMG OID637884232 
Productpseudouridine synthase RluD 
Protein accessionYP_485951 
Protein GI86749455 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.134936 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGTC GTACCAAGAA ACCCCTCGCC AAGCGCGGCC CCGGCGGCGA TCGCCCCAAA 
GCCGGCCGGC CCGCCGCGAA GCGGGGGCCG GCTGCCAGCG GGCCGGTGCG CGGCAATCGC
GGCGAGGACG GCCGCCCGAT CTCGGCGCGG CCGCAGCGCG AGCCGCGTGG CGAGCGCGCC
GAGCGGCCTG CCACGAAGCG TCCCGCCGCC GAGCGACCCG AACGTCCCGC GTCCGATCGC
CCGCAGAGCG CGCGTCCCGG CCGTCCGCCG GCGCGGTTCG GCGCGTCATC CGCCACGCGG
CCGCCGCGTG CGCAGCGTCC CGAGGCCGCA ATCGAGCCCG AGAAGGCCGC CAAGCCCGAC
GCGCCGCTGC CGACCAAGGT CGAGACCGTG GTCGTCACCG CCGACGAGAA CGGCATGCGG
GTCGATCGCT TCCTCGAGGG GCGGTTTCCG GGTCTGTCAT TCTCCCACAT CCAGCGCATC
GTCCGCAAAG GCGAGCTGCG GGTGAACGGC AAGCGCGCCG ATTCCAAGGA CCGGCTGGAG
GAGGGCCAGA GCGTGCGGAT TCCGCCGCTC AAGCTCGACA CGCCGAAGGC GCCCGGGCAT
CTGTCGGAGG CCGAGCAGAA GACGCTGGCG GCGCTGAAGG CGATGACGCT GTACGAGGAC
GCCGACGTGC TGGTGCTGAA CAAGCCCGCG GGCCTGGCGG TGCAGGGCGG CAGCGGCACC
ACCAAGCACA TCGACCTGAT GCTCGAGGTG ATGCGCGACA GCAAGGGCCA GAAGCCGCGG
CTGGTGCACC GCATCGACAA GGAGACCGCG GGCTGCCTGC TGGTGGCGAA GACGCGCTTC
GCCGCCACCG CGCTGACCGG CTCGTTCCGC CACCGCTCGG CGCGGAAGAT CTACTGGGCG
CTGGTCGCCG GCGTGCCGAA GCCGAAGCAG GGCCGGATCT CGACCTATCT GGCCAAGGAG
GAGAGCGAGG ACGACAGCAT CATGCGGATC GCCGAGCACG GCGACGAGGG CGCCAGCCAC
GCGGTGACCT ACTACGCGGT GGTCGAGACC TCGGCACAGA AGCTCGCCTG GGTGTCGCTG
AAGCCGGTGA CCGGGCGCAC CCATCAGCTC CGCGCCCATA TGGCGCATAT CGGCCACGCC
ATCGTCGGCG ATCCGAAATA CTTCAACATC GAGAACTGGC AATTGCCGGG CGGCCTGCAG
AAGCGGCTGC ATCTGTTGGC GCGACGCATC GTGATTCCGC ATCCGCGCGG CGGCACCATC
GACGTCTCGG CGCCGCTGCC GCCGCACATG CTGCAGAGCT GGAACCTGCT CGGGCTGGAG
CACGACCGGT TCGATCCGAT CGAGCATGCG CCGGAAGAAT GA
 
Protein sequence
MSRRTKKPLA KRGPGGDRPK AGRPAAKRGP AASGPVRGNR GEDGRPISAR PQREPRGERA 
ERPATKRPAA ERPERPASDR PQSARPGRPP ARFGASSATR PPRAQRPEAA IEPEKAAKPD
APLPTKVETV VVTADENGMR VDRFLEGRFP GLSFSHIQRI VRKGELRVNG KRADSKDRLE
EGQSVRIPPL KLDTPKAPGH LSEAEQKTLA ALKAMTLYED ADVLVLNKPA GLAVQGGSGT
TKHIDLMLEV MRDSKGQKPR LVHRIDKETA GCLLVAKTRF AATALTGSFR HRSARKIYWA
LVAGVPKPKQ GRISTYLAKE ESEDDSIMRI AEHGDEGASH AVTYYAVVET SAQKLAWVSL
KPVTGRTHQL RAHMAHIGHA IVGDPKYFNI ENWQLPGGLQ KRLHLLARRI VIPHPRGGTI
DVSAPLPPHM LQSWNLLGLE HDRFDPIEHA PEE