Gene RPD_3098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3098 
Symbol 
ID4023602 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3441798 
End bp3443261 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content68% 
IMG OID637963298 
Productdeoxyribodipyrimidine photolyase 
Protein accessionYP_570225 
Protein GI91977566 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.383524 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0341678 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAACA ACACGCCCCG CCCCGTCATC GTTTGGTTTC GCGACGATCT GCGGCTGTCC 
GATCACCCTG CCCTGCATCA AGCTGCCGCA TCCGGCGGAC CGCTGATCTG CATCTACGTC
TTCGACGAAG ACAGCGCGCA GCTCCGCTCG CCGCAGGCCA GACCGCTCGG CGGCGCATCG
CGCTGGTGGC TGGCGCAATC GCTGCGCGCG CTCGCTGCCA GTTTGGAGAA GCGCGGCGCG
CGGCTGATCC TGCGCCGCGG ACCGGCCGCC GCGATCATCG CCGAGCTGGC GCGCCAGGTC
GACGCCAGCG CGGTGCACTG GAACGAGATC GAGATCGCGC CGCATCGCGC GGTCGCCGAC
GACCTCGCCG AAGCGTTGAG CGTCGCCGGG ATCGATCACC ACCGCCATCG CGGCGATCTG
CTCGCTTCGC CAGCGGAGGT GCGCACCAAA GAAGGCCGCG GACTGCGCGT GTTCACGCCG
TTCTGGCGAC GCGTGCTCGG CCTTGGCGAT CCGCCGAAGC TGCTGCCGGC GCCGAAAACC
CTGAGTGCCG CGCAAGGTCC GTCCGGCGAT CAGCTTGATA GCTGGATGCT CGAGCCGACC
GAACCGGACT GGGCCGGCGG CCTGCGCGAA AGCTGGACGC CCGGCGAAGG CGCCGCGCAA
GACAACCTCA CCGCATTCCT CGACGCCCTG CCCGGCTACA CCGAAGGCCG CGACCGGCCC
GACTGCGCTG CGACGTCGCG GCTGTCGCCG CATCTGCGGT TCGGCGAGAT CAGCCCGCGT
CAGGTCTGGT ACGCGGCGCG GTTCGCCGCG GCGGAGCGGC CCGCGATCGC CGGCGACATC
GACAAGTTCC TGAGCGAACT CGGCTGGCGC GAGTTCTGCC GGCATTTGCT GCACGATCAT
CCCGATCTCG CCGAGCGCAA TCTGCAGGCC TCATTCGACG CCTTTCCCTG GATCACCGAC
GCCGCCGCGC TGCACGCCTG GCAGCGCGGC TGCACCGGTT ATCCGATCGT CGATGCGGGA
ATGCGCGAGC TCTGGCACAC CGGCGTGATG CACAATCGCG TCCGCATGGT GGTGGCGTCG
TTCCTGGTGA AGCATCTGCT GATCGACTGG CGCTGCGGCG AGCAATGGTT CTGGGACACG
CTGGTCGACG CCGATGCCGG CAGCAATCCG GCCAATTGGC AGTGGGTCGC GGGCTCCGGC
GCCGATGCCG CGCCGTATTT TCGCGTGTTC AATCCCATCC TGCAGGGAGA AAAATTCGAC
CCGGCCGGCG ACTATGTGCG TCGCTGGGTG CCTGAACTCG CCTCGCTTCC CGCTAAATTC
ATCCACCAGC CATGGACTGC GACGCCGTTC GAACTCGCAG CGGCGGGCGT CACACTCGGC
GGCAATTATC CGGAGCCGAT CATCGATCAC CGGGTCGGAC GCGAGCGCGC GCTTGCGGCT
TACGCCAAAA CGCGTCAGCA TTGA
 
Protein sequence
MPNNTPRPVI VWFRDDLRLS DHPALHQAAA SGGPLICIYV FDEDSAQLRS PQARPLGGAS 
RWWLAQSLRA LAASLEKRGA RLILRRGPAA AIIAELARQV DASAVHWNEI EIAPHRAVAD
DLAEALSVAG IDHHRHRGDL LASPAEVRTK EGRGLRVFTP FWRRVLGLGD PPKLLPAPKT
LSAAQGPSGD QLDSWMLEPT EPDWAGGLRE SWTPGEGAAQ DNLTAFLDAL PGYTEGRDRP
DCAATSRLSP HLRFGEISPR QVWYAARFAA AERPAIAGDI DKFLSELGWR EFCRHLLHDH
PDLAERNLQA SFDAFPWITD AAALHAWQRG CTGYPIVDAG MRELWHTGVM HNRVRMVVAS
FLVKHLLIDW RCGEQWFWDT LVDADAGSNP ANWQWVAGSG ADAAPYFRVF NPILQGEKFD
PAGDYVRRWV PELASLPAKF IHQPWTATPF ELAAAGVTLG GNYPEPIIDH RVGRERALAA
YAKTRQH