Gene Rsph17029_3804 details

Gene Information

Locus tag: Rsph17029_3804
Symbol:
ID: 4898416
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 934056
End bp: 935582
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 70%
IMG OID: 640114408
Product: deoxyribodipyrimidine photolyase-related protein
Protein accession: YP_001045656
Protein GI: 126464543
COG category: [R] General function prediction only
COG ID: [COG3046] Uncharacterized protein related to deoxyribodipyrimidine photolyase
TIGRFAM ID:
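
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 934056
END_BP = 935582


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the last codon is a stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1527
print(protein_length(length_bp))  # 508
```

Both results match the Gene Length and Protein Length fields of the record.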


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.206805
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACACGGC TCATCCTCGT GCTGGGCGAC CAGCTCAGCG ACGATCTTCG GGCGCTCCGG 
GCGGCGGATC CGGCCGCAGA TCTCGTGGTC ATGGCCGAGG TGATGGAGGA GGGCACCTAT
GTGCCGCACC ATCCGCAGAA GATCGCCCTG ATCCTCGCCG CCATGCGCAA GTTCGCCCGC
CGCCTGCAGG AACGCGGCTT CCGCGTGGCC TATTCCCGGC TGGACGATCC CGAGACCGGG
CCCTCGATCG GCGCCGAGCT CCTGCGGCGG GCCGCAGAGA CCGGGGCCCG CGAGGCGGTC
GCCACCCGGC CCGGCGACTG GCGGCTGATC GAAGCGCTCG AGGCCCTGCC CCTGCCCGTC
CGCTTCCTGC CCGACGACCG TTTCCTCTGC CCGGCAGACG AGTTCGCCCG CTGGGCCGAG
GGGCGCAAGC AGCTGCGCAT GGAGTGGTTC TATCGCGAGA TGCGCCGCAG GACCGGCCTC
CTGATGGAGG GGGACGAGCC CGCGGGCGGG AAGTGGAACT TCGACACAGA GAACCGCAAG
CCCGCGGCGC CCGACCTGCT GCGTCCGCGG CCGCTGCGCT TCGAGCCCGA TGCCGAGGTG
CGCGCAGTCC TCGATCTCGT CGAGGCGCGC TTTCCGCGCC ATTTCGGGCG GCTCCGCCCG
TTCCACTGGC CCACCGACCG GGCCGAGGCG CTGCGGGCGC TCGATCACTT CATCCGCGAA
AGCCTGCCGC GCTTCGGCGA CGAGCAGGAT GCGATGCTGG CCGACGATCC GTTCCTGAGC
CATGCGCTGC TGTCCTCGTC GATGAACCTC GGGCTTCTCG GGCCGATGGA GGTTTGCCGC
CGCGCCGAGA CCGAATGGCG CGAGGGCCGC GCGCCGCTGA ACGCGGTCGA GGGCTTCATC
CGGCAGATCC TCGGCTGGCG GGAATATGTG CGGGGGATCT GGGCGCTCTC GGGGCCGGAC
TACATGCGCT CGAACGGGCT CGGCCACAGC GCCGCCCTGC CGCCACTCTA CTGGGGCAAG
CCCACGCAGA TGGCCTGCCT CTCGGCCGCG GTCGCCCAGA CCCGCGATCT CGCCTATGCC
CACCACATCC AGCGACTGAT GGTGACGGGC AATTTTGCGC TGCTGGCGGG TGTCGATCCC
GCCGAGGTGC ACGAATGGTA TCTCTCGGTC TATATCGATG CGCTGGAATG GGTCGAGGCG
CCGAACACGA TCGGGATGAG CCAGTTCGCC GATCACGGGC TCCTCGGCTC GAAACCCTAT
GTCTCGTCCG GCGCCTATAT CGACCGGATG TCGGATTACT GCCGCGGCTG CGCCTATGCG
GTGAAGGACC GGACGGGGCC CCGCGCCTGC CCCTTCAACC TGCTCTACTG GCACTTCCTG
AACCGGCACC GCGCGCGGTT CGAGCGCAAC CCCCGCATGG TCCAGATGTA TCGCACCTGG
GACCGGATGG AGGAGACCCA TCGCTCGCGG GTTCTGACCG AGGCAGAGGC CTTCCTCGGC
CGGCTCCACG CGGGCGAGCC GGTCTGA
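
The record reports a GC content of 70% for this gene. A minimal sketch of how such a figure could be computed from the space-grouped sequence text above is shown here; the helper names are hypothetical, and the example call uses only the first line of the gene sequence, so its result is not the whole-gene value.

```python
def clean_sequence(text: str) -> str:
    """Strip the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied as displayed (10-base blocks).
first_line = (
    "GTGACACGGC TCATCCTCGT GCTGGGCGAC "
    "CAGCTCAGCG ACGATCTTCG GGCGCTCCGG"
)
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full gene sequence instead of `first_line` gives the value to compare against the GC content field.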
 
Protein sequence
MTRLILVLGD QLSDDLRALR AADPAADLVV MAEVMEEGTY VPHHPQKIAL ILAAMRKFAR 
RLQERGFRVA YSRLDDPETG PSIGAELLRR AAETGAREAV ATRPGDWRLI EALEALPLPV
RFLPDDRFLC PADEFARWAE GRKQLRMEWF YREMRRRTGL LMEGDEPAGG KWNFDTENRK
PAAPDLLRPR PLRFEPDAEV RAVLDLVEAR FPRHFGRLRP FHWPTDRAEA LRALDHFIRE
SLPRFGDEQD AMLADDPFLS HALLSSSMNL GLLGPMEVCR RAETEWREGR APLNAVEGFI
RQILGWREYV RGIWALSGPD YMRSNGLGHS AALPPLYWGK PTQMACLSAA VAQTRDLAYA
HHIQRLMVTG NFALLAGVDP AEVHEWYLSV YIDALEWVEA PNTIGMSQFA DHGLLGSKPY
VSSGAYIDRM SDYCRGCAYA VKDRTGPRAC PFNLLYWHFL NRHRARFERN PRMVQMYRTW
DRMEETHRSR VLTEAEAFLG RLHAGEPV