Gene Rsph17029_0817 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_0817 
Symbol 
ID4896480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp828341 
End bp829756 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content73% 
IMG OID640111401 
Productdeoxyribodipyrimidine photo-lyase 
Protein accessionYP_001042700 
Protein GI126461586 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGCCG ACGCTCCCCT GATCCTGTGG TTCCGGCGCG ACCTGCGGCT GGCCGACAAT 
CCGATGCTGG CAGAGGCGGC GGCCACGGGC CGGCCGCTGA TCCCGCTGTT CATCCTGGAT
CCCGAGACCG AGGCGCTGGG CGCCGCGCCG AAATGGCGGC TGGGTCTCGG GGTCGAGGCC
TTCGCTCAGG CGCTGGAAGG ACTGGGCAGC CGGCTCGTGC TGCGGCGGGG GCCGGCGCTC
GCCGTGCTCA AGACGCTGGT GGCCGAGACC GGGGCTGCGG GGGTGCACTG GTCGCGGCTC
TGGGAGCCGG ACTGGCGGGC GCGCGACGAG GGGGTGACGG CGGGGCTCCG GCAGGCGGGC
ATCGAGGCCG CGCGCCATGC CGGCCACACG ATCTTCGAGC CCCGGGAGGT GGAGACCGGG
CAGGGCGGCT TCTACCGGGT CTATACGCCG TTCTGGAAAG CGGTGAAGGA CCGCCCGGTC
GCGGCCTCCT TCCCGCCGCC CGCGCGGCTG CCGTCTCCCG CGGAGTGGCC GGTCTCCGAG
CGACTGGCCT CTTGGGATCT CGGGCGGGCG ATGAACCGGG GCGCGGCCGT GGTGGCGCCG
CATCTGGCGG TGGGCGAGGC GGCGGCGGCC GAACGGCTGG CGCGGTTCCT GAGCGGGCCG
CTCGACCGCT ATGCCGCGGA GCGCGACCGG CCGGATGCGC CCGTGACCTC GCGCCTGTCG
GAAAACCTCA CCTATGGCGA GATCTCGGCC CGCAGCCTCT GGCACGCCGG CATGCGCGCC
CGTGCGGAGG GGCGGGCGGG GGCCGAGAAG TTCCTCCAGG AGCTCGCCTG GCGCGAGTTC
GGCTGGCATC TGCTCTACCA CACGCCCGAG ATCGCGCGCC GCAACTGGCG CGGCGACTGG
GATGCCTTTC CCTGGCGCGG CGACAATCCC GACGCCGAAC GCTGGCGGCG CGGCATGACC
GGCGAGCCCT TCGTCGATGC GGCCATGCGC GAGCTGTTCG TGACCGGCAC CATGCACAAC
CGCGCGCGGC TGATCGCGGG CAGTTACCTT ACGAAGCATC TGCTGACCGA CTGGCGCGTG
GGCAAGGCCT GGTTCGAGGA CTGCCTGATC GACTGGGACC CGGCGTCGAA CGCGCTCGGC
TGGCAGTGGG TCGCGGGGTC GGGGCCCGAT GCCTCGCCCT ATTTCCGCAT CTTCAACCCC
GCGACCCAGG CCGAGAAGTT CGATCCCGAG AGTGCCTATC GCCGGAGGTT CCTTGCTGAA
ATCGCGCGCA GGCCCGGCCC CGAGGCGCTT GCCTTCTTCG AGGCGGTGCC GCGAAGCTGG
GGCCTTCGGC CCGATCGATG CTACCCTCGG CCCGTCGTGG GGCTGGCGGA GGGGCGGGAG
CGGGCACTGG CCGCCTACGG GCGGCGCAAC AACTGA
 
Protein sequence
MMADAPLILW FRRDLRLADN PMLAEAAATG RPLIPLFILD PETEALGAAP KWRLGLGVEA 
FAQALEGLGS RLVLRRGPAL AVLKTLVAET GAAGVHWSRL WEPDWRARDE GVTAGLRQAG
IEAARHAGHT IFEPREVETG QGGFYRVYTP FWKAVKDRPV AASFPPPARL PSPAEWPVSE
RLASWDLGRA MNRGAAVVAP HLAVGEAAAA ERLARFLSGP LDRYAAERDR PDAPVTSRLS
ENLTYGEISA RSLWHAGMRA RAEGRAGAEK FLQELAWREF GWHLLYHTPE IARRNWRGDW
DAFPWRGDNP DAERWRRGMT GEPFVDAAMR ELFVTGTMHN RARLIAGSYL TKHLLTDWRV
GKAWFEDCLI DWDPASNALG WQWVAGSGPD ASPYFRIFNP ATQAEKFDPE SAYRRRFLAE
IARRPGPEAL AFFEAVPRSW GLRPDRCYPR PVVGLAEGRE RALAAYGRRN N