Gene Dshi_1225 details


Gene Information

Locus tag: Dshi_1225
Symbol:
ID: 5711783
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 1267586
End bp: 1269130
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 72%
IMG OID: 641267137
Product: deoxyribodipyrimidine photo-lyase
Protein accession: YP_001532568
Protein GI: 159043774
COG category: [L] Replication, recombination and repair
COG ID: [COG0415] Deoxyribodipyrimidine photolyase
TIGRFAM ID:
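
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1267586, 1269130)
print(length)                     # 1545
print(protein_length_aa(length))  # 514
```

Both results match the "Gene Length" and "Protein Length" fields listed in the record.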


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.474608
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.333965
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGATC CCGCCCCCAT CCTTGTCTGG TTCAAGCGCG ACCTGCGCGT GGCCGATCAC 
CCGGCGCTGG CGCGGGCGGC CGCACTTGGG CCTGTCCTGC CGGTTTACAT CGTGGAGCCG
GAGTATTGGC AGCTTCGGGA CGTCTCGGCG CGGCAGTGGG CGTTCACGCG GGAATGTGTG
GCGGACCTGT CGCGGGAGCT TGCGGCGCTG GGCGCGCCCT TGCGGATCGA GACCGGAGAG
GCGGTCGCGG TGCTGGAGCG GCTGCGCGCC GCGCACGGGA TCACCCTGAT GATCAGCCAT
GAGGAGATCG GCAATGGCTG GACCTATGCC CGCGACCGGG CAGTGGCCGA TTGGGCGCGG
GCGCAGGGCG TGGCCTGGGA GGAGTTGGCG CAATCCGGGG TCGTGCGGCG GCTGAACGGG
CGGGACGGCT GGGCGCGTAC GCGGGACCGG TTCATGGCAC AGCCGCAGGT GGCGGTGCCC
GTGCTGCGCG GTGTCGACGG CGGGGGCGGG GCCTTGCCCG ATGCGGCCGC GCTGGGCCTT
GCCGACGATC CCTGCCCGGG GCGGCAGCGC GGCGGGCGGG ATGAGGCCTT GTCCCTGTTG
GGCGGGTTTC TGACCGAGCG GGGCCGGACC TACCGCGCGG CGATGGCCAA TCCGCTGGAC
GGGGCGGAAG CGTGCTCGCG CCTGTCGCCG CACCTGGCGC TCGGCACCCT GTCGGGGCGG
GAGGCGGTTC AGGCGGCGGC AATGCGCAAG GCCGAGGTGA AGGGCACGCG GGACGGCTGG
ATCGGGGCGA TGAAGAGCTT CGAGGCGCGG CTCGCCTGGC GCGATCACTT CATGCAGAAG
CTGGAGGATG CGCCGCGGCT GGAGCATGCG TGCCTCCATT CGGCTTATGA GGGGTTGCGG
CCCGCGGTGC CGGACCCGGT GCGGCTGGGG GCCTGGGCCA AGGGGGAGAC GGGATTGCCC
TTCGTCGATG CCTGCATGCG GTCGCTGATC GCCACGGGGT GGCTGAATTT TCGGGCGCGC
GCGATGCTGG TGGCGGTGGC GTCCTATCAC CTGTGGCTGG ATTGGCGCGC CTCCGGCACG
ATCCTGGGGC GGTATTTCAC CGATTTCGAG CCGGGGATTC ACTGGCCGCA GGTGCAGATG
CAGTCGGGCA CCACGGGGAT GAACACGGTG CGGATCTACA ACCCGGTCAA GCAGGGGCAT
GACAACGACC CCGAGGGCGT GTTCACCCGC CGCTGGCTGC CGGAACTGGC GGAAGTTCCG
GACCGGTACC TGCAGGAGCC CTGGCGCTGG GAGGGGGCGG ACAGCGTGCT GAACCGGACC
TATCCCGCGC CGATCGTGGA GCCCAAGGCC GCGGCGGCAG CGGCCCGCGA CAAGGTCTGG
GCCGTGCGGC GCGGAGAGGC GTTTCGCAGC GAGGCCGCCC GGGTGGTCGA AAAGCACGCC
AGCCGGAAGG ACGCGCAGGG GCGGTTCGTC AATGACCGCG CCCCGCGCAA GACCCGCCGC
CGGGCGCCGA AGGCCCCGCC GGGGCAGATG AGCCTCGACC TGTGA
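
A GC content figure such as the one listed in the gene information can be recomputed from a sequence block in this space-grouped layout. The following is a minimal sketch; `gc_percent` is a hypothetical helper, and how the database rounds its reported value is not stated in the record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base groups of the gene sequence, used only as a short demo.
print(gc_percent("GTGACCGATC CCGCCCCCAT"))  # 70.0
```

Passing the full gene sequence as a single string (line breaks included) gives the value for the whole gene.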
 
Protein sequence
MTDPAPILVW FKRDLRVADH PALARAAALG PVLPVYIVEP EYWQLRDVSA RQWAFTRECV 
ADLSRELAAL GAPLRIETGE AVAVLERLRA AHGITLMISH EEIGNGWTYA RDRAVADWAR
AQGVAWEELA QSGVVRRLNG RDGWARTRDR FMAQPQVAVP VLRGVDGGGG ALPDAAALGL
ADDPCPGRQR GGRDEALSLL GGFLTERGRT YRAAMANPLD GAEACSRLSP HLALGTLSGR
EAVQAAAMRK AEVKGTRDGW IGAMKSFEAR LAWRDHFMQK LEDAPRLEHA CLHSAYEGLR
PAVPDPVRLG AWAKGETGLP FVDACMRSLI ATGWLNFRAR AMLVAVASYH LWLDWRASGT
ILGRYFTDFE PGIHWPQVQM QSGTTGMNTV RIYNPVKQGH DNDPEGVFTR RWLPELAEVP
DRYLQEPWRW EGADSVLNRT YPAPIVEPKA AAAAARDKVW AVRRGEAFRS EAARVVEKHA
SRKDAQGRFV NDRAPRKTRR RAPKAPPGQM SLDL
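
The protein sequence can be related back to the gene sequence through translation table 11 (the NCBI bacterial, archaeal and plant plastid code). The sketch below is a hand-rolled illustration rather than the database's own method: it assumes the gene sequence is given in coding orientation, and it translates the initiation codon (here GTG) as methionine. `translate_cds` and the constants are hypothetical names.

```python
BASES = "TCAG"
# Amino acid assignments for table 11, in TCAG codon order (same as table 1).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)
# Initiation codons permitted by table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading the start codon as M and stopping at a stop."""
    dna = "".join(seq.split()).upper()
    if not dna or len(dna) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError("first codon is not a table 11 start codon")
    residues = ["M"]
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base groups of the gene sequence.
print(translate_cds("GTGACCGATC CCGCCCCCAT CCTTGTCTGG"))  # MTDPAPILVW
```

The output matches the first ten residues of the protein sequence listed above.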