Gene Information |
Locus tag | Rcas_4411 |
Symbol | |
ID | 5541924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 5663229 |
End bp | 5664692 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640896509 |
Product | deoxyribodipyrimidine photo-lyase |
Protein accession | YP_001434445 |
Protein GI | 156744316 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0415] Deoxyribodipyrimidine photolyase |
TIGRFAM ID | [TIGR02765] cryptochrome, DASH family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00088119 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0752784 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCTGGA TCCACTGGTT TCGCCGCGAT CTGCGTCTGC ACGACAATCC AGCGTTGCAC ACCGCCAGCA TTCGGAGCGA TGGGCGAGTC ATCCCGCTCT TCATCCTTGA CGACGCCATT CTGCATGCGC CACGCACCGG CGCCGCGCGT ATCGCGTTCA TGATCGCCGC ACTGCGCGAT CTCGACGCCA ACCTGCGCGC GCGCGGCAGT CGCCTGGTGA TCCGGCGTGG ACGGACGCTC GACGTGATCC GCGCCATGGT GCAGGAAACC GGCGCCACCG GCGTTGCCTG GAACCGTGAC TACACCCCGT TTGCGCGACG GCGCGATGCG CAGGTCGAAG CGGCGCTGCG TGATCTGAAC GTCGAGACCT CTATTGCGGA AGACGCGGTC GTCTTCAGTC CCGACGACGT GCGTACCGGC GATGGACGCC CCTATACGGT GTACACGCCT TACCGGCGAC GCTGGCGCGC GCTGACAGAA CAGCGGCGCG CGGAGGTGCT GCGCGCGATC GAGCCGCCGC TTTTGCGCCC GGCGCCCGAA GCCGTTGCAG ATCAGACAGT TCCAGACCAT GCAGACCTCG GCATCGTGGT TTCCCAACGC ATTCCTCCAG GCGGCGAAAC GCATGGCGCT GCGCGACTGG CGGCATTTGT CGATCTGGCT GCCGCACACA GCATCGCGGG CTACGCAGAA GGGCGCGATC TGCTGGCAGA ACCGGCGACC TCGCGCCTGT CGCCCTACCT GCGGTTTGGG TGCGTGGCGC CGCGGCAGGC GCTGCGCGCA GCGCTGCGCC TGCTCGACAT CGTCGGCGAT GATCACCGTA CCGTCCGATC GATTGAAACC TGGATCGGGG AACTCGCCTG GCGCGATTTC TACTATCAGA TTCTATGGCA CTACCCGCAT GTGGTGCGCC GATCGTTCAA ACCGCAGTAC GATGCGCTGG CGTGGGAAAA CGACCCGGCG CTGTTCGACG CCTGGAAGGA AGGTCGCACC GGCTACCCGA TCATTGATGC TGCTATGCAC CAGTTGCGTC AGGAAGCCTG GATGCACAAC CGGGCGCGCA TGATCGTCGC TTCGTTTCTG ACGAAAGATC TGTTGATCGA CTGGCGCTGG GGCGAACGCT ACTTTATGCA GCAACTGGTC GATGGCGATC ACGCCGCCAA TAATGGCGGA TGGCAGTGGA GCGCCGGGAC AGGCACCGAT GCACAACCCT ACTTCCGAAT CTTCAATCCG GTCAGTCAGG GACAAAAATT CGATCCAAAG GGCTTGTATG TGCGCCGCTA TCTGCCAGAA CTGGCGCAGG TTCCAGACCG CTACATCCAC GCGCCGTGGA CAATGCCGCG CGCAGAACAG CAACGGTGCG GCGTTGTCAT CGGGCGCGAC TACCCCGCGC CGATTGTCGA TCATGCAGAA CGACGAATGC GCGCACTGGC GCTCTATCGC GCAGTATCGT CGGTTGCGCT TTAG
|
Protein sequence | MIWIHWFRRD LRLHDNPALH TASIRSDGRV IPLFILDDAI LHAPRTGAAR IAFMIAALRD LDANLRARGS RLVIRRGRTL DVIRAMVQET GATGVAWNRD YTPFARRRDA QVEAALRDLN VETSIAEDAV VFSPDDVRTG DGRPYTVYTP YRRRWRALTE QRRAEVLRAI EPPLLRPAPE AVADQTVPDH ADLGIVVSQR IPPGGETHGA ARLAAFVDLA AAHSIAGYAE GRDLLAEPAT SRLSPYLRFG CVAPRQALRA ALRLLDIVGD DHRTVRSIET WIGELAWRDF YYQILWHYPH VVRRSFKPQY DALAWENDPA LFDAWKEGRT GYPIIDAAMH QLRQEAWMHN RARMIVASFL TKDLLIDWRW GERYFMQQLV DGDHAANNGG WQWSAGTGTD AQPYFRIFNP VSQGQKFDPK GLYVRRYLPE LAQVPDRYIH APWTMPRAEQ QRCGVVIGRD YPAPIVDHAE RRMRALALYR AVSSVAL
|
| |
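The length fields in this record follow from the coordinates, and the short sketch below shows that arithmetic. It assumes 1-based, inclusive coordinates (the listed Start bp, End bp, and Gene Length values are consistent with this) and a CDS that ends in a single stop codon (the gene sequence above ends in TAG). The function names are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


length = gene_length(5663229, 5664692)
print(length)                  # 1464, matching "Gene Length"
print(protein_length(length))  # 487, matching "Protein Length"

# First 20 bases of the gene sequence only, as a small demonstration.
print(gc_percent("ATGATCTGGA TCCACTGGTT"))  # 45.0
```

Passing the full gene sequence to `gc_percent` (the 10-base spacing is ignored) gives a value that can be compared with the GC content field of the record.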