Gene Rcas_4411 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4411 
Symbol 
ID5541924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5663229 
End bp5664692 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content63% 
IMG OID640896509 
Productdeoxyribodipyrimidine photo-lyase 
Protein accessionYP_001434445 
Protein GI156744316 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID[TIGR02765] cryptochrome, DASH family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00088119 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0752784 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCTGGA TCCACTGGTT TCGCCGCGAT CTGCGTCTGC ACGACAATCC AGCGTTGCAC 
ACCGCCAGCA TTCGGAGCGA TGGGCGAGTC ATCCCGCTCT TCATCCTTGA CGACGCCATT
CTGCATGCGC CACGCACCGG CGCCGCGCGT ATCGCGTTCA TGATCGCCGC ACTGCGCGAT
CTCGACGCCA ACCTGCGCGC GCGCGGCAGT CGCCTGGTGA TCCGGCGTGG ACGGACGCTC
GACGTGATCC GCGCCATGGT GCAGGAAACC GGCGCCACCG GCGTTGCCTG GAACCGTGAC
TACACCCCGT TTGCGCGACG GCGCGATGCG CAGGTCGAAG CGGCGCTGCG TGATCTGAAC
GTCGAGACCT CTATTGCGGA AGACGCGGTC GTCTTCAGTC CCGACGACGT GCGTACCGGC
GATGGACGCC CCTATACGGT GTACACGCCT TACCGGCGAC GCTGGCGCGC GCTGACAGAA
CAGCGGCGCG CGGAGGTGCT GCGCGCGATC GAGCCGCCGC TTTTGCGCCC GGCGCCCGAA
GCCGTTGCAG ATCAGACAGT TCCAGACCAT GCAGACCTCG GCATCGTGGT TTCCCAACGC
ATTCCTCCAG GCGGCGAAAC GCATGGCGCT GCGCGACTGG CGGCATTTGT CGATCTGGCT
GCCGCACACA GCATCGCGGG CTACGCAGAA GGGCGCGATC TGCTGGCAGA ACCGGCGACC
TCGCGCCTGT CGCCCTACCT GCGGTTTGGG TGCGTGGCGC CGCGGCAGGC GCTGCGCGCA
GCGCTGCGCC TGCTCGACAT CGTCGGCGAT GATCACCGTA CCGTCCGATC GATTGAAACC
TGGATCGGGG AACTCGCCTG GCGCGATTTC TACTATCAGA TTCTATGGCA CTACCCGCAT
GTGGTGCGCC GATCGTTCAA ACCGCAGTAC GATGCGCTGG CGTGGGAAAA CGACCCGGCG
CTGTTCGACG CCTGGAAGGA AGGTCGCACC GGCTACCCGA TCATTGATGC TGCTATGCAC
CAGTTGCGTC AGGAAGCCTG GATGCACAAC CGGGCGCGCA TGATCGTCGC TTCGTTTCTG
ACGAAAGATC TGTTGATCGA CTGGCGCTGG GGCGAACGCT ACTTTATGCA GCAACTGGTC
GATGGCGATC ACGCCGCCAA TAATGGCGGA TGGCAGTGGA GCGCCGGGAC AGGCACCGAT
GCACAACCCT ACTTCCGAAT CTTCAATCCG GTCAGTCAGG GACAAAAATT CGATCCAAAG
GGCTTGTATG TGCGCCGCTA TCTGCCAGAA CTGGCGCAGG TTCCAGACCG CTACATCCAC
GCGCCGTGGA CAATGCCGCG CGCAGAACAG CAACGGTGCG GCGTTGTCAT CGGGCGCGAC
TACCCCGCGC CGATTGTCGA TCATGCAGAA CGACGAATGC GCGCACTGGC GCTCTATCGC
GCAGTATCGT CGGTTGCGCT TTAG
 
Protein sequence
MIWIHWFRRD LRLHDNPALH TASIRSDGRV IPLFILDDAI LHAPRTGAAR IAFMIAALRD 
LDANLRARGS RLVIRRGRTL DVIRAMVQET GATGVAWNRD YTPFARRRDA QVEAALRDLN
VETSIAEDAV VFSPDDVRTG DGRPYTVYTP YRRRWRALTE QRRAEVLRAI EPPLLRPAPE
AVADQTVPDH ADLGIVVSQR IPPGGETHGA ARLAAFVDLA AAHSIAGYAE GRDLLAEPAT
SRLSPYLRFG CVAPRQALRA ALRLLDIVGD DHRTVRSIET WIGELAWRDF YYQILWHYPH
VVRRSFKPQY DALAWENDPA LFDAWKEGRT GYPIIDAAMH QLRQEAWMHN RARMIVASFL
TKDLLIDWRW GERYFMQQLV DGDHAANNGG WQWSAGTGTD AQPYFRIFNP VSQGQKFDPK
GLYVRRYLPE LAQVPDRYIH APWTMPRAEQ QRCGVVIGRD YPAPIVDHAE RRMRALALYR
AVSSVAL