Gene Rcas_4251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4251 
Symbol 
ID5541762 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5492720 
End bp5494246 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content62% 
IMG OID640896358 
Productdeoxyribodipyrimidine photolyase-related protein 
Protein accessionYP_001434296 
Protein GI156744167 
COG category[R] General function prediction only 
COG ID[COG3046] Uncharacterized protein related to deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGATA CGCCCGTTAC CGTCTGGATT CTCGGCGACC AGTTGCTTCG CGCGCATCCG 
GCCATCGAGG CGGTCGAGGC GCAGGCAGGG CGACGGAATA TGCGTATTGT GCTGGTCGAG
AGTGATGCGC GCATCGCGCA GATGCCGTAC CAGCGCAAGA AGATTGTTCT GATCCTCAGC
GCCATGCGCC ACTATGCGGC GTCGCTGCGC GAGCGCGGAT ACCTGGTCGA TGAGGTGCGC
GCGCCGTCGT TTGTCGCAGG GTTGCAGCGC CACATTGCGC AGCATCGGTC GCAACGCCTG
GTGACGATGG CAGCCGCCGA GTATGAGACC CGTCGCTTTC AACAGGATTA TCTGAGCCAG
GCGCTTGGCA TTTCGGTCGA GGTGTTGCCC AACACCCAGT TTCTCACCGG TCGGCGCAAT
CCGTTCCCCG ATGTCGAACC TTCGCAGCGC GTGATCATGG AGCGCTTCTA CCGTGCGATG
CGCCGCCATT TTGACGTGCT GCTCGAACCA GATGGATCAC CGACCGGCGG TGCGTGGAAC
TTCGACCGGT TGAACCGTCG TCCGTTGCCG CGTTCGGTAG CGCCGCCGCC GCCGTTGACC
TTCGAGCCGG ATGCGATCAC GCGCCGGGTC ATCGCCGAAG TCGAGGCGCG CAGCGGCGGG
GTCGGCACAG CAACCGGATT TGCGCTCGCG GTTACCCATG CGCAGGCGGA AGCGGCGCTT
GCCGATTTCG TTGCGCATCG CCTGGCAGCG TTCGGACCCT ACGAAGATGC GATGAGCGCC
GCGCATGATG TGCTGTTCCA CTCGATGCTC TCCCCCTACC TCAACATTGG GCTGCTCGAA
CCGTTGCAGA TAGTGCGCAC CGTTGAGGAA GCGTATCGCG CCGGCGCCGC TCCGATCCAG
TCGGTCGAAG GGTTCGCGCG CCAGATCATC GGCTGGCGTG AGTACATCTA CTGGCAGTAC
TGGCGATTGA TGCCCGGTTT GCGCGACGCG AACGCCTGGA ATGCAACGCG CCCGCTACCG
CGCTTTTTCT GGAGTGGCGA CACCGATATG CGTTGCCTGC GTCACGTCAT TGAACGCGCG
ATTGCGACCG GCTACACCCA CCACATCGAA CGTTTGATGA TCGTCTGCAA CTTTTGCCTG
CTGGCAGGAA TCCGTCCCTC AGAGGTCAAC GACTGGTTCC TTGCGCACTA CGTCGATGCC
TATGATTGGG TGATGCAGCC GAATGTGATC GGTATGGGGC TGAACGCCGA CGGTGGGTTG
ACGGCGACCA AACCGTACAT CGCTTCGGCT GCGTATATCA GAAAGATGAG CGATTACTGC
GACGGATGTC GCTATGACCC AAAGCGGCGC ATTGGACCGG ATGCCTGTCC GTTCAACACA
CTCTACTGGA ACTTTCTGAT CCGGCACGAA GCGACACTCC GCGCCAATCC GCGCATGGGT
CCGGCGGTGC TGGGACTGGC GCGGTTCGAC GCCTCCGAGC GTGAGGCGGT GACGCGGCAG
GCGGAGATGG TGTTGGGGGA GAACTGA
 
Protein sequence
MNDTPVTVWI LGDQLLRAHP AIEAVEAQAG RRNMRIVLVE SDARIAQMPY QRKKIVLILS 
AMRHYAASLR ERGYLVDEVR APSFVAGLQR HIAQHRSQRL VTMAAAEYET RRFQQDYLSQ
ALGISVEVLP NTQFLTGRRN PFPDVEPSQR VIMERFYRAM RRHFDVLLEP DGSPTGGAWN
FDRLNRRPLP RSVAPPPPLT FEPDAITRRV IAEVEARSGG VGTATGFALA VTHAQAEAAL
ADFVAHRLAA FGPYEDAMSA AHDVLFHSML SPYLNIGLLE PLQIVRTVEE AYRAGAAPIQ
SVEGFARQII GWREYIYWQY WRLMPGLRDA NAWNATRPLP RFFWSGDTDM RCLRHVIERA
IATGYTHHIE RLMIVCNFCL LAGIRPSEVN DWFLAHYVDA YDWVMQPNVI GMGLNADGGL
TATKPYIASA AYIRKMSDYC DGCRYDPKRR IGPDACPFNT LYWNFLIRHE ATLRANPRMG
PAVLGLARFD ASEREAVTRQ AEMVLGEN