Gene Rcas_0474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0474 
Symbol 
ID5537937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp613696 
End bp614790 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content61% 
IMG OID640892636 
Productrestriction endonuclease 
Protein accessionYP_001430622 
Protein GI156740493 
COG category[V] Defense mechanisms 
COG ID[COG1715] Restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.674991 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACGTC GTCGCTCTCG CCGCTCATCG GATGCCGGAA GCGCTGTGAG TATTCTGCTC 
GTCGCCCCGG TATGCGCCGG GTTTTTCTGG CAATCATGGA CGCAACTCGC ACTCGCCTGG
CAGGTCGCTG CCGTCGTGCT GGCGTGTTCA ATCCTGTTTC TGCTGTTTCT GTTCTTCATT
GATCTGTTCC GCGCTCTTCG TCAGCGCGCT CTGCTGCAGA AAGCATTGTT GGCGCTGACG
CCTTCAGAAT TTGAGGAACG GGTGCTGCTG CTCCTAAAGG ATCTCGGATG GACCAATCTT
CGACTGCGTG GCGGCAGCGG CGACCGAGGG GTCGATCTCG AAGGCGAGTT CCAGGGACAA
CGCTATATTG TCCAGTGCAA ACGACACACC AAAGCAGTGC CGCCTTCGAT GGTGCGCGAT
CTGGCGGGCG CTTTGCATAT TCAGCGCGCT GATCGTGCGT TGCTGGTGAC CACAAGTTCT
TTTACACGGC AAGGGTACGA AGAGGCTTGT AATCAGCCGA TTGAATTATG GGACGGCGAC
ATCCTGGCGC GCAAGATCAA AGAAGCCGAT GCGTTACGCG CGAACCCGGC GCACCGGCGC
AATGCCTGGC GAGGTCGGGT TGCGGTGTTG GCGACATTCG CTGTGGCGAA CGCATCGTGT
GTGCTGTTCG CATTCGTCAG CGCCGGCGCG CCGGCGCTGA CGGCGCCCGC CGCGCGAACA
AGCGGCGCTC CATCGCACAC AAACGTCACC AATCCGACGG CGATCCCCGT ACAAACGGAT
ATTCCCCCCG CCTCTCCGAC GGCGATCCCC GTACAAACGG ATATTCCCCC CGCCTCTCCA
GCGCCGACGC CAACCGAGCG CCCGGTCCTG ACAACAACCG TCTTCAACGG CGGGAATGTG
CGCGCAGCGC CCAACCTTCG AGGGGCGGTG CTCGATCAGG TCCATGCGGG AGAAATCGTC
GAACTGCTTG GTCGCTCCCC CGACGGAAAC TGGTTCTACA TTCGCAACCC GCGCAATCAG
GTCGGATGGA CGCATCGCAC GTTGCTGAAC CTCGACGCAG GCGTGGATGA TCGCCTGGAT
GTGCTGCGAC CTTGA
 
Protein sequence
MSRRRSRRSS DAGSAVSILL VAPVCAGFFW QSWTQLALAW QVAAVVLACS ILFLLFLFFI 
DLFRALRQRA LLQKALLALT PSEFEERVLL LLKDLGWTNL RLRGGSGDRG VDLEGEFQGQ
RYIVQCKRHT KAVPPSMVRD LAGALHIQRA DRALLVTTSS FTRQGYEEAC NQPIELWDGD
ILARKIKEAD ALRANPAHRR NAWRGRVAVL ATFAVANASC VLFAFVSAGA PALTAPAART
SGAPSHTNVT NPTAIPVQTD IPPASPTAIP VQTDIPPASP APTPTERPVL TTTVFNGGNV
RAAPNLRGAV LDQVHAGEIV ELLGRSPDGN WFYIRNPRNQ VGWTHRTLLN LDAGVDDRLD
VLRP