Gene RoseRS_0281 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_0281 
Symbol 
ID5207216 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp356471 
End bp359557 
Gene Length3087 bp 
Protein Length1028 aa 
Translation table11 
GC content64% 
IMG OID640593907 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_001274663 
Protein GI148654458 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000241298 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGATGAGTG AGCGTCACTT CTGTGAATCC ACCATCGAAT CCGCTGCCCT CGCCCGGCTA 
GAAGCTTCTG GCTGGCAGGT TGCGCACGGT CCCGACCTCG CCCCGGGCAC GCGGACAGCT
GAGCGGCGCG ACTATGGCGA AGTGGTGCTG ACCGGTCGGC TCCGCGACGC GCTTGCCCGC
CTCAACCCCG ACCTGCCTGT CGAGGCGCTG GACGACGCCT TCCGCCAGCT CACGCAGCCC
CAGGGCGCCG ATCTGATCCA GCGCAACCGC GCGCAGCACC GCCTGCTGGT GGACGGGGTG
ACGGTCGAGC ACCGCAATGC CGGGGGCGCG ATCTGCGGGG CGCAAGCCAG GGTGATTGAC
TTCGACGACC CGGCGAACAA CGACTGGCTC GCAGTCAATC AGTTCACGGT GGTGGAAAAC
AAGCACACGC GCCGTCCAGA CATAGTGCTG TTCGTCAACG GCCTGCCGCT GGCAGTGCTG
GAGCTGAAGA ACGCCGCCGA TGAAAAGGCG ACGATCTGGA CGGCCTGGCA ACAACTCCAG
ACCTATCAGG CTCAGATCCC CTCGTTGCTG GCTACCAACG CCGCACTCGT CATCTCCGAC
GGCCTGACAG CGCGCGTCGG CGCGCTCGGT GCGGGGCGCG AGTGGTTCAA GCCGTGGCGC
ACCATTCACG GCGAGGCGCT GGCGGATCCC CATATGCCCG AATTGCAGGT GGTGATCGAG
GGACTCTTCG CGCCGCGCCG TTTCCTCGAC TTCGTCCGTG ACTTCATCGT CTTCGAGGAC
GATGGCAGCG GACGGCTGAT CAAGAAGATG GCCGGGTACC ACCAGTTCCA TGCGGTGCAG
GTGGCAGTCG CCGAAACGCT GCGCGCCGCC GAACTGGCGC GCACTGATCA GGTGGCGGAG
GAGCGTGGGC GCTATGAGGC GGGACGCAGG CCCGGGGGCA AGCCTGGAGA TCGGCGTATC
GGCGTGGTGT GGCACACGCA GGGTTCGGGG AAGAGCCTGA CCATGGTGTT CTATGCTGGC
CGTATCGTCC GCGAGCCGGC AATGGAGAAC CCGACGATTG TCGTGCTCAC CGACCGCAAC
GACCTCGACG ACCAGCTCTT CGCCACCTTC GCGCGCTGCC GCGACCTGCT GCGCCAGCCG
CCGGTGCAGG CCGAGAGTCG CGTCCACCTG CGCCAGATTC TGTCTGTAGC AGCGGGCGGC
GTGGTGTTCA CCACGATTCA TAAGTTCTTC CCGGAAGAGA AAGGCGACCG CCACCCGACG
CTCTCCGAGC GGCGTAATAT CGTGGTGATC GCCGATGAGG CGCACCGCAG CCAGTACGAC
TTCATTGACG GCTTCGCGCG CCACATGCGC GACGCCCTGC CGAACGCTTC GTTCATCGGC
TTTACCGGCA CGCCGATTGA AAAGACCGAC GCCAACACCC GCGCGGTCTT CGGCGATTAC
ATCAGCATTT ACGACATCCA GCGCGCCGTC GAGGATGGCG TGACCGTGCC GATCTACTAC
GAGAGTCGCC TGGCGAAGCT TGCCCTCGAC GAGACGGAGC GACCGAAGAT CGACCCGGAC
TTCGAGGAGG TCACCGAAGG CGAGGAGGTC GAGCGCAAGG AGAGGCTCAA GAGTAAGTGG
GCGCAGTTCG AGGCCATCGT CGGCGCGGAG AAACGGCTCA GGCTGGTGGC GCGCGACATC
GTCGAGCATT TCGAGAAGCG GCTGGAGGCA CTGGACGGAA AGGCGATGAT CGTCTGTATG
AGCCGACGTA TCTGCGTCGA GCTGTATCGG GAGATTGCCC AATTGCGGCC GCAGTGGGCA
GCCGATGACG ACGAGCGGGG CGCGATGAAG GTGGTGATGA CCGGCTCGGC CTCCGACCCG
CTCGACTGGC AACCTCACAT CCGCAACAAA CCGCGACGCG AGGCGCTTGC CAACCGTTTT
CGCAATCCAG GCGATCCGTT CAGGCTCGTG ATTGTGCGCG ACATGTGGCT GACCGGCTTT
GACGCGCCCA GCCTGCACAC GATGTACATC GACAAGCCGA TGCGCGGCCA CGGTCTGATG
CAGGCCATTG CGCGCGTCAA CCGCGTCTTC CATGACAAGC CCGGCGGCCT GGTGGTGGAC
TACCTGGGGC TGGCGCACGA ACTCAAGGCT GCGCTCGCCA CCTATACCGA GAGCGGCGGT
ACCGGCCAGA CGACCGTCGA TCAACAAGAG GCCATAGCAC TGATGCAGGA AAAGTACGAG
ATCTGCTGTG GCATCCTGCA CGGCTTCGAC TGGTCGGATT GGATAAGCGG CGACCCGCAA
GCGCGTATCG GTCTGTTGCC CGCAGCGCAG GAGCAGGTGC TGTCGCGCGT GAACGGCAAG
GAGCGGTTCA TCCAGGCGGT ACGCGACCTG ACCAGGGCGT TCGCGCTCGC GACGCTGCAC
GAGGAGGCAA TCAAGATCCG CGACGATGTC GCCTTTTTCC AGGCGGTGCA GGCAGCGCTG
ACCAAACGCG CACCCGGCGA GATGCGTCCG GAGGATGAGC TGGATCACGC CGTCCGCCAG
ATCATCGCTC GCGCAGTCGC TCCTGCAGGC GTTATAGATA TCTTCACGGC AGCCGGGCTT
CAGAAGCCCG ACATCTCGAT TCTCTCCAAC GAATTCCTGG CCGAGGTGCG CGGCATGCCA
CAGCGAAACC TGGCGGTAGA AACGCTACAG AAACTGCTCA AGGGGGAGAT CAGTACGCGC
CAGCGAAAAA ATGTCGTCCA GGCGCGCTCC TTCGCCGAGA TGCTGGAGCA AACCATCCGC
CGCTACCAGA ACCGCGCCAT CGAGGCGGCG CAGGTGATCG AGGAGCTTAT CTCCCTGGCC
AGGGACATGC GCGAGGCCGA TGCGCGTGGC AAGAAGCTGG GACTCTCCGA GGAGGAACTG
GCTTTCTACG ACGCTTTGGA GACCAACGAT AGCGCGGTGA AGGTGCTGGG AGACGAGACA
TTGAGGACGA TCGCGCGCGA ATTGGTCAAG ACCGTGCGGA GCAACGTCAC TATCGACTGG
ACAATCCGTG AGAACGTCCG GGCGCAACTG CGCGTGTTGG TGAAGCGCAT TCTGCGCAAG
TACGGCTACC CCCCCGACAG GCAGTAG
 
Protein sequence
MMSERHFCES TIESAALARL EASGWQVAHG PDLAPGTRTA ERRDYGEVVL TGRLRDALAR 
LNPDLPVEAL DDAFRQLTQP QGADLIQRNR AQHRLLVDGV TVEHRNAGGA ICGAQARVID
FDDPANNDWL AVNQFTVVEN KHTRRPDIVL FVNGLPLAVL ELKNAADEKA TIWTAWQQLQ
TYQAQIPSLL ATNAALVISD GLTARVGALG AGREWFKPWR TIHGEALADP HMPELQVVIE
GLFAPRRFLD FVRDFIVFED DGSGRLIKKM AGYHQFHAVQ VAVAETLRAA ELARTDQVAE
ERGRYEAGRR PGGKPGDRRI GVVWHTQGSG KSLTMVFYAG RIVREPAMEN PTIVVLTDRN
DLDDQLFATF ARCRDLLRQP PVQAESRVHL RQILSVAAGG VVFTTIHKFF PEEKGDRHPT
LSERRNIVVI ADEAHRSQYD FIDGFARHMR DALPNASFIG FTGTPIEKTD ANTRAVFGDY
ISIYDIQRAV EDGVTVPIYY ESRLAKLALD ETERPKIDPD FEEVTEGEEV ERKERLKSKW
AQFEAIVGAE KRLRLVARDI VEHFEKRLEA LDGKAMIVCM SRRICVELYR EIAQLRPQWA
ADDDERGAMK VVMTGSASDP LDWQPHIRNK PRREALANRF RNPGDPFRLV IVRDMWLTGF
DAPSLHTMYI DKPMRGHGLM QAIARVNRVF HDKPGGLVVD YLGLAHELKA ALATYTESGG
TGQTTVDQQE AIALMQEKYE ICCGILHGFD WSDWISGDPQ ARIGLLPAAQ EQVLSRVNGK
ERFIQAVRDL TRAFALATLH EEAIKIRDDV AFFQAVQAAL TKRAPGEMRP EDELDHAVRQ
IIARAVAPAG VIDIFTAAGL QKPDISILSN EFLAEVRGMP QRNLAVETLQ KLLKGEISTR
QRKNVVQARS FAEMLEQTIR RYQNRAIEAA QVIEELISLA RDMREADARG KKLGLSEEEL
AFYDALETND SAVKVLGDET LRTIARELVK TVRSNVTIDW TIRENVRAQL RVLVKRILRK
YGYPPDRQ