Gene RoseRS_2687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2687 
Symbol 
ID5209656 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3338039 
End bp3340978 
Gene Length2940 bp 
Protein Length979 aa 
Translation table11 
GC content61% 
IMG OID640596289 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_001277011 
Protein GI148656806 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCTCT CTTCCGAACG CCGCGTTGTG CAGATCTCGC TCATCCGCTA CGCCGAAGAG 
GCTAGGTGGG AATACCTGTC GCCGGGGGAC GCCCTGAGCA GGCGGGGTGG CCTCAGTCAA
CCATTCTTAC GGGATGTGTT AATGGCGAAG TTGCAGGAAC TCAACCCCGG CGTGATCACC
TCCGCGGAAC AGGCCGACGA CGTCATCGCC CGCCTGGTGC GCCTCCGCCC CGACATTGAA
GGCAATCGCG AAGCCTGGGA ATACCTCAAG GGGCTGAAGA CCGTCTTCGT GCAGGCTGAG
CGCCGCGAGC GCAACCTGCG CCTGCTCGAC CCGCAGCAGG TCGAGGCTAA CACCTTCCAC
GTCACCGATG AATTCCGCTT CCACATCGGC CCGCACAAAA TCCGCGCCGA TGTGGTCTTC
CTGGTCAACG GCATCCCCGT CATCCTAGTT GAGACCAAAG CCGCCACCCG ACTGGAGGGC
ATCGCCGAGG CCCTTGATCA GGTGCGCCGC TATCACCGCG AAGCGCCCGA CCTGCTGGTG
CAAACGCAAC TCTTCGCCCT CACCCAACTG GTGGAATTCT TCTACGGCGC TACCTGGTCC
CTTTCGCGCA AGGCGCTCTT CAACTGGCGG GAAGAAGTCG GGGCTAATTC TAATTCGCCC
CGACCCGACT TTGAAACGCT GGTCAAATCC TTCGTCGCCC GGCGTCGCGT CCTGCGTGTG
CTGACCGATT ATATCCTCTT CGCCCGCAAA GATGGCGAAC TCTCCAAGAT CGTCCTGCGT
CCGCATCAAA TGCGCGCGGT GGAGCGCTGC CTATCGCGTG CCCGCGATCC GCAGAAACGT
CGCGGCCTGA TCTGGCACAC GCAAGGCTCC GGCAAGACCT ACACCATGCT CAACCTGGCG
CGCCTGCTGC TTGAAACGCC TGCATTCCAG AACCCCACCG TGCTGCTCAT CGTGGATCGC
AACGAGCTGC AAAGTCAGCT CTTCCAGAAC CTGGAAGCGA TCGGCTTCGG GAAGGTACAC
CTGGCGCTCT CCAAACGTCA CCTGCGCGAC CTGCTCGAAG CCGACACGCG CGGCGTCATC
GTCTCGATGA TTCACAAATT CGACGACATC CCGGAGAACC TGAACGCCCG CGCCAACATC
TTCGTGCTGG TGGATGAGGC GCATCGCAGC ACGGGCGGGG ATCTCGGCAA CTACCTGATG
GGCGCGCTGC CCAACGCCAC TTTCATCGGC TTCACCGGCA CGCCCATCGA CCGCACCGCG
CATGGCCAGG GCACCTTCAA GACCTTCGGC GCCGACGACC CGCAGGGGTA TCTCGACAAA
TACTCCATCC GTGAATCCAT CGAGGACGGC ACGACTGTTC CGCTGCATTA TCAACTGGCG
CCCAACGACC TGATCGCCAA CCGCGAAGCG ATGGAAAAAG ACTTTTGGGC AGCAGCAGGA
CTGGAAGGTG TGGCCGACGT GGAGGAACTC AACCGCGTGC TGGACCGCGC CGTCACCCTG
ACCAACATGC TCAAAAACCG CGAGCGGGTA GACAAAATCG CCCACTTCGT GGCCGGGCAT
TTCCGGACAT ACGTCCAACC GATGGGCTAC AAAGCCTTCC TCGTGGCAGT TGACCGCGAG
TCCTGCTGCT TCTACAAAGA AGCGCTCGAT CGGTTATACA AAGAAGCGCC CGACCGCTAC
CTGCCACCTG AAGCCAGCGC CGTGGTCATC AGCGCCGGGC ACAACGATCC GCCGCACATT
AAGCGCTATC ATCTGAGCGA GGATGAGGAA AGTCGCATCC GCAAAGCCTT CCGCAAGCCG
GACGAGAACC CGCAGATCCT CATCGTCACC GAAAAACTGC TCACGGGCTA CGATGCGCCC
ATCCTCTATT GCATGTATCT CGACAAACCC ATGCGCGACC ACGTGCTGCT GCAGGCGATT
GCCCGCGTCA ACCGCCCTTA CGAAAGCGAG GATGGGCGGC GCAAGACCAC TGGCCTGATC
TTGGACTTCG TCGGCGTCTT CGAGAACCTG GAGCGGGCGC TAGCCTTTGA CTCAAAGGAT
GTCAGCGGCG TGGTCGAGGG GCTGGAGGTT CTACAGGAGC GTTTTGCCCA ATTGATGGCG
CAAGGGCGAG CCGAGTACCT GCCGCTCACG GCTGGAAAAA CCGAAGACAA GGCTGCCGAG
GCTGCACTGG AACATTTCCG CAACAAGGAG CACCGCGAAA CCTTCTATGC CTTCTTCCGC
GAGGTGCAGG AGATCTACGA AATCCTCTCC CCCGATCCCT TCCTGCGTCC CTTTCTTGAA
GACTATGAGC GGTTGGTGGA GATGTACCGC CTGGTGCGCA GCGCCTACGA GCCGCACGTG
CCGGTGGACA AATCCTTCCT GCGCAAGACG GCGAAGATCG TCCAGCAGCA TACGCGGACC
TCCCAGATTC GTGAGCCCCA GGCTACCTAC GAAATCGGGC CGGCGGCGCT GCTGGCTCTG
CTGCACGAGG ACAAACCCGA TACGGTCAAA GTCTTCAACC TGCTCAAGGA ATTGCATTGC
CTGGTTGCTG ACGAAGCGGC GTGCGCCCCT CATCTCATCC CCATCGGCGA ACGAGCAGAG
GAGATCCGCC GCCACTTCGA GGAGCGCCTG ATCTCGGCTC AGGAGGCGCT ACAGCACCTG
GACAAAGTCG TGCGTCAACT GCAAACCGCC CAGGAGGAAC GCCGCTCCAG CCCGCTCTCG
CCACAAGCCT TTGCCGTCGA GTGGTGGCTG CGCACCCAGG GCACAGAAGC GGAAAAAGCA
GCCCAAACCG CAGCCTCGCT GGAAGAAGCC TTTGCCCGTT TCCCAAACTG GACCATCAGC
GTAGCCGATG AGCGCGAACT GCGCACCCGG CTTTACAAAG CCCTGCTGTC CCTGGGCGTC
AAAGAAATCG TCGCCTGGGC GGATCGCATC CTCGACCTGT TGCGGAGGGC TATCGAATGA
 
Protein sequence
MTLSSERRVV QISLIRYAEE ARWEYLSPGD ALSRRGGLSQ PFLRDVLMAK LQELNPGVIT 
SAEQADDVIA RLVRLRPDIE GNREAWEYLK GLKTVFVQAE RRERNLRLLD PQQVEANTFH
VTDEFRFHIG PHKIRADVVF LVNGIPVILV ETKAATRLEG IAEALDQVRR YHREAPDLLV
QTQLFALTQL VEFFYGATWS LSRKALFNWR EEVGANSNSP RPDFETLVKS FVARRRVLRV
LTDYILFARK DGELSKIVLR PHQMRAVERC LSRARDPQKR RGLIWHTQGS GKTYTMLNLA
RLLLETPAFQ NPTVLLIVDR NELQSQLFQN LEAIGFGKVH LALSKRHLRD LLEADTRGVI
VSMIHKFDDI PENLNARANI FVLVDEAHRS TGGDLGNYLM GALPNATFIG FTGTPIDRTA
HGQGTFKTFG ADDPQGYLDK YSIRESIEDG TTVPLHYQLA PNDLIANREA MEKDFWAAAG
LEGVADVEEL NRVLDRAVTL TNMLKNRERV DKIAHFVAGH FRTYVQPMGY KAFLVAVDRE
SCCFYKEALD RLYKEAPDRY LPPEASAVVI SAGHNDPPHI KRYHLSEDEE SRIRKAFRKP
DENPQILIVT EKLLTGYDAP ILYCMYLDKP MRDHVLLQAI ARVNRPYESE DGRRKTTGLI
LDFVGVFENL ERALAFDSKD VSGVVEGLEV LQERFAQLMA QGRAEYLPLT AGKTEDKAAE
AALEHFRNKE HRETFYAFFR EVQEIYEILS PDPFLRPFLE DYERLVEMYR LVRSAYEPHV
PVDKSFLRKT AKIVQQHTRT SQIREPQATY EIGPAALLAL LHEDKPDTVK VFNLLKELHC
LVADEAACAP HLIPIGERAE EIRRHFEERL ISAQEALQHL DKVVRQLQTA QEERRSSPLS
PQAFAVEWWL RTQGTEAEKA AQTAASLEEA FARFPNWTIS VADERELRTR LYKALLSLGV
KEIVAWADRI LDLLRRAIE