Gene Information |
Locus tag | RoseRS_2687 |
Symbol | |
ID | 5209656 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 3338039 |
End bp | 3340978 |
Gene Length | 2940 bp |
Protein Length | 979 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640596289 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_001277011 |
Protein GI | 148656806 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family; [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
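
The coordinate and length fields above are mutually consistent, which the following minimal Python sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with one stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive positions on the replicon.
START_BP = 3338039
END_BP = 3340978


def gene_length_bp(start: int, end: int) -> int:
    """Length of an inclusive coordinate interval, in base pairs."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))
```

Under these assumptions the script reproduces the Gene Length (2940 bp) and Protein Length (979 aa) fields.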
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACTCTCT CTTCCGAACG CCGCGTTGTG CAGATCTCGC TCATCCGCTA CGCCGAAGAG GCTAGGTGGG AATACCTGTC GCCGGGGGAC GCCCTGAGCA GGCGGGGTGG CCTCAGTCAA CCATTCTTAC GGGATGTGTT AATGGCGAAG TTGCAGGAAC TCAACCCCGG CGTGATCACC TCCGCGGAAC AGGCCGACGA CGTCATCGCC CGCCTGGTGC GCCTCCGCCC CGACATTGAA GGCAATCGCG AAGCCTGGGA ATACCTCAAG GGGCTGAAGA CCGTCTTCGT GCAGGCTGAG CGCCGCGAGC GCAACCTGCG CCTGCTCGAC CCGCAGCAGG TCGAGGCTAA CACCTTCCAC GTCACCGATG AATTCCGCTT CCACATCGGC CCGCACAAAA TCCGCGCCGA TGTGGTCTTC CTGGTCAACG GCATCCCCGT CATCCTAGTT GAGACCAAAG CCGCCACCCG ACTGGAGGGC ATCGCCGAGG CCCTTGATCA GGTGCGCCGC TATCACCGCG AAGCGCCCGA CCTGCTGGTG CAAACGCAAC TCTTCGCCCT CACCCAACTG GTGGAATTCT TCTACGGCGC TACCTGGTCC CTTTCGCGCA AGGCGCTCTT CAACTGGCGG GAAGAAGTCG GGGCTAATTC TAATTCGCCC CGACCCGACT TTGAAACGCT GGTCAAATCC TTCGTCGCCC GGCGTCGCGT CCTGCGTGTG CTGACCGATT ATATCCTCTT CGCCCGCAAA GATGGCGAAC TCTCCAAGAT CGTCCTGCGT CCGCATCAAA TGCGCGCGGT GGAGCGCTGC CTATCGCGTG CCCGCGATCC GCAGAAACGT CGCGGCCTGA TCTGGCACAC GCAAGGCTCC GGCAAGACCT ACACCATGCT CAACCTGGCG CGCCTGCTGC TTGAAACGCC TGCATTCCAG AACCCCACCG TGCTGCTCAT CGTGGATCGC AACGAGCTGC AAAGTCAGCT CTTCCAGAAC CTGGAAGCGA TCGGCTTCGG GAAGGTACAC CTGGCGCTCT CCAAACGTCA CCTGCGCGAC CTGCTCGAAG CCGACACGCG CGGCGTCATC GTCTCGATGA TTCACAAATT CGACGACATC CCGGAGAACC TGAACGCCCG CGCCAACATC TTCGTGCTGG TGGATGAGGC GCATCGCAGC ACGGGCGGGG ATCTCGGCAA CTACCTGATG GGCGCGCTGC CCAACGCCAC TTTCATCGGC TTCACCGGCA CGCCCATCGA CCGCACCGCG CATGGCCAGG GCACCTTCAA GACCTTCGGC GCCGACGACC CGCAGGGGTA TCTCGACAAA TACTCCATCC GTGAATCCAT CGAGGACGGC ACGACTGTTC CGCTGCATTA TCAACTGGCG CCCAACGACC TGATCGCCAA CCGCGAAGCG ATGGAAAAAG ACTTTTGGGC AGCAGCAGGA CTGGAAGGTG TGGCCGACGT GGAGGAACTC AACCGCGTGC TGGACCGCGC CGTCACCCTG ACCAACATGC TCAAAAACCG CGAGCGGGTA GACAAAATCG CCCACTTCGT GGCCGGGCAT TTCCGGACAT ACGTCCAACC GATGGGCTAC AAAGCCTTCC TCGTGGCAGT TGACCGCGAG TCCTGCTGCT TCTACAAAGA AGCGCTCGAT CGGTTATACA AAGAAGCGCC CGACCGCTAC CTGCCACCTG AAGCCAGCGC CGTGGTCATC AGCGCCGGGC ACAACGATCC GCCGCACATT AAGCGCTATC ATCTGAGCGA GGATGAGGAA AGTCGCATCC GCAAAGCCTT CCGCAAGCCG 
GACGAGAACC CGCAGATCCT CATCGTCACC GAAAAACTGC TCACGGGCTA CGATGCGCCC ATCCTCTATT GCATGTATCT CGACAAACCC ATGCGCGACC ACGTGCTGCT GCAGGCGATT GCCCGCGTCA ACCGCCCTTA CGAAAGCGAG GATGGGCGGC GCAAGACCAC TGGCCTGATC TTGGACTTCG TCGGCGTCTT CGAGAACCTG GAGCGGGCGC TAGCCTTTGA CTCAAAGGAT GTCAGCGGCG TGGTCGAGGG GCTGGAGGTT CTACAGGAGC GTTTTGCCCA ATTGATGGCG CAAGGGCGAG CCGAGTACCT GCCGCTCACG GCTGGAAAAA CCGAAGACAA GGCTGCCGAG GCTGCACTGG AACATTTCCG CAACAAGGAG CACCGCGAAA CCTTCTATGC CTTCTTCCGC GAGGTGCAGG AGATCTACGA AATCCTCTCC CCCGATCCCT TCCTGCGTCC CTTTCTTGAA GACTATGAGC GGTTGGTGGA GATGTACCGC CTGGTGCGCA GCGCCTACGA GCCGCACGTG CCGGTGGACA AATCCTTCCT GCGCAAGACG GCGAAGATCG TCCAGCAGCA TACGCGGACC TCCCAGATTC GTGAGCCCCA GGCTACCTAC GAAATCGGGC CGGCGGCGCT GCTGGCTCTG CTGCACGAGG ACAAACCCGA TACGGTCAAA GTCTTCAACC TGCTCAAGGA ATTGCATTGC CTGGTTGCTG ACGAAGCGGC GTGCGCCCCT CATCTCATCC CCATCGGCGA ACGAGCAGAG GAGATCCGCC GCCACTTCGA GGAGCGCCTG ATCTCGGCTC AGGAGGCGCT ACAGCACCTG GACAAAGTCG TGCGTCAACT GCAAACCGCC CAGGAGGAAC GCCGCTCCAG CCCGCTCTCG CCACAAGCCT TTGCCGTCGA GTGGTGGCTG CGCACCCAGG GCACAGAAGC GGAAAAAGCA GCCCAAACCG CAGCCTCGCT GGAAGAAGCC TTTGCCCGTT TCCCAAACTG GACCATCAGC GTAGCCGATG AGCGCGAACT GCGCACCCGG CTTTACAAAG CCCTGCTGTC CCTGGGCGTC AAAGAAATCG TCGCCTGGGC GGATCGCATC CTCGACCTGT TGCGGAGGGC TATCGAATGA
Protein sequence | MTLSSERRVV QISLIRYAEE ARWEYLSPGD ALSRRGGLSQ PFLRDVLMAK LQELNPGVIT SAEQADDVIA RLVRLRPDIE GNREAWEYLK GLKTVFVQAE RRERNLRLLD PQQVEANTFH VTDEFRFHIG PHKIRADVVF LVNGIPVILV ETKAATRLEG IAEALDQVRR YHREAPDLLV QTQLFALTQL VEFFYGATWS LSRKALFNWR EEVGANSNSP RPDFETLVKS FVARRRVLRV LTDYILFARK DGELSKIVLR PHQMRAVERC LSRARDPQKR RGLIWHTQGS GKTYTMLNLA RLLLETPAFQ NPTVLLIVDR NELQSQLFQN LEAIGFGKVH LALSKRHLRD LLEADTRGVI VSMIHKFDDI PENLNARANI FVLVDEAHRS TGGDLGNYLM GALPNATFIG FTGTPIDRTA HGQGTFKTFG ADDPQGYLDK YSIRESIEDG TTVPLHYQLA PNDLIANREA MEKDFWAAAG LEGVADVEEL NRVLDRAVTL TNMLKNRERV DKIAHFVAGH FRTYVQPMGY KAFLVAVDRE SCCFYKEALD RLYKEAPDRY LPPEASAVVI SAGHNDPPHI KRYHLSEDEE SRIRKAFRKP DENPQILIVT EKLLTGYDAP ILYCMYLDKP MRDHVLLQAI ARVNRPYESE DGRRKTTGLI LDFVGVFENL ERALAFDSKD VSGVVEGLEV LQERFAQLMA QGRAEYLPLT AGKTEDKAAE AALEHFRNKE HRETFYAFFR EVQEIYEILS PDPFLRPFLE DYERLVEMYR LVRSAYEPHV PVDKSFLRKT AKIVQQHTRT SQIREPQATY EIGPAALLAL LHEDKPDTVK VFNLLKELHC LVADEAACAP HLIPIGERAE EIRRHFEERL ISAQEALQHL DKVVRQLQTA QEERRSSPLS PQAFAVEWWL RTQGTEAEKA AQTAASLEEA FARFPNWTIS VADERELRTR LYKALLSLGV KEIVAWADRI LDLLRRAIE
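
The gene and protein sequences above are printed in space-separated blocks of ten. The following sketch shows one way to strip that formatting, compute GC content, and translate the nucleotide sequence for comparison with the listed protein. It assumes the gene sequence is already given in coding orientation (it begins with ATG even though the Strand field is "-"), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. The helper names are illustrative; only the first 30 nucleotides are used here to keep the example short.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order. Translation table 11 uses the
# same codon-to-amino-acid assignments as the standard table (it differs only
# in the set of permitted start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
prefix = "ATGACTCTCT CTTCCGAACG CCGCGTTGTG"
print(translate(prefix))              # compare with the start of the protein sequence
print(round(gc_fraction(prefix), 2))  # GC fraction of this prefix only
```

Pasting the full gene sequence into `prefix` allows the same comparison against the complete protein sequence and the GC content field.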