Gene Information |
Locus tag | RoseRS_0281 |
Symbol | |
ID | 5207216 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 356471 |
End bp | 359557 |
Gene Length | 3087 bp |
Protein Length | 1028 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640593907 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_001274663 |
Protein GI | 148654458 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
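
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is complete and ends in a single stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 356471
END_BP = 359557
GENE_LENGTH_BP = 3087
PROTEIN_LENGTH_AA = 1028


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the listed gene length (3087 bp) and protein length (1028 aa) agree with the coordinates, since 3087 / 3 = 1029 codons, one of which is the stop codon.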
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000241298 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
Sequence |
Gene sequence | ATGATGAGTG AGCGTCACTT CTGTGAATCC ACCATCGAAT CCGCTGCCCT CGCCCGGCTA GAAGCTTCTG GCTGGCAGGT TGCGCACGGT CCCGACCTCG CCCCGGGCAC GCGGACAGCT GAGCGGCGCG ACTATGGCGA AGTGGTGCTG ACCGGTCGGC TCCGCGACGC GCTTGCCCGC CTCAACCCCG ACCTGCCTGT CGAGGCGCTG GACGACGCCT TCCGCCAGCT CACGCAGCCC CAGGGCGCCG ATCTGATCCA GCGCAACCGC GCGCAGCACC GCCTGCTGGT GGACGGGGTG ACGGTCGAGC ACCGCAATGC CGGGGGCGCG ATCTGCGGGG CGCAAGCCAG GGTGATTGAC TTCGACGACC CGGCGAACAA CGACTGGCTC GCAGTCAATC AGTTCACGGT GGTGGAAAAC AAGCACACGC GCCGTCCAGA CATAGTGCTG TTCGTCAACG GCCTGCCGCT GGCAGTGCTG GAGCTGAAGA ACGCCGCCGA TGAAAAGGCG ACGATCTGGA CGGCCTGGCA ACAACTCCAG ACCTATCAGG CTCAGATCCC CTCGTTGCTG GCTACCAACG CCGCACTCGT CATCTCCGAC GGCCTGACAG CGCGCGTCGG CGCGCTCGGT GCGGGGCGCG AGTGGTTCAA GCCGTGGCGC ACCATTCACG GCGAGGCGCT GGCGGATCCC CATATGCCCG AATTGCAGGT GGTGATCGAG GGACTCTTCG CGCCGCGCCG TTTCCTCGAC TTCGTCCGTG ACTTCATCGT CTTCGAGGAC GATGGCAGCG GACGGCTGAT CAAGAAGATG GCCGGGTACC ACCAGTTCCA TGCGGTGCAG GTGGCAGTCG CCGAAACGCT GCGCGCCGCC GAACTGGCGC GCACTGATCA GGTGGCGGAG GAGCGTGGGC GCTATGAGGC GGGACGCAGG CCCGGGGGCA AGCCTGGAGA TCGGCGTATC GGCGTGGTGT GGCACACGCA GGGTTCGGGG AAGAGCCTGA CCATGGTGTT CTATGCTGGC CGTATCGTCC GCGAGCCGGC AATGGAGAAC CCGACGATTG TCGTGCTCAC CGACCGCAAC GACCTCGACG ACCAGCTCTT CGCCACCTTC GCGCGCTGCC GCGACCTGCT GCGCCAGCCG CCGGTGCAGG CCGAGAGTCG CGTCCACCTG CGCCAGATTC TGTCTGTAGC AGCGGGCGGC GTGGTGTTCA CCACGATTCA TAAGTTCTTC CCGGAAGAGA AAGGCGACCG CCACCCGACG CTCTCCGAGC GGCGTAATAT CGTGGTGATC GCCGATGAGG CGCACCGCAG CCAGTACGAC TTCATTGACG GCTTCGCGCG CCACATGCGC GACGCCCTGC CGAACGCTTC GTTCATCGGC TTTACCGGCA CGCCGATTGA AAAGACCGAC GCCAACACCC GCGCGGTCTT CGGCGATTAC ATCAGCATTT ACGACATCCA GCGCGCCGTC GAGGATGGCG TGACCGTGCC GATCTACTAC GAGAGTCGCC TGGCGAAGCT TGCCCTCGAC GAGACGGAGC GACCGAAGAT CGACCCGGAC TTCGAGGAGG TCACCGAAGG CGAGGAGGTC GAGCGCAAGG AGAGGCTCAA GAGTAAGTGG GCGCAGTTCG AGGCCATCGT CGGCGCGGAG AAACGGCTCA GGCTGGTGGC GCGCGACATC GTCGAGCATT TCGAGAAGCG GCTGGAGGCA CTGGACGGAA AGGCGATGAT CGTCTGTATG AGCCGACGTA TCTGCGTCGA GCTGTATCGG GAGATTGCCC AATTGCGGCC GCAGTGGGCA 
GCCGATGACG ACGAGCGGGG CGCGATGAAG GTGGTGATGA CCGGCTCGGC CTCCGACCCG CTCGACTGGC AACCTCACAT CCGCAACAAA CCGCGACGCG AGGCGCTTGC CAACCGTTTT CGCAATCCAG GCGATCCGTT CAGGCTCGTG ATTGTGCGCG ACATGTGGCT GACCGGCTTT GACGCGCCCA GCCTGCACAC GATGTACATC GACAAGCCGA TGCGCGGCCA CGGTCTGATG CAGGCCATTG CGCGCGTCAA CCGCGTCTTC CATGACAAGC CCGGCGGCCT GGTGGTGGAC TACCTGGGGC TGGCGCACGA ACTCAAGGCT GCGCTCGCCA CCTATACCGA GAGCGGCGGT ACCGGCCAGA CGACCGTCGA TCAACAAGAG GCCATAGCAC TGATGCAGGA AAAGTACGAG ATCTGCTGTG GCATCCTGCA CGGCTTCGAC TGGTCGGATT GGATAAGCGG CGACCCGCAA GCGCGTATCG GTCTGTTGCC CGCAGCGCAG GAGCAGGTGC TGTCGCGCGT GAACGGCAAG GAGCGGTTCA TCCAGGCGGT ACGCGACCTG ACCAGGGCGT TCGCGCTCGC GACGCTGCAC GAGGAGGCAA TCAAGATCCG CGACGATGTC GCCTTTTTCC AGGCGGTGCA GGCAGCGCTG ACCAAACGCG CACCCGGCGA GATGCGTCCG GAGGATGAGC TGGATCACGC CGTCCGCCAG ATCATCGCTC GCGCAGTCGC TCCTGCAGGC GTTATAGATA TCTTCACGGC AGCCGGGCTT CAGAAGCCCG ACATCTCGAT TCTCTCCAAC GAATTCCTGG CCGAGGTGCG CGGCATGCCA CAGCGAAACC TGGCGGTAGA AACGCTACAG AAACTGCTCA AGGGGGAGAT CAGTACGCGC CAGCGAAAAA ATGTCGTCCA GGCGCGCTCC TTCGCCGAGA TGCTGGAGCA AACCATCCGC CGCTACCAGA ACCGCGCCAT CGAGGCGGCG CAGGTGATCG AGGAGCTTAT CTCCCTGGCC AGGGACATGC GCGAGGCCGA TGCGCGTGGC AAGAAGCTGG GACTCTCCGA GGAGGAACTG GCTTTCTACG ACGCTTTGGA GACCAACGAT AGCGCGGTGA AGGTGCTGGG AGACGAGACA TTGAGGACGA TCGCGCGCGA ATTGGTCAAG ACCGTGCGGA GCAACGTCAC TATCGACTGG ACAATCCGTG AGAACGTCCG GGCGCAACTG CGCGTGTTGG TGAAGCGCAT TCTGCGCAAG TACGGCTACC CCCCCGACAG GCAGTAG
Protein sequence | MMSERHFCES TIESAALARL EASGWQVAHG PDLAPGTRTA ERRDYGEVVL TGRLRDALAR LNPDLPVEAL DDAFRQLTQP QGADLIQRNR AQHRLLVDGV TVEHRNAGGA ICGAQARVID FDDPANNDWL AVNQFTVVEN KHTRRPDIVL FVNGLPLAVL ELKNAADEKA TIWTAWQQLQ TYQAQIPSLL ATNAALVISD GLTARVGALG AGREWFKPWR TIHGEALADP HMPELQVVIE GLFAPRRFLD FVRDFIVFED DGSGRLIKKM AGYHQFHAVQ VAVAETLRAA ELARTDQVAE ERGRYEAGRR PGGKPGDRRI GVVWHTQGSG KSLTMVFYAG RIVREPAMEN PTIVVLTDRN DLDDQLFATF ARCRDLLRQP PVQAESRVHL RQILSVAAGG VVFTTIHKFF PEEKGDRHPT LSERRNIVVI ADEAHRSQYD FIDGFARHMR DALPNASFIG FTGTPIEKTD ANTRAVFGDY ISIYDIQRAV EDGVTVPIYY ESRLAKLALD ETERPKIDPD FEEVTEGEEV ERKERLKSKW AQFEAIVGAE KRLRLVARDI VEHFEKRLEA LDGKAMIVCM SRRICVELYR EIAQLRPQWA ADDDERGAMK VVMTGSASDP LDWQPHIRNK PRREALANRF RNPGDPFRLV IVRDMWLTGF DAPSLHTMYI DKPMRGHGLM QAIARVNRVF HDKPGGLVVD YLGLAHELKA ALATYTESGG TGQTTVDQQE AIALMQEKYE ICCGILHGFD WSDWISGDPQ ARIGLLPAAQ EQVLSRVNGK ERFIQAVRDL TRAFALATLH EEAIKIRDDV AFFQAVQAAL TKRAPGEMRP EDELDHAVRQ IIARAVAPAG VIDIFTAAGL QKPDISILSN EFLAEVRGMP QRNLAVETLQ KLLKGEISTR QRKNVVQARS FAEMLEQTIR RYQNRAIEAA QVIEELISLA RDMREADARG KKLGLSEEEL AFYDALETND SAVKVLGDET LRTIARELVK TVRSNVTIDW TIRENVRAQL RVLVKRILRK YGYPPDRQ
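
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any length or composition check. The following is a minimal sketch of one way to do that. The helper names are hypothetical, and the GC fraction here is the simple (G + C) / length ratio, which may differ from how the database computed its reported GC content.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, used only as a short demonstration.
first_blocks = "ATGATGAGTG AGCGTCACTT"
seq = clean_sequence(first_blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # GC fraction of these 20 bases only
```

Running the same two functions on the full gene sequence would allow a comparison against the Gene Length and GC content fields of this record.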