Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_3651 |
Symbol | hsdR |
ID | 3972023 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 4061871 |
End bp | 4065251 |
Gene Length | 3381 bp |
Protein Length | 1126 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637926761 |
Product | type I restriction enzyme EcoKI subunit R |
Protein accession | YP_533505 |
Protein GI | 90425135 |
COG category | [V] Defense mechanisms |
COG ID | [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.408969 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGTCGT CGGCGAACTT CGGCTTTTTG GGCGATCATG ATGTCAGGCT GGCGCAGCTT GGGGCGCTAG CCGAGAGCTA TTTCCGTGAT GATCCCACCA CGTCGATCTT CAAGCTCCGG CAATTCGCGG AGCTGATGTC CAAGCTCATC GCGGCCCGTC ATGCGGCCTA TCGTGATGAG CGTGAGAGTT TCGAGGAGAC GCTCCGTCGC CTGTCTTTCG AACGGATCAT CCCCAAAGAG GCGGCCGACG TATTCCACGC CTTGCGAAAA AGCGGAAACC GGGCGGCCCA CGACATTGCC GGAACCCAAT CCGATGCCCT GACGGCACTC AAGCTCGCCC GCCAACTCGG AATCTGGTTT CACCGGACCT ACGGCAAGCA GCCCGACATT GCGTTTGGTG CGTTCGTCCC GCCCCCGGAG CCGATCGATG CCACGGTGGC GCTCAGAGAA GAGATTGCGG CCTTGGGTCG CCGGCTTACG GAAAGCGAAG ACGCCGCCGC CGGAGCCCAA CGCGAAGCCG AGGAGCATGC TCGCGCACGA GAAACCCTTG AGCAGCGGTT GGCTCGGGAA GCCGAAGAGA GGGCGATCTG GGAAAAGCTC GCGCAGGACA TCGAAAACGA GAAGGCCGAG ATTGCGGCCC GATTGGCGGC GCTCCAAGCG GTTGCCGAAC ATACGCCAGC GGCCGACATC CTCCAGCTCG TTCAGCGAGC CGAGGTCGCC TCTAGCAAGA TCGATCTTGA TGAAGCAACG ACGCGCGCCC TGATCGATCA GCAGCTTCGT GACGCCGGAT GGGACGCTGA CACGAAGGCC CTGCGGTACA GCGAGGGCGT GAGGCCATCC AAGGGGCGCA ATCTGGCGAT TGCCGAATGG CCGACATCGA GCGGTCCGGC TGACTACGCG CTGTTCATCG GCTTAACGCT GGTCGGCGTC GTCGAAGCCA AGCGCAAACG TAAGAACGTC TCCGCAGCAA TCGATCAGGC GGAGCGGTAC TCGGTCGGAC TGGCAGCACG AGCAGACTTT ACGTTCGCGG GCGGCCCTTG GACGGATCAC AAAGTCCCCT TCGTATTTGC TGCCAATGGT CGCTCCTATT TGAAGCAGCT TGAGACCGAA AGCGGCATCT GGTTTCGCGA CACGCGCCGT GCAGCCAACC ACCGTCGTGC TTTGGTGAGC TGGCTAACAC CCTCAGGGCT TTCAGGCCTT TTGGAGGTGG ATCAGGATGC CGCCACCGAC GCGCTCAAGA CGCTGCCTTT CGACTTCGGC TTTCCGCTGC GCGACTACCA GCAGGCCGCC ATCACGGCAA TTGAGAAAGG CCTTGAAGCT GAGCGGCGCT CCATGTTGCT GGCGATGGCA ACGGGCACGG GCAAGACCAA GCTCGCCATC GCAATGCTCT ATCGTCTGCT AACAACCAAG CGGTTTCGCC GCGTCTGCTT TGTGGTCGAT CGTTCGGCCC TCGGAATTCA GGCGGCGGGC GAGTTCACCA CCACCAAAAT CGTCTCCGGC AAAGCCTTCG CGGATATCTT CGGCCTCAAG AAGCTGGAGG ATGTTACGCC AGAGACCGAA ACCAAGGTCC ACATCTGCAC GATCCAAGGC CTAGTAAAGC GCGTCCTTTA TGCTGCTGAC ACTTCGGAAG CACCTCCGGT CGATCAGTAC GACCTCATTG TGATCGACGA GTGTCATCGC GGCTACCTGT TGGACCGCGA AATGTCGGAT GCCGAGCTAA GTTTCCGGGG GCAGGACGAC TACATTTCGA AATATCGTCG CGTGCTGGAA TACTTCGACG CCGTAAAGAT CGGCCTTACG GCGACCCCTG CCCTGCACAC CACGGAGATC TTCGGGGAGC CGATCTTCAA GTATTCCTAC CGGGATGCTG TGATCGAGGG GCATCTGATC GACCATGAAC CCCCGGTGCG CATCGAGACC GCGTTGGCAC GCGCCGGCAT CGTCTTTGAG AAGGATGAAC AGCTCGAACT TCTTAACACT CGGACGGGCG AAGTCGATCT TGCCCATGCG CCGGATGAAA TCCGCTTCGA GGTCGAGCAG TTCAACAAGC AGGTGGTGAC GCCGGACTTC AACCGGGCGG TTGCCGAAGA GTTGGCCAAG CACATCGACC CCGCCCTTCC AGGCAAGACG CTGATCTTCG CGGCCACGGA CGCGCATGCC GACATCGTCG TTGCTAAGAT CAAGGACGCC TTCGCGGCAG CCTATGGCGA GATTGACGAC GCGGCAGTCA AGAAGATCAC CGGCAGCGTG GACCGCGTAC AGAAACTCAT CCGGTCGTTC CGCAATGAAG CCAATCCGAA GATCGTGGTG ACAGTCGATC TTCTCACCAC CGGCATCGAT GTGCCGTCGA TCACCAACCT GGTGTTCCTG CGGCGGGTGA ATAGCCGCAT CCTCTATGAA CAGATGATCG GGCGGGCCAC GCGGCAATGT CCCGACATTG GCAAAGAGGT GTTCCGCATC TTCGATGCCG TCGATCTCTA TCCGCACTTG CAGAACCTCA CCGACATGAA GCCGGTGGTG GTCAACCCGT CGATCAGCTT CGAGCAACTG GTGAAGGAGT TGGTGGAAGC GGAGCAGCAC GCCCACCGCG AGAGCATCCG CGACCAGTTC GCCGTGAAGC TGCGCCGCCG GCTCAAGAAG CTGCCTGAGG AGGCGCGGGC GCGCTTTGAG GCCGCTGCCG GCGAGACGCC GGAGCAAACA CTGAAGCGGC TGCTTGAGAA CGGCCCGACG GAGTTCGCCA AATGGCTCTC CAGCCACTCC GCCATCGGTC CCATTCTCGA CTGGCAGAGT GACGGCGACA CTCCCCGCTT CATGCCGATC TCGCACCATC CCGATCAAGT CGTGGCGGTG ACGCGCGGCT ATGGCGAAGC GAGCAAGCCC GAAGACTTCC TTGACGGCTT CTCCGCCTTC GTCCGCGACA ACGTCAATAC CATCGCGGCC CTCAAGCTGG TGGTGCAACG CCCCCGCGAC CTGACCCGAG CCGGCCTGCA AGACCTGCGC AGGGCGCTCG ACCTAAAAGG ATTCTCGGAG GCCAATCTCC GCCGGGCTTG GGCCGACGCC AAGAATCAGG ACATCGCTGC GTCCATCATC GGTTTCGTGC GGCAGGCCGC CTTGGGCGAT CCGCTGACGC CCTACGGGGA TCGAGTCAGA GTTGCGATGC AGAAAGTCAT GGCCAGCCGC GCGTGGACGG AACCGCAAAA ACGATGGCTG ACGAGGATCG GCGAGCAGAT CACCAAGGAA ATCGTGGTAG ACCGCAGCAC GATCGACCGA GAGCCATTCA TCGCGGATGG TGGCTTCACG CGCCTCAACA AGGTATTTGG CGGCGAGCTT GAAGCGGTGC TTGCCGGGAT AAACGAAGAA ATGTGGAAGA AAACGGGCTG A
|
Protein sequence | MRSSANFGFL GDHDVRLAQL GALAESYFRD DPTTSIFKLR QFAELMSKLI AARHAAYRDE RESFEETLRR LSFERIIPKE AADVFHALRK SGNRAAHDIA GTQSDALTAL KLARQLGIWF HRTYGKQPDI AFGAFVPPPE PIDATVALRE EIAALGRRLT ESEDAAAGAQ REAEEHARAR ETLEQRLARE AEERAIWEKL AQDIENEKAE IAARLAALQA VAEHTPAADI LQLVQRAEVA SSKIDLDEAT TRALIDQQLR DAGWDADTKA LRYSEGVRPS KGRNLAIAEW PTSSGPADYA LFIGLTLVGV VEAKRKRKNV SAAIDQAERY SVGLAARADF TFAGGPWTDH KVPFVFAANG RSYLKQLETE SGIWFRDTRR AANHRRALVS WLTPSGLSGL LEVDQDAATD ALKTLPFDFG FPLRDYQQAA ITAIEKGLEA ERRSMLLAMA TGTGKTKLAI AMLYRLLTTK RFRRVCFVVD RSALGIQAAG EFTTTKIVSG KAFADIFGLK KLEDVTPETE TKVHICTIQG LVKRVLYAAD TSEAPPVDQY DLIVIDECHR GYLLDREMSD AELSFRGQDD YISKYRRVLE YFDAVKIGLT ATPALHTTEI FGEPIFKYSY RDAVIEGHLI DHEPPVRIET ALARAGIVFE KDEQLELLNT RTGEVDLAHA PDEIRFEVEQ FNKQVVTPDF NRAVAEELAK HIDPALPGKT LIFAATDAHA DIVVAKIKDA FAAAYGEIDD AAVKKITGSV DRVQKLIRSF RNEANPKIVV TVDLLTTGID VPSITNLVFL RRVNSRILYE QMIGRATRQC PDIGKEVFRI FDAVDLYPHL QNLTDMKPVV VNPSISFEQL VKELVEAEQH AHRESIRDQF AVKLRRRLKK LPEEARARFE AAAGETPEQT LKRLLENGPT EFAKWLSSHS AIGPILDWQS DGDTPRFMPI SHHPDQVVAV TRGYGEASKP EDFLDGFSAF VRDNVNTIAA LKLVVQRPRD LTRAGLQDLR RALDLKGFSE ANLRRAWADA KNQDIAASII GFVRQAALGD PLTPYGDRVR VAMQKVMASR AWTEPQKRWL TRIGEQITKE IVVDRSTIDR EPFIADGGFT RLNKVFGGEL EAVLAGINEE MWKKTG
|
| |