Gene RPC_3651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3651 
SymbolhsdR 
ID3972023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4061871 
End bp4065251 
Gene Length3381 bp 
Protein Length1126 aa 
Translation table11 
GC content61% 
IMG OID637926761 
Producttype I restriction enzyme EcoKI subunit R 
Protein accessionYP_533505 
Protein GI90425135 
COG category[V] Defense mechanisms 
COG ID[COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.408969 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGTCGT CGGCGAACTT CGGCTTTTTG GGCGATCATG ATGTCAGGCT GGCGCAGCTT 
GGGGCGCTAG CCGAGAGCTA TTTCCGTGAT GATCCCACCA CGTCGATCTT CAAGCTCCGG
CAATTCGCGG AGCTGATGTC CAAGCTCATC GCGGCCCGTC ATGCGGCCTA TCGTGATGAG
CGTGAGAGTT TCGAGGAGAC GCTCCGTCGC CTGTCTTTCG AACGGATCAT CCCCAAAGAG
GCGGCCGACG TATTCCACGC CTTGCGAAAA AGCGGAAACC GGGCGGCCCA CGACATTGCC
GGAACCCAAT CCGATGCCCT GACGGCACTC AAGCTCGCCC GCCAACTCGG AATCTGGTTT
CACCGGACCT ACGGCAAGCA GCCCGACATT GCGTTTGGTG CGTTCGTCCC GCCCCCGGAG
CCGATCGATG CCACGGTGGC GCTCAGAGAA GAGATTGCGG CCTTGGGTCG CCGGCTTACG
GAAAGCGAAG ACGCCGCCGC CGGAGCCCAA CGCGAAGCCG AGGAGCATGC TCGCGCACGA
GAAACCCTTG AGCAGCGGTT GGCTCGGGAA GCCGAAGAGA GGGCGATCTG GGAAAAGCTC
GCGCAGGACA TCGAAAACGA GAAGGCCGAG ATTGCGGCCC GATTGGCGGC GCTCCAAGCG
GTTGCCGAAC ATACGCCAGC GGCCGACATC CTCCAGCTCG TTCAGCGAGC CGAGGTCGCC
TCTAGCAAGA TCGATCTTGA TGAAGCAACG ACGCGCGCCC TGATCGATCA GCAGCTTCGT
GACGCCGGAT GGGACGCTGA CACGAAGGCC CTGCGGTACA GCGAGGGCGT GAGGCCATCC
AAGGGGCGCA ATCTGGCGAT TGCCGAATGG CCGACATCGA GCGGTCCGGC TGACTACGCG
CTGTTCATCG GCTTAACGCT GGTCGGCGTC GTCGAAGCCA AGCGCAAACG TAAGAACGTC
TCCGCAGCAA TCGATCAGGC GGAGCGGTAC TCGGTCGGAC TGGCAGCACG AGCAGACTTT
ACGTTCGCGG GCGGCCCTTG GACGGATCAC AAAGTCCCCT TCGTATTTGC TGCCAATGGT
CGCTCCTATT TGAAGCAGCT TGAGACCGAA AGCGGCATCT GGTTTCGCGA CACGCGCCGT
GCAGCCAACC ACCGTCGTGC TTTGGTGAGC TGGCTAACAC CCTCAGGGCT TTCAGGCCTT
TTGGAGGTGG ATCAGGATGC CGCCACCGAC GCGCTCAAGA CGCTGCCTTT CGACTTCGGC
TTTCCGCTGC GCGACTACCA GCAGGCCGCC ATCACGGCAA TTGAGAAAGG CCTTGAAGCT
GAGCGGCGCT CCATGTTGCT GGCGATGGCA ACGGGCACGG GCAAGACCAA GCTCGCCATC
GCAATGCTCT ATCGTCTGCT AACAACCAAG CGGTTTCGCC GCGTCTGCTT TGTGGTCGAT
CGTTCGGCCC TCGGAATTCA GGCGGCGGGC GAGTTCACCA CCACCAAAAT CGTCTCCGGC
AAAGCCTTCG CGGATATCTT CGGCCTCAAG AAGCTGGAGG ATGTTACGCC AGAGACCGAA
ACCAAGGTCC ACATCTGCAC GATCCAAGGC CTAGTAAAGC GCGTCCTTTA TGCTGCTGAC
ACTTCGGAAG CACCTCCGGT CGATCAGTAC GACCTCATTG TGATCGACGA GTGTCATCGC
GGCTACCTGT TGGACCGCGA AATGTCGGAT GCCGAGCTAA GTTTCCGGGG GCAGGACGAC
TACATTTCGA AATATCGTCG CGTGCTGGAA TACTTCGACG CCGTAAAGAT CGGCCTTACG
GCGACCCCTG CCCTGCACAC CACGGAGATC TTCGGGGAGC CGATCTTCAA GTATTCCTAC
CGGGATGCTG TGATCGAGGG GCATCTGATC GACCATGAAC CCCCGGTGCG CATCGAGACC
GCGTTGGCAC GCGCCGGCAT CGTCTTTGAG AAGGATGAAC AGCTCGAACT TCTTAACACT
CGGACGGGCG AAGTCGATCT TGCCCATGCG CCGGATGAAA TCCGCTTCGA GGTCGAGCAG
TTCAACAAGC AGGTGGTGAC GCCGGACTTC AACCGGGCGG TTGCCGAAGA GTTGGCCAAG
CACATCGACC CCGCCCTTCC AGGCAAGACG CTGATCTTCG CGGCCACGGA CGCGCATGCC
GACATCGTCG TTGCTAAGAT CAAGGACGCC TTCGCGGCAG CCTATGGCGA GATTGACGAC
GCGGCAGTCA AGAAGATCAC CGGCAGCGTG GACCGCGTAC AGAAACTCAT CCGGTCGTTC
CGCAATGAAG CCAATCCGAA GATCGTGGTG ACAGTCGATC TTCTCACCAC CGGCATCGAT
GTGCCGTCGA TCACCAACCT GGTGTTCCTG CGGCGGGTGA ATAGCCGCAT CCTCTATGAA
CAGATGATCG GGCGGGCCAC GCGGCAATGT CCCGACATTG GCAAAGAGGT GTTCCGCATC
TTCGATGCCG TCGATCTCTA TCCGCACTTG CAGAACCTCA CCGACATGAA GCCGGTGGTG
GTCAACCCGT CGATCAGCTT CGAGCAACTG GTGAAGGAGT TGGTGGAAGC GGAGCAGCAC
GCCCACCGCG AGAGCATCCG CGACCAGTTC GCCGTGAAGC TGCGCCGCCG GCTCAAGAAG
CTGCCTGAGG AGGCGCGGGC GCGCTTTGAG GCCGCTGCCG GCGAGACGCC GGAGCAAACA
CTGAAGCGGC TGCTTGAGAA CGGCCCGACG GAGTTCGCCA AATGGCTCTC CAGCCACTCC
GCCATCGGTC CCATTCTCGA CTGGCAGAGT GACGGCGACA CTCCCCGCTT CATGCCGATC
TCGCACCATC CCGATCAAGT CGTGGCGGTG ACGCGCGGCT ATGGCGAAGC GAGCAAGCCC
GAAGACTTCC TTGACGGCTT CTCCGCCTTC GTCCGCGACA ACGTCAATAC CATCGCGGCC
CTCAAGCTGG TGGTGCAACG CCCCCGCGAC CTGACCCGAG CCGGCCTGCA AGACCTGCGC
AGGGCGCTCG ACCTAAAAGG ATTCTCGGAG GCCAATCTCC GCCGGGCTTG GGCCGACGCC
AAGAATCAGG ACATCGCTGC GTCCATCATC GGTTTCGTGC GGCAGGCCGC CTTGGGCGAT
CCGCTGACGC CCTACGGGGA TCGAGTCAGA GTTGCGATGC AGAAAGTCAT GGCCAGCCGC
GCGTGGACGG AACCGCAAAA ACGATGGCTG ACGAGGATCG GCGAGCAGAT CACCAAGGAA
ATCGTGGTAG ACCGCAGCAC GATCGACCGA GAGCCATTCA TCGCGGATGG TGGCTTCACG
CGCCTCAACA AGGTATTTGG CGGCGAGCTT GAAGCGGTGC TTGCCGGGAT AAACGAAGAA
ATGTGGAAGA AAACGGGCTG A
 
Protein sequence
MRSSANFGFL GDHDVRLAQL GALAESYFRD DPTTSIFKLR QFAELMSKLI AARHAAYRDE 
RESFEETLRR LSFERIIPKE AADVFHALRK SGNRAAHDIA GTQSDALTAL KLARQLGIWF
HRTYGKQPDI AFGAFVPPPE PIDATVALRE EIAALGRRLT ESEDAAAGAQ REAEEHARAR
ETLEQRLARE AEERAIWEKL AQDIENEKAE IAARLAALQA VAEHTPAADI LQLVQRAEVA
SSKIDLDEAT TRALIDQQLR DAGWDADTKA LRYSEGVRPS KGRNLAIAEW PTSSGPADYA
LFIGLTLVGV VEAKRKRKNV SAAIDQAERY SVGLAARADF TFAGGPWTDH KVPFVFAANG
RSYLKQLETE SGIWFRDTRR AANHRRALVS WLTPSGLSGL LEVDQDAATD ALKTLPFDFG
FPLRDYQQAA ITAIEKGLEA ERRSMLLAMA TGTGKTKLAI AMLYRLLTTK RFRRVCFVVD
RSALGIQAAG EFTTTKIVSG KAFADIFGLK KLEDVTPETE TKVHICTIQG LVKRVLYAAD
TSEAPPVDQY DLIVIDECHR GYLLDREMSD AELSFRGQDD YISKYRRVLE YFDAVKIGLT
ATPALHTTEI FGEPIFKYSY RDAVIEGHLI DHEPPVRIET ALARAGIVFE KDEQLELLNT
RTGEVDLAHA PDEIRFEVEQ FNKQVVTPDF NRAVAEELAK HIDPALPGKT LIFAATDAHA
DIVVAKIKDA FAAAYGEIDD AAVKKITGSV DRVQKLIRSF RNEANPKIVV TVDLLTTGID
VPSITNLVFL RRVNSRILYE QMIGRATRQC PDIGKEVFRI FDAVDLYPHL QNLTDMKPVV
VNPSISFEQL VKELVEAEQH AHRESIRDQF AVKLRRRLKK LPEEARARFE AAAGETPEQT
LKRLLENGPT EFAKWLSSHS AIGPILDWQS DGDTPRFMPI SHHPDQVVAV TRGYGEASKP
EDFLDGFSAF VRDNVNTIAA LKLVVQRPRD LTRAGLQDLR RALDLKGFSE ANLRRAWADA
KNQDIAASII GFVRQAALGD PLTPYGDRVR VAMQKVMASR AWTEPQKRWL TRIGEQITKE
IVVDRSTIDR EPFIADGGFT RLNKVFGGEL EAVLAGINEE MWKKTG