Gene RPB_3053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3053 
Symbol 
ID3910854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3480367 
End bp3483483 
Gene Length3117 bp 
Protein Length1038 aa 
Translation table11 
GC content52% 
IMG OID637884960 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_486665 
Protein GI86750169 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGAGC AGACCACTCC CATAGCTGAG ACCAATCGCT TCATCGTGCT CGATAAATAT 
GTGCGCGCCT GGGAAGCCGC TGACAGCTAT CAGTCCGAGG CCGATCTTGA ACGCGAGCTG
ATCCAGGACC TACGCAATCA GGGCTATGAA GTTCGGGCCG ATCTGAAGTC AACACACGCC
ATGCTGGCCA ATGTGCGTGT TCAGTTGCAA GCGCTCAATG ATGTTCAGTT TACTGATAAG
GAATGGGCGC GCTTCGTCGA AACCTATTTG GACAATCCAA GCGACAGCGC AACGGACAAG
GCCCGCAAGC TCCATGACGA CTACATCTTC GATTTCGTCT TCGACAATGG GCGGATTCAG
AATATCTACC TCGTCGACAA GGGCAATGTC AGCCGCAACA AGGTTCAGGT CATCAAGCAG
TTCGAGCAGG CAGGAACGCA TGCGAATCGC TATGATGTAA CGATCCTCGT GAACGGTCTG
CCGCTGCTGC AGGTCGAGCT GAAAAAGCGC GGCGTTGCCA TCCGCGAAGC GTTCAATCAG
ATTCACCGCT ACAGCAAAGA GAGCTTCAAC TCTGACAGCT CGCTTTTCAA ATATCTCCAG
ATCTTCGTGA TCTCCAATGG CACCGACACG CGCTATTTTG CCAACACGAC CAAGCGCGAC
AAGAACAGCT TCGATTTCAC GATGAACTGG GCGCAGGCTG ACAACAGTCT GATCCGGGAC
TTAAAGGATT TCACAGCGAC CTTCTTCGAG AAGCGCACCC TTTTGAAGGT GTTGCTGGAC
TATTCCGTGT TTGATGTAAG CGATACCCTT TTGGTGATGC GCCCCTACCA GATCGCGGCG
ACAGAACGCA TCCTAAGGAA AATCAGAAGC TCCTTCGAAG CCAAGACCTC TGAAGGCAAG
ATTTTGGGCA AGCCCGAAAG CGGCGGATTT ATCTGGCACA CGACAGGTTC AGGCAAGACG
CTGACCAGCT TTAAGGCGGC GCGCCTCGCC ACACAGCTCG ACTTCATCGA CAAGGTCTTT
TTCGTGGTCG ACCGCAAAGA TCTCGATTAC CAGACGATGA AGGAGTATCA GCGCTTCTCT
CCGGACAGCG TGAACGGTTC AGACAGTACC GCCGGTTTGA AGCGCAATCT GTCCAAGGAC
GACAACAAAA TCATCGTGAC GACGATTCAG AAGCTCAACA ATCTGATGAA AAGCGAAGGC
GACCTGCCTG TCTATAGCCA GCGCGTTGTC GTCATTTTCG ATGAGTGTCA CCGCAGCCAG
TTTGGTGAGG CGCAGAAGAA CCTCAAACGC AAATTCAAAT CCTTCTGCCA GTTTGGCTTC
ACTGGCACGC CGATCTTCCC TGAAAATGCT TCGGGTGGGG AGACAACGGC CAGCGTCTTC
GGGAGCGAGC TTCACTCCTA CGTCATCACC GACGCCATCC GTGACGAAAA GGTGCTGAAA
TTCAAGGTTG ATTATAACGA TGTGCGCCCG CAATTTAAGG TCATTGAGAC CGAGCAGGAT
GAGAAGAAGC TGACTGCCGC TGAGAATAGG CAGGCGCTTC AGAATCCTCA GCGCATCGGC
GAGATTTCCC AGTACATCCT AGACAACTTT CGCCGCAAAA CGCATCGACT GTATGGGGAC
AATAAGGGCT TCAATGCCAT GTTTGCCGTG AGCAGTGTCG AGGCGGCAAA GCTCTATTAT
GAGAGCCTGA ACAAGCTGCA GGCAGACAGC GATAAAACAT TGAAAATCGC GACGATCTTT
TCATTTGCGG CTAATGAGGA GCAGGACGCA ATTGGCGACA TTCCCGACGA GAGCTTTGAG
GTTTCCGCGC TAAACAGCAG CGCCAAGGAG TTCCTGAGCG CGGCCATCGC GGATTACAAT
GCGTTTTTCA AAACGAACTT CAGCGTCGAT AGCAAAGGCT TCCAGAACTA TTATCGGGAT
CTGGCCAAGC GGGTGAAGTC CAAAGAAGTG GATCTGCTCA TTGTGGTGGG CATGTTCCTG
ACTGGCTTTG ATGCGCCGAC GCTCAACACG CTGTTCGTGG ACAAGAACCT GCGCTTTCAT
GGTCTCATTC AGGCCTATTC CCGTACCAAC CGCATTTATG ATGCGACCAA GTCATTCGGA
AACATCGTCA CCTTCCGCGA CCTTGAAGAG GCGACCGTCA AAGCGATTAC GCTGTTCGGT
AATGCGAACA CCAGGAACGT CGTCCTCGAA AAAAGCTACT CGGAATACAT GGAGGGCTTT
ACGGACCAGA CGACCGGAGA AGCAAGACGC GGCTTCATGG ATGTTGTGAA GGAGCTTGAA
GAGCGTTTCC CCGACCCCTC GGCGATTGCA AAGGAGGCTG ACAAAAAGGC CTTTGCGAAG
CTCTTCGGTG AGTATCTCCG TGTTGAGAAT ATCCTGCAGA ACTACGATGA ATTTGCCAAC
CTCAGAGAGC TTCAGGAGAT CGATCAGAGG GATGCCGAAG CTCTAACAGC CTTCAAAGAA
CGGCATCATC TGAGCGATGA TGACGTTGAA AACCTGAAGA CTGTCAAAAT GCCCGAAGAC
CGGAGGATCC AGGACTACCG TTCGACCTAC AACGATATCC GGGACTGGCT TCGCCGAGAA
AAGGCAAGCT CTGAGCAAGT CGAGTCGAAT ATTGATTGGG GCGATGTTGT CTTCGAGGTT
GATCTGCTGA AATCCCAGGA AATCAATCTT GATTACATTC TTGAGCTCAT TTTTGAGCAT
AATAAAAAGA CGAAAAGCAA GTCTGAGCTC GTCGGTGAAA TTCGCCGCGT CATCCGCGCG
AGTATCGGCA ACCGTGCAAA GGAAAGCCTC ATCGTCGATT TCATAAATCA GACAAATCTC
GATGAACTCG GGGACAAAGC GGGGGTGATC GAGGCCTTCT TTGCATTCGC ACAAGCGGAA
CAACGCCGCG AAGCGGAAGA GCTCATTATC GGCGAAAAGC TGAATCCGGA CGCTGCGAAG
CGCTATATCG CGACATCAAT TAAGCGAGAG TATGCGAGCG AGAACGGTAC GGAGCTGAAT
TCGATCCTGC CAAAAATGAG CCCCCTCAAT CCGGAGTATC TCAACAAGAA ACGCAACGTG
TTCCAGCGCA TTTCTGCTTT CGTTGAAAAG TTTAAAGGTG TTGGTGGAAA AATCTAA
 
Protein sequence
MPEQTTPIAE TNRFIVLDKY VRAWEAADSY QSEADLEREL IQDLRNQGYE VRADLKSTHA 
MLANVRVQLQ ALNDVQFTDK EWARFVETYL DNPSDSATDK ARKLHDDYIF DFVFDNGRIQ
NIYLVDKGNV SRNKVQVIKQ FEQAGTHANR YDVTILVNGL PLLQVELKKR GVAIREAFNQ
IHRYSKESFN SDSSLFKYLQ IFVISNGTDT RYFANTTKRD KNSFDFTMNW AQADNSLIRD
LKDFTATFFE KRTLLKVLLD YSVFDVSDTL LVMRPYQIAA TERILRKIRS SFEAKTSEGK
ILGKPESGGF IWHTTGSGKT LTSFKAARLA TQLDFIDKVF FVVDRKDLDY QTMKEYQRFS
PDSVNGSDST AGLKRNLSKD DNKIIVTTIQ KLNNLMKSEG DLPVYSQRVV VIFDECHRSQ
FGEAQKNLKR KFKSFCQFGF TGTPIFPENA SGGETTASVF GSELHSYVIT DAIRDEKVLK
FKVDYNDVRP QFKVIETEQD EKKLTAAENR QALQNPQRIG EISQYILDNF RRKTHRLYGD
NKGFNAMFAV SSVEAAKLYY ESLNKLQADS DKTLKIATIF SFAANEEQDA IGDIPDESFE
VSALNSSAKE FLSAAIADYN AFFKTNFSVD SKGFQNYYRD LAKRVKSKEV DLLIVVGMFL
TGFDAPTLNT LFVDKNLRFH GLIQAYSRTN RIYDATKSFG NIVTFRDLEE ATVKAITLFG
NANTRNVVLE KSYSEYMEGF TDQTTGEARR GFMDVVKELE ERFPDPSAIA KEADKKAFAK
LFGEYLRVEN ILQNYDEFAN LRELQEIDQR DAEALTAFKE RHHLSDDDVE NLKTVKMPED
RRIQDYRSTY NDIRDWLRRE KASSEQVESN IDWGDVVFEV DLLKSQEINL DYILELIFEH
NKKTKSKSEL VGEIRRVIRA SIGNRAKESL IVDFINQTNL DELGDKAGVI EAFFAFAQAE
QRREAEELII GEKLNPDAAK RYIATSIKRE YASENGTELN SILPKMSPLN PEYLNKKRNV
FQRISAFVEK FKGVGGKI