Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3053 |
Symbol | |
ID | 3910854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3480367 |
End bp | 3483483 |
Gene Length | 3117 bp |
Protein Length | 1038 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637884960 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_486665 |
Protein GI | 86750169 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAGC AGACCACTCC CATAGCTGAG ACCAATCGCT TCATCGTGCT CGATAAATAT GTGCGCGCCT GGGAAGCCGC TGACAGCTAT CAGTCCGAGG CCGATCTTGA ACGCGAGCTG ATCCAGGACC TACGCAATCA GGGCTATGAA GTTCGGGCCG ATCTGAAGTC AACACACGCC ATGCTGGCCA ATGTGCGTGT TCAGTTGCAA GCGCTCAATG ATGTTCAGTT TACTGATAAG GAATGGGCGC GCTTCGTCGA AACCTATTTG GACAATCCAA GCGACAGCGC AACGGACAAG GCCCGCAAGC TCCATGACGA CTACATCTTC GATTTCGTCT TCGACAATGG GCGGATTCAG AATATCTACC TCGTCGACAA GGGCAATGTC AGCCGCAACA AGGTTCAGGT CATCAAGCAG TTCGAGCAGG CAGGAACGCA TGCGAATCGC TATGATGTAA CGATCCTCGT GAACGGTCTG CCGCTGCTGC AGGTCGAGCT GAAAAAGCGC GGCGTTGCCA TCCGCGAAGC GTTCAATCAG ATTCACCGCT ACAGCAAAGA GAGCTTCAAC TCTGACAGCT CGCTTTTCAA ATATCTCCAG ATCTTCGTGA TCTCCAATGG CACCGACACG CGCTATTTTG CCAACACGAC CAAGCGCGAC AAGAACAGCT TCGATTTCAC GATGAACTGG GCGCAGGCTG ACAACAGTCT GATCCGGGAC TTAAAGGATT TCACAGCGAC CTTCTTCGAG AAGCGCACCC TTTTGAAGGT GTTGCTGGAC TATTCCGTGT TTGATGTAAG CGATACCCTT TTGGTGATGC GCCCCTACCA GATCGCGGCG ACAGAACGCA TCCTAAGGAA AATCAGAAGC TCCTTCGAAG CCAAGACCTC TGAAGGCAAG ATTTTGGGCA AGCCCGAAAG CGGCGGATTT ATCTGGCACA CGACAGGTTC AGGCAAGACG CTGACCAGCT TTAAGGCGGC GCGCCTCGCC ACACAGCTCG ACTTCATCGA CAAGGTCTTT TTCGTGGTCG ACCGCAAAGA TCTCGATTAC CAGACGATGA AGGAGTATCA GCGCTTCTCT CCGGACAGCG TGAACGGTTC AGACAGTACC GCCGGTTTGA AGCGCAATCT GTCCAAGGAC GACAACAAAA TCATCGTGAC GACGATTCAG AAGCTCAACA ATCTGATGAA AAGCGAAGGC GACCTGCCTG TCTATAGCCA GCGCGTTGTC GTCATTTTCG ATGAGTGTCA CCGCAGCCAG TTTGGTGAGG CGCAGAAGAA CCTCAAACGC AAATTCAAAT CCTTCTGCCA GTTTGGCTTC ACTGGCACGC CGATCTTCCC TGAAAATGCT TCGGGTGGGG AGACAACGGC CAGCGTCTTC GGGAGCGAGC TTCACTCCTA CGTCATCACC GACGCCATCC GTGACGAAAA GGTGCTGAAA TTCAAGGTTG ATTATAACGA TGTGCGCCCG CAATTTAAGG TCATTGAGAC CGAGCAGGAT GAGAAGAAGC TGACTGCCGC TGAGAATAGG CAGGCGCTTC AGAATCCTCA GCGCATCGGC GAGATTTCCC AGTACATCCT AGACAACTTT CGCCGCAAAA CGCATCGACT GTATGGGGAC AATAAGGGCT TCAATGCCAT GTTTGCCGTG AGCAGTGTCG AGGCGGCAAA GCTCTATTAT GAGAGCCTGA ACAAGCTGCA GGCAGACAGC GATAAAACAT TGAAAATCGC GACGATCTTT TCATTTGCGG CTAATGAGGA GCAGGACGCA ATTGGCGACA TTCCCGACGA GAGCTTTGAG GTTTCCGCGC TAAACAGCAG CGCCAAGGAG TTCCTGAGCG CGGCCATCGC GGATTACAAT GCGTTTTTCA AAACGAACTT CAGCGTCGAT AGCAAAGGCT TCCAGAACTA TTATCGGGAT CTGGCCAAGC GGGTGAAGTC CAAAGAAGTG GATCTGCTCA TTGTGGTGGG CATGTTCCTG ACTGGCTTTG ATGCGCCGAC GCTCAACACG CTGTTCGTGG ACAAGAACCT GCGCTTTCAT GGTCTCATTC AGGCCTATTC CCGTACCAAC CGCATTTATG ATGCGACCAA GTCATTCGGA AACATCGTCA CCTTCCGCGA CCTTGAAGAG GCGACCGTCA AAGCGATTAC GCTGTTCGGT AATGCGAACA CCAGGAACGT CGTCCTCGAA AAAAGCTACT CGGAATACAT GGAGGGCTTT ACGGACCAGA CGACCGGAGA AGCAAGACGC GGCTTCATGG ATGTTGTGAA GGAGCTTGAA GAGCGTTTCC CCGACCCCTC GGCGATTGCA AAGGAGGCTG ACAAAAAGGC CTTTGCGAAG CTCTTCGGTG AGTATCTCCG TGTTGAGAAT ATCCTGCAGA ACTACGATGA ATTTGCCAAC CTCAGAGAGC TTCAGGAGAT CGATCAGAGG GATGCCGAAG CTCTAACAGC CTTCAAAGAA CGGCATCATC TGAGCGATGA TGACGTTGAA AACCTGAAGA CTGTCAAAAT GCCCGAAGAC CGGAGGATCC AGGACTACCG TTCGACCTAC AACGATATCC GGGACTGGCT TCGCCGAGAA AAGGCAAGCT CTGAGCAAGT CGAGTCGAAT ATTGATTGGG GCGATGTTGT CTTCGAGGTT GATCTGCTGA AATCCCAGGA AATCAATCTT GATTACATTC TTGAGCTCAT TTTTGAGCAT AATAAAAAGA CGAAAAGCAA GTCTGAGCTC GTCGGTGAAA TTCGCCGCGT CATCCGCGCG AGTATCGGCA ACCGTGCAAA GGAAAGCCTC ATCGTCGATT TCATAAATCA GACAAATCTC GATGAACTCG GGGACAAAGC GGGGGTGATC GAGGCCTTCT TTGCATTCGC ACAAGCGGAA CAACGCCGCG AAGCGGAAGA GCTCATTATC GGCGAAAAGC TGAATCCGGA CGCTGCGAAG CGCTATATCG CGACATCAAT TAAGCGAGAG TATGCGAGCG AGAACGGTAC GGAGCTGAAT TCGATCCTGC CAAAAATGAG CCCCCTCAAT CCGGAGTATC TCAACAAGAA ACGCAACGTG TTCCAGCGCA TTTCTGCTTT CGTTGAAAAG TTTAAAGGTG TTGGTGGAAA AATCTAA
|
Protein sequence | MPEQTTPIAE TNRFIVLDKY VRAWEAADSY QSEADLEREL IQDLRNQGYE VRADLKSTHA MLANVRVQLQ ALNDVQFTDK EWARFVETYL DNPSDSATDK ARKLHDDYIF DFVFDNGRIQ NIYLVDKGNV SRNKVQVIKQ FEQAGTHANR YDVTILVNGL PLLQVELKKR GVAIREAFNQ IHRYSKESFN SDSSLFKYLQ IFVISNGTDT RYFANTTKRD KNSFDFTMNW AQADNSLIRD LKDFTATFFE KRTLLKVLLD YSVFDVSDTL LVMRPYQIAA TERILRKIRS SFEAKTSEGK ILGKPESGGF IWHTTGSGKT LTSFKAARLA TQLDFIDKVF FVVDRKDLDY QTMKEYQRFS PDSVNGSDST AGLKRNLSKD DNKIIVTTIQ KLNNLMKSEG DLPVYSQRVV VIFDECHRSQ FGEAQKNLKR KFKSFCQFGF TGTPIFPENA SGGETTASVF GSELHSYVIT DAIRDEKVLK FKVDYNDVRP QFKVIETEQD EKKLTAAENR QALQNPQRIG EISQYILDNF RRKTHRLYGD NKGFNAMFAV SSVEAAKLYY ESLNKLQADS DKTLKIATIF SFAANEEQDA IGDIPDESFE VSALNSSAKE FLSAAIADYN AFFKTNFSVD SKGFQNYYRD LAKRVKSKEV DLLIVVGMFL TGFDAPTLNT LFVDKNLRFH GLIQAYSRTN RIYDATKSFG NIVTFRDLEE ATVKAITLFG NANTRNVVLE KSYSEYMEGF TDQTTGEARR GFMDVVKELE ERFPDPSAIA KEADKKAFAK LFGEYLRVEN ILQNYDEFAN LRELQEIDQR DAEALTAFKE RHHLSDDDVE NLKTVKMPED RRIQDYRSTY NDIRDWLRRE KASSEQVESN IDWGDVVFEV DLLKSQEINL DYILELIFEH NKKTKSKSEL VGEIRRVIRA SIGNRAKESL IVDFINQTNL DELGDKAGVI EAFFAFAQAE QRREAEELII GEKLNPDAAK RYIATSIKRE YASENGTELN SILPKMSPLN PEYLNKKRNV FQRISAFVEK FKGVGGKI
|
| |