Gene Information |
Locus tag | RPD_0013 |
Symbol | |
ID | 4020467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 15807 |
End bp | 18746 |
Gene Length | 2940 bp |
Protein Length | 979 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637960189 |
Product | DNA methylase containing a Zn-ribbon |
Protein accession | YP_567154 |
Protein GI | 91974495 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon |
TIGRFAM ID | |
| |
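The coordinate and length fields in this record are related by simple arithmetic, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above (RPD_0013).
length = gene_length(15807, 18746)
print(length)                  # 2940
print(protein_length(length))  # 979
```

Both results agree with the Gene Length (2940 bp) and Protein Length (979 aa) fields, which is consistent with an unspliced, non-pseudo CDS.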
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.607191 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGCCA AGGTCATGAA GTCGAAGGTT GTCCCATTCT CGCTGAAGGA CGCGCCATCG TTGATCGAAC GCGTGTGGCC GGCGCAAATG ATCTCTGTTG AGGCTCAACG CGAGCGCAAG GCTGTTCATG GTCAAACCCT GACCAGCCTC GGATCATACT GGAAAGGGCG CAAACCGTTG GTGCTCGTCA GGGCGTGCAT CCTTGGAGCT TTGTTGCCAT CCACTGGAGA CGACGAGAAG GATCTGGCGA TCTTTGAAAA GCTTATGGGA ATGAGCGACG ACCAAGTCGC TTGTCGATTC AAGGAAAAGG TGTCGGTCGA AGAAGTCCAA AAATTCGGAA CTGTTTCTCA ACAAAGTGCG CTGATCGACG ACGATGGCCA AGGGACAACG CTTAGGAAGC TCCCGAAAGC ACAGCGCGAA TCATTGATGG GATCGGTGAT CCAGCGCATG CCGTATGATG TGCGCGCCCA GAAGCTTCTG CGTCCCGAAG AGGTCGATGA AAGTGTCTTA ACCGGGCCGG CGCTCGATGA AGTCAACAAG CATCTTGGAA CTGACGCGAG CAATCTGTCT GAACTGCTCG AACAGTTGGG GCTCATGCGG TTCGGTAGAC GCCCGGTCGT GGTCGACACT TTTTCGGGTA GCGGATCAAT ACCATTCGAG GCTGCCAGGC TGGGGTGCGA TGTCTTTGCC TCAGATCTCA ATCCAATTGC CTGCATGTTG ACGTGGGCAG CACTAAACGT AGTCGGTGGC TCGGACGCGT CGAGGAAAGA AATCCAGCAG GAGCAAGCGG CTCTAGCAAA GGCTGTCGAT GAGGAGATCA CCCGGCTCGG GATAGAACAC GATAGCCACG GCAATCGCGC CAAGGTCTTT TTGTACTGCC TTGAGGCGCG CTGTCCTCAA ACCCAATGGT TGGTCCCGCT TGCGCCTACT TTTGTTGTTT CCGAAGCAAG AAGAGTGGTC GCTCGTCTTG TTCCGAACAA ATCCAAGAAA CAGTTCGATC TGGAAATTGT TTCTGGTGCA AATGCCAAGG AGATGGCCGA GGCAGCCGCT GGAACGGTGC GCGACGGAAA GATGGTTTAC TCGCTGGACG GCGAAACTTA CTCCACACCA ATAAAGACGA TTCGGGGCGA TCGCCGCGAA GCCGGGGGGA TCGCGGCGAA CAATCTGCGA CTCTGGGAAA AGTCAGATGT TGCTCCGAGG TCGGGCGATA TTCTTAGCGA GCGTCTCTAC TGCATCCAGT GGATCGAGCA GGAGACCGTG GGAAAAAGTC GGCAGGTCAC GTTTTTTGCC TCGCCTACTG CCGCCGACCT TAAGCGAGAA TTGAAGGTAA AGAACCTCGT CAGTCACAAA CTTAAGGAGT GGCAGGAGGG TGGGTTCGTC TCGGACATGG AGATCGAAAC GGGCAAGGAA AATGAAGGGC CGATCCGCAC TAACGGCTGG AAGTTTTGGC ACCAGATGTT CATGCCAAGG GCGATACTGA CCGCGGCGCT CATCGCTGAG CACGGCCAAG GTCGTGCCCA TGCCGCACTG TCTCTTTGCA AGTACCTAGA TAACAATTCA AAATCGTGCC GTTGGAAAGT GTCGCAGTCA GGAGGAGACG GCGGCGCCGT GTCTACGTTC GATAGCCAAG CCTTGAAGAC AATCTTCAAT TGGGCTTTCC GGGCATTGAA CGTTGCGCCT TGGCAGCTCG GAGAATTTGC GAGATCACCG ATAAGAACCA ATGCAACGAT AGAGTGCGTT GATGCCCGCA ATTTCCGGCA CAGTATGGAT CTATCGATCA CAGACCCGCC ATACGCCGAT 
GCCGTGAATT ACCATGAGAT CACAGAATTC TTCATCGCAT GGATCAGGAA AAATACGCCG ACTGAATTCG AGGAGTGGAC CTGGGATTCA AGGAGGCCTC TTGCGATTAA GGGAGACGGG GAAGATTTCC GCCACGGAAT GGTGGACGCC TATAAGGCGA TGTCGGAAAA AATGGCGCCT AATGGCTTAG AAATTGTCAT GTTCACCCAT CAGTCGGCTT CTGTCTGGGC GGACATGGCT CAAATCTTCT GGGGCGCCGG CCTTCAGGTC ATGGCCGCTT GGTACATCGC CACCGAGACC ACTTCCGATC TGAAGAAAGG CGGGTACGTG CAGGGCACGG TCATCCTGGT TGCGCGGAAA CGTCGGGAGA GAGAGGCTGG CTACAAAGAT GAGATCGTCC AAGAGGTGAA GGCCGAGGTT GCAGACCAGA TCGATACTAT GTCCGGGTTG AGCCAGAACC TAAAAGGTCG CGGCCGTATC GAAAACCTCT TCGAGGATGC GGATCTTCAA ATGGCAGGTT ATGCGGCAGC ACTACGTGTG CTCACCAAGT ATGAGAGAAT TGACGGCACC GACATGACGA AGGAAGCCCT GCGCCCCCGT CGAGCGGGAG AAAAGAACAT CGTCGGAGAG ATCATCGACT TTGCCGTACA GGTGGCCAAT GAGCATATGG TGCCTGAGCA CATGCCTCCC AAGCTTTGGG GGGATTTATC CGGTGCCGAG CGTTTCTACT TCAAAATGCT GGATATCGAG ACAACCTCAC TCCGGAAGCT GGACAACTAC CAGAATTTCG CGAAGGCTTT CCGAGTAAAC GACTACAGTG CGCTTATGGG CAGCATGGAG CCGAACAAAG CGCGGCTGAA ATCCGCTAAG GAATTCAAGA AGGCAGGCTT CGAAATTCCG GACTTCGGTC CATCGTCAAC TCGTTCTGCG CTGTTTGCAA TCTTCGAATT GGAGAGCGAT GTCGAGGGGG ATGACGTACT ATCTCATCTG AAAGATCTGA TGCCCAATTA CCACAATAAG CGGGAGGACT TGGCGGCGAT CGCGGATTAC ATTGCCCGAA AGCGGGAGAC CGTGGACGAG ACAGAATCGC GGGCAGCGCG TATCCTCCAC GGACTGATCC GGAACGAGCG GCTGGGATGA
|
Protein sequence | MDAKVMKSKV VPFSLKDAPS LIERVWPAQM ISVEAQRERK AVHGQTLTSL GSYWKGRKPL VLVRACILGA LLPSTGDDEK DLAIFEKLMG MSDDQVACRF KEKVSVEEVQ KFGTVSQQSA LIDDDGQGTT LRKLPKAQRE SLMGSVIQRM PYDVRAQKLL RPEEVDESVL TGPALDEVNK HLGTDASNLS ELLEQLGLMR FGRRPVVVDT FSGSGSIPFE AARLGCDVFA SDLNPIACML TWAALNVVGG SDASRKEIQQ EQAALAKAVD EEITRLGIEH DSHGNRAKVF LYCLEARCPQ TQWLVPLAPT FVVSEARRVV ARLVPNKSKK QFDLEIVSGA NAKEMAEAAA GTVRDGKMVY SLDGETYSTP IKTIRGDRRE AGGIAANNLR LWEKSDVAPR SGDILSERLY CIQWIEQETV GKSRQVTFFA SPTAADLKRE LKVKNLVSHK LKEWQEGGFV SDMEIETGKE NEGPIRTNGW KFWHQMFMPR AILTAALIAE HGQGRAHAAL SLCKYLDNNS KSCRWKVSQS GGDGGAVSTF DSQALKTIFN WAFRALNVAP WQLGEFARSP IRTNATIECV DARNFRHSMD LSITDPPYAD AVNYHEITEF FIAWIRKNTP TEFEEWTWDS RRPLAIKGDG EDFRHGMVDA YKAMSEKMAP NGLEIVMFTH QSASVWADMA QIFWGAGLQV MAAWYIATET TSDLKKGGYV QGTVILVARK RREREAGYKD EIVQEVKAEV ADQIDTMSGL SQNLKGRGRI ENLFEDADLQ MAGYAAALRV LTKYERIDGT DMTKEALRPR RAGEKNIVGE IIDFAVQVAN EHMVPEHMPP KLWGDLSGAE RFYFKMLDIE TTSLRKLDNY QNFAKAFRVN DYSALMGSME PNKARLKSAK EFKKAGFEIP DFGPSSTRSA LFAIFELESD VEGDDVLSHL KDLMPNYHNK REDLAAIADY IARKRETVDE TESRAARILH GLIRNERLG
|
| |
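The gene sequence above is printed in space-separated blocks of 10 bases, begins with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. A minimal sketch for working with such a block-formatted sequence follows. The helper names are hypothetical, and the CDS check is a simplification that accepts only ATG as a start codon, although translation table 11 also permits alternative starts.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def looks_like_cds(seq: str) -> bool:
    """Simplified check: length is a multiple of 3, ATG start, stop codon at the end."""
    s = clean(seq)
    return len(s) % 3 == 0 and s.startswith("ATG") and s[-3:] in STOP_CODONS


# First two display blocks of the gene sequence in this record.
first_blocks = "ATGGATGCCA AGGTCATGAA"
print(gc_percent(first_blocks))  # 45.0
```

Passing the complete gene sequence to gc_percent gives a value that can be compared with the GC content field of the record, and looks_like_cds offers a quick sanity check that no bases were lost when copying the sequence.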