Gene RPD_0013 details

Gene Information

Locus tag: RPD_0013
Symbol:
ID: 4020467
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 15807
End bp: 18746
Gene Length: 2940 bp
Protein Length: 979 aa
Translation table: 11
GC content: 55%
IMG OID: 637960189
Product: DNA methylase containing a Zn-ribbon
Protein accession: YP_567154
Protein GI: 91974495
COG category: [L] Replication, recombination and repair
COG ID: [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon
TIGRFAM ID:
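
The gene length and protein length reported above can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon (assumed present) encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(15807, 18746)
print(length_bp, protein_length(length_bp))
```

With the coordinates from this record, the two values agree with the Gene Length and Protein Length fields.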


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.607191
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGCCA AGGTCATGAA GTCGAAGGTT GTCCCATTCT CGCTGAAGGA CGCGCCATCG 
TTGATCGAAC GCGTGTGGCC GGCGCAAATG ATCTCTGTTG AGGCTCAACG CGAGCGCAAG
GCTGTTCATG GTCAAACCCT GACCAGCCTC GGATCATACT GGAAAGGGCG CAAACCGTTG
GTGCTCGTCA GGGCGTGCAT CCTTGGAGCT TTGTTGCCAT CCACTGGAGA CGACGAGAAG
GATCTGGCGA TCTTTGAAAA GCTTATGGGA ATGAGCGACG ACCAAGTCGC TTGTCGATTC
AAGGAAAAGG TGTCGGTCGA AGAAGTCCAA AAATTCGGAA CTGTTTCTCA ACAAAGTGCG
CTGATCGACG ACGATGGCCA AGGGACAACG CTTAGGAAGC TCCCGAAAGC ACAGCGCGAA
TCATTGATGG GATCGGTGAT CCAGCGCATG CCGTATGATG TGCGCGCCCA GAAGCTTCTG
CGTCCCGAAG AGGTCGATGA AAGTGTCTTA ACCGGGCCGG CGCTCGATGA AGTCAACAAG
CATCTTGGAA CTGACGCGAG CAATCTGTCT GAACTGCTCG AACAGTTGGG GCTCATGCGG
TTCGGTAGAC GCCCGGTCGT GGTCGACACT TTTTCGGGTA GCGGATCAAT ACCATTCGAG
GCTGCCAGGC TGGGGTGCGA TGTCTTTGCC TCAGATCTCA ATCCAATTGC CTGCATGTTG
ACGTGGGCAG CACTAAACGT AGTCGGTGGC TCGGACGCGT CGAGGAAAGA AATCCAGCAG
GAGCAAGCGG CTCTAGCAAA GGCTGTCGAT GAGGAGATCA CCCGGCTCGG GATAGAACAC
GATAGCCACG GCAATCGCGC CAAGGTCTTT TTGTACTGCC TTGAGGCGCG CTGTCCTCAA
ACCCAATGGT TGGTCCCGCT TGCGCCTACT TTTGTTGTTT CCGAAGCAAG AAGAGTGGTC
GCTCGTCTTG TTCCGAACAA ATCCAAGAAA CAGTTCGATC TGGAAATTGT TTCTGGTGCA
AATGCCAAGG AGATGGCCGA GGCAGCCGCT GGAACGGTGC GCGACGGAAA GATGGTTTAC
TCGCTGGACG GCGAAACTTA CTCCACACCA ATAAAGACGA TTCGGGGCGA TCGCCGCGAA
GCCGGGGGGA TCGCGGCGAA CAATCTGCGA CTCTGGGAAA AGTCAGATGT TGCTCCGAGG
TCGGGCGATA TTCTTAGCGA GCGTCTCTAC TGCATCCAGT GGATCGAGCA GGAGACCGTG
GGAAAAAGTC GGCAGGTCAC GTTTTTTGCC TCGCCTACTG CCGCCGACCT TAAGCGAGAA
TTGAAGGTAA AGAACCTCGT CAGTCACAAA CTTAAGGAGT GGCAGGAGGG TGGGTTCGTC
TCGGACATGG AGATCGAAAC GGGCAAGGAA AATGAAGGGC CGATCCGCAC TAACGGCTGG
AAGTTTTGGC ACCAGATGTT CATGCCAAGG GCGATACTGA CCGCGGCGCT CATCGCTGAG
CACGGCCAAG GTCGTGCCCA TGCCGCACTG TCTCTTTGCA AGTACCTAGA TAACAATTCA
AAATCGTGCC GTTGGAAAGT GTCGCAGTCA GGAGGAGACG GCGGCGCCGT GTCTACGTTC
GATAGCCAAG CCTTGAAGAC AATCTTCAAT TGGGCTTTCC GGGCATTGAA CGTTGCGCCT
TGGCAGCTCG GAGAATTTGC GAGATCACCG ATAAGAACCA ATGCAACGAT AGAGTGCGTT
GATGCCCGCA ATTTCCGGCA CAGTATGGAT CTATCGATCA CAGACCCGCC ATACGCCGAT
GCCGTGAATT ACCATGAGAT CACAGAATTC TTCATCGCAT GGATCAGGAA AAATACGCCG
ACTGAATTCG AGGAGTGGAC CTGGGATTCA AGGAGGCCTC TTGCGATTAA GGGAGACGGG
GAAGATTTCC GCCACGGAAT GGTGGACGCC TATAAGGCGA TGTCGGAAAA AATGGCGCCT
AATGGCTTAG AAATTGTCAT GTTCACCCAT CAGTCGGCTT CTGTCTGGGC GGACATGGCT
CAAATCTTCT GGGGCGCCGG CCTTCAGGTC ATGGCCGCTT GGTACATCGC CACCGAGACC
ACTTCCGATC TGAAGAAAGG CGGGTACGTG CAGGGCACGG TCATCCTGGT TGCGCGGAAA
CGTCGGGAGA GAGAGGCTGG CTACAAAGAT GAGATCGTCC AAGAGGTGAA GGCCGAGGTT
GCAGACCAGA TCGATACTAT GTCCGGGTTG AGCCAGAACC TAAAAGGTCG CGGCCGTATC
GAAAACCTCT TCGAGGATGC GGATCTTCAA ATGGCAGGTT ATGCGGCAGC ACTACGTGTG
CTCACCAAGT ATGAGAGAAT TGACGGCACC GACATGACGA AGGAAGCCCT GCGCCCCCGT
CGAGCGGGAG AAAAGAACAT CGTCGGAGAG ATCATCGACT TTGCCGTACA GGTGGCCAAT
GAGCATATGG TGCCTGAGCA CATGCCTCCC AAGCTTTGGG GGGATTTATC CGGTGCCGAG
CGTTTCTACT TCAAAATGCT GGATATCGAG ACAACCTCAC TCCGGAAGCT GGACAACTAC
CAGAATTTCG CGAAGGCTTT CCGAGTAAAC GACTACAGTG CGCTTATGGG CAGCATGGAG
CCGAACAAAG CGCGGCTGAA ATCCGCTAAG GAATTCAAGA AGGCAGGCTT CGAAATTCCG
GACTTCGGTC CATCGTCAAC TCGTTCTGCG CTGTTTGCAA TCTTCGAATT GGAGAGCGAT
GTCGAGGGGG ATGACGTACT ATCTCATCTG AAAGATCTGA TGCCCAATTA CCACAATAAG
CGGGAGGACT TGGCGGCGAT CGCGGATTAC ATTGCCCGAA AGCGGGAGAC CGTGGACGAG
ACAGAATCGC GGGCAGCGCG TATCCTCCAC GGACTGATCC GGAACGAGCG GCTGGGATGA
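
The gene sequence above is printed in blocks of 10 bases, so the whitespace has to be removed before the sequence can be measured or analysed. The following is a minimal sketch of that clean-up together with a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example runs only on the first row of the sequence rather than on the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence as listed in this record.
first_row = "ATGGATGCCA AGGTCATGAA GTCGAAGGTT GTCCCATTCT CGCTGAAGGA CGCGCCATCG"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq) * 100), "% GC")
```

Passing the complete sequence block to the same two functions gives values that can be compared with the Gene Length and GC content fields.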
 
Protein sequence
MDAKVMKSKV VPFSLKDAPS LIERVWPAQM ISVEAQRERK AVHGQTLTSL GSYWKGRKPL 
VLVRACILGA LLPSTGDDEK DLAIFEKLMG MSDDQVACRF KEKVSVEEVQ KFGTVSQQSA
LIDDDGQGTT LRKLPKAQRE SLMGSVIQRM PYDVRAQKLL RPEEVDESVL TGPALDEVNK
HLGTDASNLS ELLEQLGLMR FGRRPVVVDT FSGSGSIPFE AARLGCDVFA SDLNPIACML
TWAALNVVGG SDASRKEIQQ EQAALAKAVD EEITRLGIEH DSHGNRAKVF LYCLEARCPQ
TQWLVPLAPT FVVSEARRVV ARLVPNKSKK QFDLEIVSGA NAKEMAEAAA GTVRDGKMVY
SLDGETYSTP IKTIRGDRRE AGGIAANNLR LWEKSDVAPR SGDILSERLY CIQWIEQETV
GKSRQVTFFA SPTAADLKRE LKVKNLVSHK LKEWQEGGFV SDMEIETGKE NEGPIRTNGW
KFWHQMFMPR AILTAALIAE HGQGRAHAAL SLCKYLDNNS KSCRWKVSQS GGDGGAVSTF
DSQALKTIFN WAFRALNVAP WQLGEFARSP IRTNATIECV DARNFRHSMD LSITDPPYAD
AVNYHEITEF FIAWIRKNTP TEFEEWTWDS RRPLAIKGDG EDFRHGMVDA YKAMSEKMAP
NGLEIVMFTH QSASVWADMA QIFWGAGLQV MAAWYIATET TSDLKKGGYV QGTVILVARK
RREREAGYKD EIVQEVKAEV ADQIDTMSGL SQNLKGRGRI ENLFEDADLQ MAGYAAALRV
LTKYERIDGT DMTKEALRPR RAGEKNIVGE IIDFAVQVAN EHMVPEHMPP KLWGDLSGAE
RFYFKMLDIE TTSLRKLDNY QNFAKAFRVN DYSALMGSME PNKARLKSAK EFKKAGFEIP
DFGPSSTRSA LFAIFELESD VEGDDVLSHL KDLMPNYHNK REDLAAIADY IARKRETVDE
TESRAARILH GLIRNERLG
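
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of such a translation, assuming the amino acid assignments of NCBI table 11, which match the standard genetic code (the two tables differ in their permitted start codons). The names `CODON_TABLE` and `translate` are illustrative, and the example translates only the first 30 bases of the gene.

```python
from itertools import product

# Amino acids in NCBI order: first, second and third base each cycle through T, C, A, G.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a DNA sequence (spaces allowed) up to the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding region
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGGATGCCA AGGTCATGAA GTCGAAGGTT"))
```

The output can be compared with the first block of the protein sequence listed above.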