Gene RPD_1200 details


Gene Information

Locus tag: RPD_1200
Symbol:
ID: 4021676
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 1358425
End bp: 1359558
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 64%
IMG OID: 637961392
Product: DNA methylase N-4/N-6
Protein accession: YP_568339
Protein GI: 91975680
COG category: [L] Replication, recombination and repair
COG ID: [COG0863] DNA modification methylase
TIGRFAM ID:
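
The coordinate and length fields above can be cross-checked against each other with a few lines of arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1358425
end_bp = 1359558
gene_length_bp = 1134
protein_length_aa = 377

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print(span_bp, span_bp % 3, coding_codons)  # prints "1134 0 377"
```

Under those assumptions the span equals the listed gene length, is a whole number of codons, and leaves 377 coding codons, matching the listed protein length.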


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTTTGT CGCGTAGCGG GGCGTCTGCA AGGGCGCCCC GCACTCAATT CGAATACCTC 
CCGGAGAACC GGATCATCGT CGGCGATTGC GTCGCCGAGA TGTCGAAGCT TCCGGCCCGC
TCGGTCGATC TGGTGTTCGC AGATCCGCCG TATAATTTGC AGCTCAAGGG CGAGCTGAAA
CGTCCCGACG AATCGCACGT CGATGCGGTC GACAACGATT GGGACAAGTT CTCATCCTTC
GCCGCTTACG ATGATTTCAC CCGCGCCTGG CTGCTCGCCG CGCGCCGGAT CATGAAGCCG
TCGGCGACGA TCTGGGTGAT CGGCTCCTAT CACAACATCT TCCGCGTCGG CGCGATCATG
CAGGACCTCG GGTTCTGGCT GCTCAACGAT ATCGTCTGGC GCAAGACCAA TCCGATGCCG
AATTTCCGCG GCCGCCGATT CACCAACGCC CACGAGACGA TGATCTGGGC GGCGCGCGAC
GAGAACGCCA AGGGCTACAC TTTCAATTAC GACGCGCTGA AGGCCGCCAA CGAGGACGTT
CAGGCGCGCT CCGACTGGCT GATTCCGCTG TGCACCGGCG AGGAGCGGCT GAAGGGCAGC
GACGGCAAGA AGGTGCATCC GACCCAGAAG CCGGAAGGCC TGCTGGCGCG TGTGCTGCTG
TCGTCGTCGA AGCCCGGCGA TCTGGTGATC GATCCGTTCA ACGGCACCGG AACCACCGGC
GCCGTCGCCA AGCGGCTGCG CCGCAACTAC ATCGGCTTCG AGCGCGACCG CGACTATGCC
ACCGCTGCGG AAGCGCGGAT TGCCGCGATC GAGCCGCTGC CGGAAGCCAC ATTGGCGCCG
TTCATGACCG CGCGCAGCGC GCCGCGGGTC GCGTTCGCCG AACTGATCGA ACGCGGAATC
ATTTCGCCCG GGACCAAGCT GGTCGATTCG AAGAAGCGGC ACGGCGCGCT GGTCCGTGCC
GACGGCGCGA TCATGCTCGG CGACAAGGTC GGCTCGATTC ACCGCATCGG CGCGGTGGCG
CAAGGCTCAG AGGCCTGCAA CGGCTGGACG TTCTGGCATG TCGAGACCAG CAAGGGCCTG
CGCCTGATCG ACGAACTCCG CGCCGAAATC CGCAGCGCCA TGGCTGCTGG CTAA
 
Protein sequence
MILSRSGASA RAPRTQFEYL PENRIIVGDC VAEMSKLPAR SVDLVFADPP YNLQLKGELK 
RPDESHVDAV DNDWDKFSSF AAYDDFTRAW LLAARRIMKP SATIWVIGSY HNIFRVGAIM
QDLGFWLLND IVWRKTNPMP NFRGRRFTNA HETMIWAARD ENAKGYTFNY DALKAANEDV
QARSDWLIPL CTGEERLKGS DGKKVHPTQK PEGLLARVLL SSSKPGDLVI DPFNGTGTTG
AVAKRLRRNY IGFERDRDYA TAAEARIAAI EPLPEATLAP FMTARSAPRV AFAELIERGI
ISPGTKLVDS KKRHGALVRA DGAIMLGDKV GSIHRIGAVA QGSEACNGWT FWHVETSKGL
RLIDELRAEI RSAMAAG
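
The gene and protein sequences above are related through translation table 11, and the GC content field is a simple base count. The following is a minimal sketch of both calculations, assuming plain A/C/G/T input with the space-separated 10-base blocks used in this record. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. Table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon-to-amino-acid mapping shared by the standard code and table 11,
# listed in TCAG order for each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    dna = clean(seq)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First two blocks and last 12 bases of the gene sequence in this record.
print(translate("ATGATTTTGT CGCGTAGCGG"))  # prints "MILSRS"
print(translate("GCTGCTGG CTAA"))          # prints "AAG"
```

The two fragments reproduce the first six and last three residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the whole protein and the GC content field to be checked the same way.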