Gene RPB_1073 details


Gene Information

Locus tag: RPB_1073
Symbol:
ID: 3908925
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1232526
End bp: 1233659
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 65%
IMG OID: 637882966
Product: DNA methylase N-4/N-6
Protein accession: YP_484694
Protein GI: 86748198
COG category: [L] Replication, recombination and repair
COG ID: [COG0863] DNA modification methylase
TIGRFAM ID:
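
The start, end, gene length, and protein length fields above are consistent with one another, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the record.

```python
# Values copied from the record above.
START_BP = 1232526
END_BP = 1233659


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1134 377
```

Under these assumptions the computed values match the 1134 bp and 377 aa reported in the record.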


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.981822
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTTTGT CGCGTCGCGG GGCGTCTGCA AGGGCGCCCC GCACTCAATT CGAGTCCGCT 
CCGGAGAATC GAATCATCGT CGGCGATTGC GTCGCCGAGA TGTCGAAGCT TCCGGCCAAA
TCGGTCGATC TGGTGTTCGC CGATCCGCCG TACAATCTTC AACTCAAGGG CGCGCTCAAA
CGCCCCGACG AATCGCAGGT CGACGCGGTC GACGACGATT GGGACAAGTT CTCGTCGTTC
GCCGCCTATG ACGACTTCAC CCGCGCCTGG CTGCTCGCGG CACGCCGGAT CATGAAGCCG
TCTGCGACGA TCTGGGTGAT CGGCTCGTAT CACAACATCT TCCGCGTCGG CGCGATCATG
CAGGACCTCG GGTTCTGGGT GCTCAACGAC ATCGTCTGGC GCAAGACCAA CCCGATGCCG
AATTTCCGCG GCCGCAGATT CACCAATGCC CACGAGACCA TGATCTGGGC AGCGCGCGAC
GAGAACGCCA AGGGCTACAC CTTCAACTAC GACGCGCTGA AGGCCTCGAA CGAGGACGTC
CAGGCACGCT CCGACTGGCT GATTCCGCTG TGCACCGGCG ACGAACGGCT GAAGGGCAAG
GACGGCAAGA AGGTGCATCC GACGCAGAAG CCGGAAGGCC TGCTGGCGCG CGTGCTGTTG
AGTTCGTCGA AGCCCGGCGA TCTGGTGATC GATCCGTTCA ATGGAACCGG CACCACCGGC
GCCGTCGCCA AGCGTCTGCG CCGCAACTAC ATCGGCTTCG AGCGCGACCG CACCTATGCG
GACGCGGCGC GGGCGCGAAT CGATGCGGTC GAACCGCTCC CGGAAGACAC GCTGAAACCG
TTCCTCACCG CGCGCGACGC GCCGCGGGTG GCGTTCTCCG AACTGATCGA GCGCGGCATG
ATCTCGCCGG GCGCCAAACT GGTCGACTCG AAGAAGCGCC ACGGCGCGCT GGTCCGCGCC
GACGGCGCGA TCATGCTCGG CGACAAGGTC GGCTCCATCC ACCGCATCGG CGCAATGGCG
CAGGGCTCCG AAGCCTGCAA CGGCTGGACC TTCTGGCACG TCGAGACCAC CAAGGGCCTG
CGCCTGATCG ACGAACTGCG CGCCGAAGTG CGCAGCGCGA TGGCCGTCGG CTGA
 
Protein sequence
MILSRRGASA RAPRTQFESA PENRIIVGDC VAEMSKLPAK SVDLVFADPP YNLQLKGALK 
RPDESQVDAV DDDWDKFSSF AAYDDFTRAW LLAARRIMKP SATIWVIGSY HNIFRVGAIM
QDLGFWVLND IVWRKTNPMP NFRGRRFTNA HETMIWAARD ENAKGYTFNY DALKASNEDV
QARSDWLIPL CTGDERLKGK DGKKVHPTQK PEGLLARVLL SSSKPGDLVI DPFNGTGTTG
AVAKRLRRNY IGFERDRTYA DAARARIDAV EPLPEDTLKP FLTARDAPRV AFSELIERGM
ISPGAKLVDS KKRHGALVRA DGAIMLGDKV GSIHRIGAMA QGSEACNGWT FWHVETTKGL
RLIDELRAEV RSAMAVG
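
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch, not the method used to generate this record. It assumes the elongation codon assignments of translation table 11 are the same as those of the standard genetic code, and it ignores the alternative start codons that table 11 permits, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two 10-base blocks of the gene sequence above.
print(translate("ATGATTTTGT CGCGTCGCGG"))  # prints: MILSRR
```

The output matches the first six residues of the protein sequence. Passing the full gene sequence, spaces and line breaks included, should work the same way, since whitespace is stripped before translation.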