Gene RPB_0472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0472 
Symbol 
ID3909817 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp519700 
End bp520872 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content66% 
IMG OID637882359 
ProductDNA-cytosine methyltransferase 
Protein accessionYP_484094 
Protein GI86747598 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID[TIGR00675] DNA-methyltransferase (dcm) 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTG CAGGACTTTT CGCTGGCATC GGAGGGCTCG AGCTTGGTCT TCATCGGGCC 
GGGCACGAGA CGGTAATCCT GTCCGAAATC TGGCAGCCCG CCGGGGCGGT GTTGGAACAC
CGCTTCAAGG GCGCCCCGAA TGTCGGCGAT GTCGCGACGT TGACGTCGCT GCCCTCCGAG
GTCGAGCTGA TGACGGCCGG CTTTCCCTGC CAGGACCTCA GCCAAGCCGG CAAGACCGCC
GGGATCAAGG GCGGGAAGTC GGGGCTGGTG ACGCATGTTT TCCGGCTGAT CGATCGCAGC
CGGCCGAAAT GGGTGCTGCT GGAGAACGTC TCGTTCATGC TGCGCCTCGA CGGCGGCAGC
GCCATGACCC GGCTGGTCTC CGAATTCGAA AGGCGCGGCT ATCGCTGGGC CTACCGCGTT
GTGAATTCAC TGAGCTTCTT GCCGCATCGG CGCGCGCGGG TGTTCTTTCT GGCGAGCATC
GAGGGTGATC CGGCCGATGT CCTGCTGGTC GACGACGCGG AGCCCGCCGG GCTGCAGACA
AGCCTGCAAA GCCACGCCCA CGGCTTCTAC TGGACCGAAG GAACGCGCGG TCTGGGATGG
GGCCCCGACT GCGTGCCGAC GCTCAAGAAC GGCTCCACGG TCGGCATTCC TTCGCCGCCG
GCGATCCTGA TGCCGAACGG GGAAATCGTG ACACCCGATA TCCGCGACGC CGAGCGGCTT
CAGGGACTTC CCGCCGACTG GACGAAGCCG GCCGAGAGGG TGGCCCGCGC GTCGTTCCGA
TGGTCGCTGG TCGGCAATGC CGTCAGCAAG CCGGTCGCGG CGTGGATCGG ACAGCGGTTG
AACGCGCCCG GAGCATACGA CAGATCGCGC GACGGCGGCA GCGTCGTCCG CGGCGATTGG
CCCAAGGCGG CGCGCTCCGA CGGCAAAGCC TGCCGTGAAG TCGCGATCTC CGAATTTCCC
AAATGGGCGA AGCGGCCGAG CCTTCAGGAC TTCCTGCGCT ACGAGCCGAG ACTGCTGTCG
GCTCGCGCCA CGGCAGGGTT TTTGTCGCGG ATCGAGAAGA GCAGCTTGCG CTTCGTACCG
GGCTTCAAGG ACAGGGTGCG ATCTCATCTC GATCATGTTC GAGCCATCGA CGCGTTCGTC
CAGGAAGGCC GCACGCTGCT CGCGGCCGAA TGA
 
Protein sequence
MKIAGLFAGI GGLELGLHRA GHETVILSEI WQPAGAVLEH RFKGAPNVGD VATLTSLPSE 
VELMTAGFPC QDLSQAGKTA GIKGGKSGLV THVFRLIDRS RPKWVLLENV SFMLRLDGGS
AMTRLVSEFE RRGYRWAYRV VNSLSFLPHR RARVFFLASI EGDPADVLLV DDAEPAGLQT
SLQSHAHGFY WTEGTRGLGW GPDCVPTLKN GSTVGIPSPP AILMPNGEIV TPDIRDAERL
QGLPADWTKP AERVARASFR WSLVGNAVSK PVAAWIGQRL NAPGAYDRSR DGGSVVRGDW
PKAARSDGKA CREVAISEFP KWAKRPSLQD FLRYEPRLLS ARATAGFLSR IEKSSLRFVP
GFKDRVRSHL DHVRAIDAFV QEGRTLLAAE