Gene RPB_0072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0072 
Symbol 
ID3907812 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp74067 
End bp74936 
Gene Length870 bp 
Protein Length289 aa 
Translation table11 
GC content73% 
IMG OID637881953 
ProductHemK family modification methylase 
Protein accessionYP_483695 
Protein GI86747199 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2890] Methylase of polypeptide chain release factors 
TIGRFAM ID[TIGR00536] HemK family putative methylases
[TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.794826 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.684454 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATCAAC AGAATTCTGT CGCCGAAGCC CGCCGCGCGC TAGGCCGGCG CCTGAAGGAC 
GCCGGCATCG AATCGGCCGA GCTCGACGCC CGGCTGCTGA TCGGCGAGGC GACCGGACTC
GATCTCACCG GACTGATCGT GCAGGCCGAG CGGGTGCTCA CGCTGGACGA GGCGGAGCGG
CTCGACGCCT TCGCGGTGCG CCGGTTGGCC GGTGAGCCGG TGGCGCGGAT CCTCGGCGTC
CGCGAATTCT GGGGGCTGGC GCTGCGGCTG TCCGACGACA CGCTGGTGCC GCGGCCCGAC
ACCGAGACCG TGGTCGAGGC CGCGCTCGAC CATCTGCGCG CCGAGGCGCG CGCCCGCCCG
CTGATTCTCG ATCTCGGCAC CGGCTCCGGC GCGATCCTGC TGGCGCTGCT GTCGGAGTGC
CCGGACGCGT TCGGCGTCGC GACCGACATC AGTCTCGGCG CCTTGCGCGC CGCGCGGGCG
AATGCCGCCG CGCTCGGCCT TGCCGATCGC GCCGGCTTCG TCGCCTGCGA CTACGCTGCT
GCGCTCGGCG GTTCGTTCGA CCTGATCGTC TCCAACCCGC CATACATCCC GGCAAGCGCG
ATCGCCGCGC TCGATGTCGA GGTCCGGGAG CACGATCCGC GCCGCGCGCT CGACGGCGGT
GAGGACGGCC TCGATGCCTA TCGCCGGATC ATCCCCGAAG CGGCGCGGCT GCTCGGGCGC
GGCGGGGCGT TGGTGGTGGA GATCGGCCAA GGCCAGGGCG ACGACGTCGC CGCGCTGATG
CGGGCCTCTG GGCTCGCCGT CCCGGAGCCG CCACGCCGCG ATCTGGGTGG CGTTTTTCGG
GCGGTGACGG GGCGCAATTT GACGGGTTAA
 
Protein sequence
MDQQNSVAEA RRALGRRLKD AGIESAELDA RLLIGEATGL DLTGLIVQAE RVLTLDEAER 
LDAFAVRRLA GEPVARILGV REFWGLALRL SDDTLVPRPD TETVVEAALD HLRAEARARP
LILDLGTGSG AILLALLSEC PDAFGVATDI SLGALRAARA NAAALGLADR AGFVACDYAA
ALGGSFDLIV SNPPYIPASA IAALDVEVRE HDPRRALDGG EDGLDAYRRI IPEAARLLGR
GGALVVEIGQ GQGDDVAALM RASGLAVPEP PRRDLGGVFR AVTGRNLTG