Gene Information |
Locus tag | RPB_0072 |
Symbol | |
ID | 3907812 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 74067 |
End bp | 74936 |
Gene Length | 870 bp |
Protein Length | 289 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637881953 |
Product | HemK family modification methylase |
Protein accession | YP_483695 |
Protein GI | 86747199 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG2890] Methylase of polypeptide chain release factors |
TIGRFAM ID | [TIGR00536] HemK family putative methylases; [TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific |
|
|
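The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the listed gene sequence ends in TAA). The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 74067
END_BP = 74936
GENE_LENGTH_BP = 870
PROTEIN_LENGTH_AA = 289


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in the record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, in which GTG can serve as an alternative start codon that is read as methionine at initiation.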
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.794826 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.684454 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGATCAAC AGAATTCTGT CGCCGAAGCC CGCCGCGCGC TAGGCCGGCG CCTGAAGGAC GCCGGCATCG AATCGGCCGA GCTCGACGCC CGGCTGCTGA TCGGCGAGGC GACCGGACTC GATCTCACCG GACTGATCGT GCAGGCCGAG CGGGTGCTCA CGCTGGACGA GGCGGAGCGG CTCGACGCCT TCGCGGTGCG CCGGTTGGCC GGTGAGCCGG TGGCGCGGAT CCTCGGCGTC CGCGAATTCT GGGGGCTGGC GCTGCGGCTG TCCGACGACA CGCTGGTGCC GCGGCCCGAC ACCGAGACCG TGGTCGAGGC CGCGCTCGAC CATCTGCGCG CCGAGGCGCG CGCCCGCCCG CTGATTCTCG ATCTCGGCAC CGGCTCCGGC GCGATCCTGC TGGCGCTGCT GTCGGAGTGC CCGGACGCGT TCGGCGTCGC GACCGACATC AGTCTCGGCG CCTTGCGCGC CGCGCGGGCG AATGCCGCCG CGCTCGGCCT TGCCGATCGC GCCGGCTTCG TCGCCTGCGA CTACGCTGCT GCGCTCGGCG GTTCGTTCGA CCTGATCGTC TCCAACCCGC CATACATCCC GGCAAGCGCG ATCGCCGCGC TCGATGTCGA GGTCCGGGAG CACGATCCGC GCCGCGCGCT CGACGGCGGT GAGGACGGCC TCGATGCCTA TCGCCGGATC ATCCCCGAAG CGGCGCGGCT GCTCGGGCGC GGCGGGGCGT TGGTGGTGGA GATCGGCCAA GGCCAGGGCG ACGACGTCGC CGCGCTGATG CGGGCCTCTG GGCTCGCCGT CCCGGAGCCG CCACGCCGCG ATCTGGGTGG CGTTTTTCGG GCGGTGACGG GGCGCAATTT GACGGGTTAA
|
Protein sequence | MDQQNSVAEA RRALGRRLKD AGIESAELDA RLLIGEATGL DLTGLIVQAE RVLTLDEAER LDAFAVRRLA GEPVARILGV REFWGLALRL SDDTLVPRPD TETVVEAALD HLRAEARARP LILDLGTGSG AILLALLSEC PDAFGVATDI SLGALRAARA NAAALGLADR AGFVACDYAA ALGGSFDLIV SNPPYIPASA IAALDVEVRE HDPRRALDGG EDGLDAYRRI IPEAARLLGR GGALVVEIGQ GQGDDVAALM RASGLAVPEP PRRDLGGVFR AVTGRNLTG
|
| |