Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0472 |
Symbol | |
ID | 3909817 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 519700 |
End bp | 520872 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637882359 |
Product | DNA-cytosine methyltransferase |
Protein accession | YP_484094 |
Protein GI | 86747598 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0270] Site-specific DNA methylase |
TIGRFAM ID | [TIGR00675] DNA-methyltransferase (dcm) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTG CAGGACTTTT CGCTGGCATC GGAGGGCTCG AGCTTGGTCT TCATCGGGCC GGGCACGAGA CGGTAATCCT GTCCGAAATC TGGCAGCCCG CCGGGGCGGT GTTGGAACAC CGCTTCAAGG GCGCCCCGAA TGTCGGCGAT GTCGCGACGT TGACGTCGCT GCCCTCCGAG GTCGAGCTGA TGACGGCCGG CTTTCCCTGC CAGGACCTCA GCCAAGCCGG CAAGACCGCC GGGATCAAGG GCGGGAAGTC GGGGCTGGTG ACGCATGTTT TCCGGCTGAT CGATCGCAGC CGGCCGAAAT GGGTGCTGCT GGAGAACGTC TCGTTCATGC TGCGCCTCGA CGGCGGCAGC GCCATGACCC GGCTGGTCTC CGAATTCGAA AGGCGCGGCT ATCGCTGGGC CTACCGCGTT GTGAATTCAC TGAGCTTCTT GCCGCATCGG CGCGCGCGGG TGTTCTTTCT GGCGAGCATC GAGGGTGATC CGGCCGATGT CCTGCTGGTC GACGACGCGG AGCCCGCCGG GCTGCAGACA AGCCTGCAAA GCCACGCCCA CGGCTTCTAC TGGACCGAAG GAACGCGCGG TCTGGGATGG GGCCCCGACT GCGTGCCGAC GCTCAAGAAC GGCTCCACGG TCGGCATTCC TTCGCCGCCG GCGATCCTGA TGCCGAACGG GGAAATCGTG ACACCCGATA TCCGCGACGC CGAGCGGCTT CAGGGACTTC CCGCCGACTG GACGAAGCCG GCCGAGAGGG TGGCCCGCGC GTCGTTCCGA TGGTCGCTGG TCGGCAATGC CGTCAGCAAG CCGGTCGCGG CGTGGATCGG ACAGCGGTTG AACGCGCCCG GAGCATACGA CAGATCGCGC GACGGCGGCA GCGTCGTCCG CGGCGATTGG CCCAAGGCGG CGCGCTCCGA CGGCAAAGCC TGCCGTGAAG TCGCGATCTC CGAATTTCCC AAATGGGCGA AGCGGCCGAG CCTTCAGGAC TTCCTGCGCT ACGAGCCGAG ACTGCTGTCG GCTCGCGCCA CGGCAGGGTT TTTGTCGCGG ATCGAGAAGA GCAGCTTGCG CTTCGTACCG GGCTTCAAGG ACAGGGTGCG ATCTCATCTC GATCATGTTC GAGCCATCGA CGCGTTCGTC CAGGAAGGCC GCACGCTGCT CGCGGCCGAA TGA
|
Protein sequence | MKIAGLFAGI GGLELGLHRA GHETVILSEI WQPAGAVLEH RFKGAPNVGD VATLTSLPSE VELMTAGFPC QDLSQAGKTA GIKGGKSGLV THVFRLIDRS RPKWVLLENV SFMLRLDGGS AMTRLVSEFE RRGYRWAYRV VNSLSFLPHR RARVFFLASI EGDPADVLLV DDAEPAGLQT SLQSHAHGFY WTEGTRGLGW GPDCVPTLKN GSTVGIPSPP AILMPNGEIV TPDIRDAERL QGLPADWTKP AERVARASFR WSLVGNAVSK PVAAWIGQRL NAPGAYDRSR DGGSVVRGDW PKAARSDGKA CREVAISEFP KWAKRPSLQD FLRYEPRLLS ARATAGFLSR IEKSSLRFVP GFKDRVRSHL DHVRAIDAFV QEGRTLLAAE
|
| |