Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_2449 |
Symbol | |
ID | 6981190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 2494063 |
End bp | 2496003 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643397163 |
Product | C-5 cytosine-specific DNA methylase |
Protein accession | YP_002281949 |
Protein GI | 209550032 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0270] Site-specific DNA methylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.199486 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGCGCA AGATCAAAGT CGCAGATTTG CTTTGCGGGG CCGGTGGTTC GTCAGCAGGG GCGAAGCGCG CGCTGGAAGA AATGGGCCTG GAGATGGAAT TGGTCTGTGT AAATCACTGG CCGACTGCCA TCGATACCCA CCAGAGGAAT TTTCCCGAAG CCAGACACTA CATTCAGGAC ATCGCCACAG TCCGCCCGCA CATCCTGGTG CCGGAAGGCT ATCTGGATCT CTTGATGGCG TCCCCGACGT GCACGCATCA TTCTGTGGCG CGCGGCGGCA AGCCGACCAG CGACCAGCAG CGCAGCGACC CTTGGCATAT CGTCACCTGG CTGACCGAGT TGCGGGTAAA GCGGATCATC ATCGAAAACG TCTGGGAGTT CTGCGGCTGG GGCCCGGTCA ACATGAAGAC CGGTCGCCCT ATCCCGTCCA GAAAGGGCGA ATATTTCCAC GCATGGACTG AGACGCTCCG GCGTCTTGGC TTCGAACTGG AGTGGCGCCG ACTCAACGCC GCCGACTACG GGGACGCGAC CACGCGGCAA CGCTTCATCC TCATGGGTCG ATCCGACGGC CGCAAGATCC ACTGGCCAAT GCCGACGCAC CGAAAGCGCG ATGAGGTCAA CGCAGATCTG TTCTCCGCCG CAGAGCCGTG GCGCCCGGCG CGGGAGATCA TCGACTGGTC TATCAAGGGG CGCTCCATCC TCAACCGCAA GAAGCCGTTG GCGCCCAAGA CGCTAGCGCG GGTGTTGGCC GGCGCATTTA AGTTCGGCTG GCCGAAACCG TTCATCGACA AGCTTATGGA AGAGATCGAG CGGTCCCTGC GCTACCATAT CAATTGGGCC TTCGAAGCAC GAAACGCAAA GGCCTCAAAG TCCAAACGTC GTCAGCGTCG TGCACTGGCG AAGGATTTGA TCAGGCGCCT ACGCCACTTC AGGATTGCGC CTGCCGAGTA CGCCAAGGGA GGGCGGGCCG CTGAGCCTAT GGTCATCACG TTGCGGCGCA ACGGTAACGG CACGTCGATA TCAAGCCCCA TCCCCACGGT CGCTGCAAAT GGGCAGCATG TTGGACTTGC CGAGCCCGTC ATCGTCAACA TGAAGGGCCA ATCGACGGCG ACTTCGTCGA GAGAGCCCTT ACCCACGCAA ACGTCGCATG CGGCTCATTT ATACGCTGCA GAGCCAATCA TTCTCTCACA GCACAATAGC GGGTCAGCTC GAGAGGTGAG TGATCCTCTC CCCACTATCA CGACCGGAGG AGCAGCGAAC GAAGCCCGGC CAGGTTGCGC TAGGCCTATG CTTGTCGAGC CGTTCGTTCT GTCTCAAGCA TCTGGAGGCT CCCCGCGGGC CGTTAGCGAC CCCATCCCGA CACCGACGAC AGGCGGCAAC GGGGCGGCGC ACGCGCTCAT CTCTCCATAC TATGGCTCCG GTTCCGGCGA GACCTGCAAT CATGTCGACG AGGTGTTGCC GACCATAACC AGCAAAGGCC GTTTCGGCAT GGTGGTGCCG GTCACCAATT CAAACGGTGG AGCGACGGCG CGAAACATCG ATGTTGACCC GGTTCCCACC ATGACCACCG CGAAGGGCGG TGAGTTCGCG TTCATCGCCG CACAGTTCGG CGAGCGGGAA GGCCAGGCGC CGCGCGTCCA CGATATCGAC CAACCCACGC CGACGATCGC GGCAACCGGC CATATCAACC TCGTCGAGTC TGGGCCGGAA TACGACATCC TCTTCCGCAT GCTGGAGCCG CATGAGCTGG CGGCGGCAAT GGGCTTCAAT ACCGAAGAGG CGACGTATGA GTTCGCCGGC ACCAAGACCG AAAAAATCAA GCAGATTGGC AACGCCGTCT CGGTGGCGAA GATGAAGGCT TGCGTCGGTG CAATCATGGC AGACGCGGTG CCGAAGCTGA AATCGAGGCC AGATGCTGAA TTTCTGGAGG CCGCTGAATG A
|
Protein sequence | MARKIKVADL LCGAGGSSAG AKRALEEMGL EMELVCVNHW PTAIDTHQRN FPEARHYIQD IATVRPHILV PEGYLDLLMA SPTCTHHSVA RGGKPTSDQQ RSDPWHIVTW LTELRVKRII IENVWEFCGW GPVNMKTGRP IPSRKGEYFH AWTETLRRLG FELEWRRLNA ADYGDATTRQ RFILMGRSDG RKIHWPMPTH RKRDEVNADL FSAAEPWRPA REIIDWSIKG RSILNRKKPL APKTLARVLA GAFKFGWPKP FIDKLMEEIE RSLRYHINWA FEARNAKASK SKRRQRRALA KDLIRRLRHF RIAPAEYAKG GRAAEPMVIT LRRNGNGTSI SSPIPTVAAN GQHVGLAEPV IVNMKGQSTA TSSREPLPTQ TSHAAHLYAA EPIILSQHNS GSAREVSDPL PTITTGGAAN EARPGCARPM LVEPFVLSQA SGGSPRAVSD PIPTPTTGGN GAAHALISPY YGSGSGETCN HVDEVLPTIT SKGRFGMVVP VTNSNGGATA RNIDVDPVPT MTTAKGGEFA FIAAQFGERE GQAPRVHDID QPTPTIAATG HINLVESGPE YDILFRMLEP HELAAAMGFN TEEATYEFAG TKTEKIKQIG NAVSVAKMKA CVGAIMADAV PKLKSRPDAE FLEAAE
|
| |