Gene Rleg2_2449 details


Gene Information

Locus tag: Rleg2_2449
Symbol:
ID: 6981190
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 2494063
End bp: 2496003
Gene Length: 1941 bp
Protein Length: 646 aa
Translation table: 11
GC content: 61%
IMG OID: 643397163
Product: C-5 cytosine-specific DNA methylase
Protein accession: YP_002281949
Protein GI: 209550032
COG category: [L] Replication, recombination and repair
COG ID: [COG0270] Site-specific DNA methylase
TIGRFAM ID:
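
The length fields above agree with the listed coordinates, which a short calculation can confirm. This is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends with a single stop codon (the record states neither); the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2494063, 2496003)
    print(length, protein_length(length))
```

With the Start bp and End bp values from this record, the sketch gives 1941 bp and 646 aa, matching the Gene Length and Protein Length fields.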


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.199486
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGCGCA AGATCAAAGT CGCAGATTTG CTTTGCGGGG CCGGTGGTTC GTCAGCAGGG 
GCGAAGCGCG CGCTGGAAGA AATGGGCCTG GAGATGGAAT TGGTCTGTGT AAATCACTGG
CCGACTGCCA TCGATACCCA CCAGAGGAAT TTTCCCGAAG CCAGACACTA CATTCAGGAC
ATCGCCACAG TCCGCCCGCA CATCCTGGTG CCGGAAGGCT ATCTGGATCT CTTGATGGCG
TCCCCGACGT GCACGCATCA TTCTGTGGCG CGCGGCGGCA AGCCGACCAG CGACCAGCAG
CGCAGCGACC CTTGGCATAT CGTCACCTGG CTGACCGAGT TGCGGGTAAA GCGGATCATC
ATCGAAAACG TCTGGGAGTT CTGCGGCTGG GGCCCGGTCA ACATGAAGAC CGGTCGCCCT
ATCCCGTCCA GAAAGGGCGA ATATTTCCAC GCATGGACTG AGACGCTCCG GCGTCTTGGC
TTCGAACTGG AGTGGCGCCG ACTCAACGCC GCCGACTACG GGGACGCGAC CACGCGGCAA
CGCTTCATCC TCATGGGTCG ATCCGACGGC CGCAAGATCC ACTGGCCAAT GCCGACGCAC
CGAAAGCGCG ATGAGGTCAA CGCAGATCTG TTCTCCGCCG CAGAGCCGTG GCGCCCGGCG
CGGGAGATCA TCGACTGGTC TATCAAGGGG CGCTCCATCC TCAACCGCAA GAAGCCGTTG
GCGCCCAAGA CGCTAGCGCG GGTGTTGGCC GGCGCATTTA AGTTCGGCTG GCCGAAACCG
TTCATCGACA AGCTTATGGA AGAGATCGAG CGGTCCCTGC GCTACCATAT CAATTGGGCC
TTCGAAGCAC GAAACGCAAA GGCCTCAAAG TCCAAACGTC GTCAGCGTCG TGCACTGGCG
AAGGATTTGA TCAGGCGCCT ACGCCACTTC AGGATTGCGC CTGCCGAGTA CGCCAAGGGA
GGGCGGGCCG CTGAGCCTAT GGTCATCACG TTGCGGCGCA ACGGTAACGG CACGTCGATA
TCAAGCCCCA TCCCCACGGT CGCTGCAAAT GGGCAGCATG TTGGACTTGC CGAGCCCGTC
ATCGTCAACA TGAAGGGCCA ATCGACGGCG ACTTCGTCGA GAGAGCCCTT ACCCACGCAA
ACGTCGCATG CGGCTCATTT ATACGCTGCA GAGCCAATCA TTCTCTCACA GCACAATAGC
GGGTCAGCTC GAGAGGTGAG TGATCCTCTC CCCACTATCA CGACCGGAGG AGCAGCGAAC
GAAGCCCGGC CAGGTTGCGC TAGGCCTATG CTTGTCGAGC CGTTCGTTCT GTCTCAAGCA
TCTGGAGGCT CCCCGCGGGC CGTTAGCGAC CCCATCCCGA CACCGACGAC AGGCGGCAAC
GGGGCGGCGC ACGCGCTCAT CTCTCCATAC TATGGCTCCG GTTCCGGCGA GACCTGCAAT
CATGTCGACG AGGTGTTGCC GACCATAACC AGCAAAGGCC GTTTCGGCAT GGTGGTGCCG
GTCACCAATT CAAACGGTGG AGCGACGGCG CGAAACATCG ATGTTGACCC GGTTCCCACC
ATGACCACCG CGAAGGGCGG TGAGTTCGCG TTCATCGCCG CACAGTTCGG CGAGCGGGAA
GGCCAGGCGC CGCGCGTCCA CGATATCGAC CAACCCACGC CGACGATCGC GGCAACCGGC
CATATCAACC TCGTCGAGTC TGGGCCGGAA TACGACATCC TCTTCCGCAT GCTGGAGCCG
CATGAGCTGG CGGCGGCAAT GGGCTTCAAT ACCGAAGAGG CGACGTATGA GTTCGCCGGC
ACCAAGACCG AAAAAATCAA GCAGATTGGC AACGCCGTCT CGGTGGCGAA GATGAAGGCT
TGCGTCGGTG CAATCATGGC AGACGCGGTG CCGAAGCTGA AATCGAGGCC AGATGCTGAA
TTTCTGGAGG CCGCTGAATG A
 
Protein sequence
MARKIKVADL LCGAGGSSAG AKRALEEMGL EMELVCVNHW PTAIDTHQRN FPEARHYIQD 
IATVRPHILV PEGYLDLLMA SPTCTHHSVA RGGKPTSDQQ RSDPWHIVTW LTELRVKRII
IENVWEFCGW GPVNMKTGRP IPSRKGEYFH AWTETLRRLG FELEWRRLNA ADYGDATTRQ
RFILMGRSDG RKIHWPMPTH RKRDEVNADL FSAAEPWRPA REIIDWSIKG RSILNRKKPL
APKTLARVLA GAFKFGWPKP FIDKLMEEIE RSLRYHINWA FEARNAKASK SKRRQRRALA
KDLIRRLRHF RIAPAEYAKG GRAAEPMVIT LRRNGNGTSI SSPIPTVAAN GQHVGLAEPV
IVNMKGQSTA TSSREPLPTQ TSHAAHLYAA EPIILSQHNS GSAREVSDPL PTITTGGAAN
EARPGCARPM LVEPFVLSQA SGGSPRAVSD PIPTPTTGGN GAAHALISPY YGSGSGETCN
HVDEVLPTIT SKGRFGMVVP VTNSNGGATA RNIDVDPVPT MTTAKGGEFA FIAAQFGERE
GQAPRVHDID QPTPTIAATG HINLVESGPE YDILFRMLEP HELAAAMGFN TEEATYEFAG
TKTEKIKQIG NAVSVAKMKA CVGAIMADAV PKLKSRPDAE FLEAAE
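
The gene and protein sequences above can be related through translation table 11, which the record lists for this gene. This is a minimal sketch in plain Python, not the method the database itself uses. It assumes the sequence is pasted in as text with the same spacing as above, builds the codon table from the standard NCBI amino-acid ordering, and reads an initiation codon at the first position as methionine (this gene starts with GTG). The names `translate` and `gc_percent` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (first base varies slowest, bases T, C, A, G).
# Translation table 11 uses the same amino-acid assignments as the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS, reading an initiation codon at position 1 as M."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    print(translate("GTGGCGCGCA AGATCAAAGT C"))
```

Applied to the first 21 bases of the gene sequence, `translate` returns MARKIKV, which matches the start of the listed protein sequence. Passing the full gene sequence to `gc_percent` gives a value to compare against the GC content field.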