Gene Rleg2_4938 details


Gene Information

Locus tag: Rleg2_4938
Symbol:
ID: 6978032
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 578068
End bp: 579135
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 60%
IMG OID: 643394091
Product: DNA-cytosine methyltransferase
Protein accession: YP_002278909
Protein GI: 209546991
COG category: [L] Replication, recombination and repair
COG ID: [COG0270] Site-specific DNA methylase
TIGRFAM ID: [TIGR00675] DNA-methyltransferase (dcm)
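
The length fields can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the listed values are consistent with that reading) and that the CDS ends in a single stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 578068
end_bp = 579135

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1068
print(protein_length_aa)   # 355
```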


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGGAA TAGATTTATT CGCTGGCGGA GGCGGGATGA CCATGGGCGC CAAGCTTTCG 
GGCGTCGACG TGCGGGCGGC GGTGGAGAAC CATCCTTCGG CCTGCTTGAC ATACTCAGCC
AACCATCCCG GTGCGACCCT GCTCGGAACA GACATTGCGA AGGTGGCCAC GATCGACGTC
GGCCCGCGGC ATCAGCCCCT GGTCCTGTTC GGTGGCCCGC CATGCCAGGG GTTTTCGACT
TCGAACCAGC GAACCCGTCA TGCCGACAAT CCGAAGAATT GGCTGTTCCG AGAATTTCTG
CGCATGGTCG AGACCCTCAA ACCTGAATGG GTGGTATTCG AGAACGTCGC GGGGATCCTG
CAGACCGATG GCGGGCGCTT CGCTGAGGCA TTTCGAGAGC AACTCAAGGC GATGGGATAC
AGGATCGCGT TCGGCATCCT GAATGCAGCT GATTTCGGGT GTCCGCAGCG GCGCTCGCGT
TACATCGTAA TCGCCGCCTT AAATTCGGAT CCTGAGCTCC CGAAGGCTGT TCCGAATGTT
GAAGTGCCCA CCCTTTGGCA AGCGATCGGG GATCTTCCTG AATTGCTCAA CGGCGCCACG
GTGGACGAGC TGGAGTACGG CGGCGCGCCG CTTTCCGAAT ACGCTCGACG CATGCGATCA
GACCTTGCGA AGTGCACTGG TCATCTGGTT AGCAGGAACG CGGACTCGAT CGTGGCGAGG
TATGCCCACA TCCCGCAGGG CGGCAACTGG CGGGACGTTC CCGGCATGAT GCGCGACCCA
GTCACCGATC GGCGACGTTA CCACTCAGGC ATCTACAAAC GGCTGGTGCA GGACGCGCCA
TCCGTCGTCA TCGGAAATTT TCGAAAGAAC ATGCTGATCC ATCCCACTCA GGACCGGGGT
CTCTCGGTTC GCGAAGCCGC GCGTCTGCAG AGCTTCTCCG ACAACTACGT CTTCCACGGA
TCGATAGGAT TCCAACAGCA GCAGGTGGGC AACGCGGTGC CTCCTATCCT GGCGAAGGCC
GTGTTTGACA AGGTGGTGGC CATGTCCAGC CAGGGGGGAG GCAAATAA
 
Protein sequence
MIGIDLFAGG GGMTMGAKLS GVDVRAAVEN HPSACLTYSA NHPGATLLGT DIAKVATIDV 
GPRHQPLVLF GGPPCQGFST SNQRTRHADN PKNWLFREFL RMVETLKPEW VVFENVAGIL
QTDGGRFAEA FREQLKAMGY RIAFGILNAA DFGCPQRRSR YIVIAALNSD PELPKAVPNV
EVPTLWQAIG DLPELLNGAT VDELEYGGAP LSEYARRMRS DLAKCTGHLV SRNADSIVAR
YAHIPQGGNW RDVPGMMRDP VTDRRRYHSG IYKRLVQDAP SVVIGNFRKN MLIHPTQDRG
LSVREAARLQ SFSDNYVFHG SIGFQQQQVG NAVPPILAKA VFDKVVAMSS QGGGK
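
The two sequence blocks can be compared programmatically. The following is a minimal sketch, assuming the 10-character grouping is display formatting only; the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (it differs in the permitted start codons), so a standard codon table is sufficient here. Only the first row of each block is used as input.

```python
# Standard codon table, built in TCAG order. Translation table 11 shares
# these codon assignments; it differs only in the permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO
    )
}

def clean(block: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(block.split()).upper()

def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)

# First row of the gene sequence and of the protein sequence in this record.
gene_row = "ATGATTGGAA TAGATTTATT CGCTGGCGGA GGCGGGATGA CCATGGGCGC CAAGCTTTCG"
protein_row = "MIGIDLFAGG GGMTMGAKLS GVDVRAAVEN HPSACLTYSA NHPGATLLGT DIAKVATIDV"

translated = translate(clean(gene_row))
print(translated)                                  # MIGIDLFAGGGGMTMGAKLS
print(clean(protein_row).startswith(translated))   # True
```

Passing the full gene sequence through `clean` and then `gc_fraction` gives a value that can be compared with the GC content field.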