Gene Rleg_0047 details


Gene Information

Locus tag: Rleg_0047
Symbol:
ID: 8011294
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 43656
End bp: 46382
Gene Length: 2727 bp
Protein Length: 908 aa
Translation table: 11
GC content: 65%
IMG OID: 644822637
Product: DNA mismatch repair protein MutS
Protein accession: YP_002973897
Protein GI: 241202801
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
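
The coordinate and length fields above agree with one another, which a few lines of arithmetic can confirm. This is only an illustrative sketch: it assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon. The variable names are hypothetical and not part of the database record.

```python
# Values copied from the record above; the names are illustrative only.
start_bp = 43656
end_bp = 46382
gene_length_bp = 2727
protein_length_aa = 908

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches "Gene Length"
print(span_bp % 3 == 0)                          # whole number of codons
print(expected_protein_aa == protein_length_aa)  # matches "Protein Length"
```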


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.967716
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACGCTC GAATAGACAC AGGGAATGAA GCCTTTTCGG CTGCCGAGCT CGCCACGGCG 
GAAAGCCGTG CCTCGGCGAC GCCGATGATG GAGCAATTTA TCGAGATCAA GGCGAACAAT
CCGGGTTCGC TGCTGTTCTA TCGCATGGGC GATTTCTACG AGCTGTTCTT CGAGGATGCG
CTGGAAGCTT CACGCGCGCT CGGTATCACG CTGACGAAGC GCGGCCAGCA CATGGGCCAG
GATATCCCGA TGTGCGGGGT TCCGGTGCAT GCGGCCGACG ATTATCTGCA GAAGCTGATT
TCGCTCGGCT TCCGCGTCGC CGTCTGCGAG CAGATCGAAG ATCCGGCCGA AGCAAAAAAA
CGCGGCGCGA AATCCGTCGT CAAGCGTGAC GTCGTCCGCC TCGTCACTCC AGGCACGATC
ACCGAGGAAA AGCTGCTTTC GCCCTCGGAA TCCAACTATC TGATGGCGCT GACCCGCATT
CGCGCCAGCG GCGAGGCCTT GCTGGCGCTT GCCTGGATCG ATATTTCCAC CGGCGTCTTC
CGGCTTGCCG AGACCGAAGC CTCGCGCCTG CTTGCCGATA TCCTGCGCAT CGATCCGCGT
GAGCTCATCC TTCCCGACAC GATCTTCCAC GATCCCGAAC TCAAACCGGT CTTCGACGTG
CTCGGCCGGA CAGCCGTGCC GCAGCCTTCC GTGCTCTTCG ACAGCGCCAG CGCCGAAGGC
CGGATCGCGC GGTATTTCGG CGTCGCGACG CTCGACGGCT TCGGCACCTT CTCACGCGCC
GAGCTGGCAG CGGCCGCGGC CGCCGTCGCC TATGTCGAGA AGACGCAGAT TGCCGAGCGG
CCGCCGCTTG GAAAACCGGA ACGGGAAAGT GCAGCCTCGA CGCTGTTCAT CGATCCCGCC
ACCCGCGCCA ATCTGGAGCT TGCCCGCACG CTTTCCGGCG ATCGCAACGG TTCGCTGTTG
AAGGCAATCG ACCGCACCGT CACCGGCGGC GGCGCCAGGC TTCTTGCCGA ACGGCTGATG
TCGCCGCTGA CCGACCCGGC CCGCATCAAT GCGCGGCTCG ATTCGATCGG TTTCCTGATC
GACGAACCCC TGCTTTGCGG CAATCTGCGC GACACGCTGA AACATGTTCC CGACATGCCG
CGCGCTCTGT CGCGCCTGGC GCTCGACCGC GGCGGCCCGC GCGATCTCTG GGCGATCCGC
CAGGGCCTCG AAGCGGCGGG CGGAATTGCG GCGATGCTCG GCAAGGCGAT GCTGCCCGAA
GAACTCGGCC AGGCGCTATC CGGGCTGCAG GCCCTTCCCG CGGCAGTGGA AAAGCTGCTT
GCCGAAACGC TCGCCGACGA ATTGCCGCTG TTGAAACGCG ATGGCGGCTT CCTTCGCGAC
GGCGCCAGCG CCGAGCTCGA CGAGGTCCGG GCACTGCGCG ACCAGTCCCG TCGCGTCATC
GCCGGCCTGC AATTGCAATA TGCCGAGGAG ACCGGCATCC GGTCGCTGAA GATCAAGCAC
AACAACGTCC TCGGTTATTT CATCGAAGTC ACCGCCGGCA ATGCCTCGCC GATGACGGAG
ACGGCGGAGG CGAAGGCCCG CTTCATCCAC CGCCAGACGA TGGCGAACGC CATGCGCTTC
ACCACGACCG AACTTGCCGA TCTCGAAAGC CGCATCGCCA ATGCCGCCGA CCAGGCGCTG
ACGATCGAAC TCGAAGCCTT CGACAGGATG ACGGCGGCCG TCGTTGCGGA AGCCGAGGCG
ATCAAGTCGG GCGCCCGGGC GCTTGCCGTC ATCGACGTCG CCGCAGGCTT GGCGCTTCTC
GCCGAGGAGC AGGCCTATTG CCGCCCGCAG GTCGACGGCT CGAAGATGTT TGCCATCGAG
GGCGGGCGCC ATCCGGTCGT CGAGCAGGCG CTGCGGCGGC AAGCGGGCGG CCCCTTCGTC
GCCAATCATT GCGATCTGTC GCCGAGGACC GGCGACCGGG ACGGGGCGAT CTGGCTGCTG
ACCGGCCCGA ACATGGGCGG CAAGTCGACC TTCCTGCGCC AGAACGCGCT GATATCAATC
CTGGCGCAGA TGGGCTCCTT CGTGCCGGCG ACATCGGCCC ATATCGGCAT CGTCGACCGG
CTTTTCTCGC GCGTCGGCGC CTCCGACGAT CTGGCGCGTG GGCGCTCGAC CTTCATGGTC
GAGATGGTCG AAACGGCGGC GATCCTCAAC CAGGCGAGCG ACCGTTCCCT CGTCATCCTC
GACGAGATCG GCCGCGGCAC CGCCACGTTC GACGGTCTTT CGATCGCCTG GGCCGCCGTC
GAGCACCTGC ATGAGGCCAA TCGCTGCCGC GGCCTCTTCG CCACGCATTT CCACGAACTG
ACCGTGCTTT CGGAAAAGCT TGGCCGGCTG TCGAACGCGA CGATGCGCGT CAAAGAATGG
GACGGCGACG TCATCTTCCT GCACGAGGTC GGGCCGGGTG CGGCGGACCG CTCCTACGGC
ATCCAGGTCG CGCGCCTTGC CGGGCTTCCG GCCTCGGTCG TGGCGCGCGC CCGCGACGTG
CTCACCCGGC TGGAAGATGC CGATCGCAAG AACCCGGCGA GCCAGTTGAT CGACGACCTG
CCGCTCTTCC AGGTGGCGGT GCGCCGCGAG GAAACCGCCC GCGGGACCTC CAAGGTCGAG
GAGGCACTGA AGGCGATGAG CCTTGACGAC ATGACACCGC GCGAGGCAAT GGATGCGCTT
TACGACCTCA AGAAGAAGTT GAAATAG
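
The gene sequence is printed in blocks of ten bases, so the spacing has to be stripped before any base counting. The sketch below shows one way to do that and to compute a GC fraction. The function names `clean_sequence` and `gc_fraction` are hypothetical, and the sample is only the first 60 bases of the listing, so its result is not expected to equal the whole-gene GC content reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing above (60 bases).
first_line = "GTGAACGCTC GAATAGACAC AGGGAATGAA GCCTTTTCGG CTGCCGAGCT CGCCACGGCG"
sample = clean_sequence(first_line)

print(len(sample), gc_fraction(sample))
```

Running the same two functions over the full listing, with all lines joined, would give the value to compare against the GC content field.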
 
Protein sequence
MNARIDTGNE AFSAAELATA ESRASATPMM EQFIEIKANN PGSLLFYRMG DFYELFFEDA 
LEASRALGIT LTKRGQHMGQ DIPMCGVPVH AADDYLQKLI SLGFRVAVCE QIEDPAEAKK
RGAKSVVKRD VVRLVTPGTI TEEKLLSPSE SNYLMALTRI RASGEALLAL AWIDISTGVF
RLAETEASRL LADILRIDPR ELILPDTIFH DPELKPVFDV LGRTAVPQPS VLFDSASAEG
RIARYFGVAT LDGFGTFSRA ELAAAAAAVA YVEKTQIAER PPLGKPERES AASTLFIDPA
TRANLELART LSGDRNGSLL KAIDRTVTGG GARLLAERLM SPLTDPARIN ARLDSIGFLI
DEPLLCGNLR DTLKHVPDMP RALSRLALDR GGPRDLWAIR QGLEAAGGIA AMLGKAMLPE
ELGQALSGLQ ALPAAVEKLL AETLADELPL LKRDGGFLRD GASAELDEVR ALRDQSRRVI
AGLQLQYAEE TGIRSLKIKH NNVLGYFIEV TAGNASPMTE TAEAKARFIH RQTMANAMRF
TTTELADLES RIANAADQAL TIELEAFDRM TAAVVAEAEA IKSGARALAV IDVAAGLALL
AEEQAYCRPQ VDGSKMFAIE GGRHPVVEQA LRRQAGGPFV ANHCDLSPRT GDRDGAIWLL
TGPNMGGKST FLRQNALISI LAQMGSFVPA TSAHIGIVDR LFSRVGASDD LARGRSTFMV
EMVETAAILN QASDRSLVIL DEIGRGTATF DGLSIAWAAV EHLHEANRCR GLFATHFHEL
TVLSEKLGRL SNATMRVKEW DGDVIFLHEV GPGAADRSYG IQVARLAGLP ASVVARARDV
LTRLEDADRK NPASQLIDDL PLFQVAVRRE ETARGTSKVE EALKAMSLDD MTPREAMDAL
YDLKKKLK
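
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates this on the first and last lines of the gene listing. It assumes the standard amino acid assignments (which table 11 shares with the standard code), assumes that an initiator codon such as GTG at the first position is read as methionine, and uses a start-codon set that should be checked against the NCBI definition of table 11. The `translate` function and its `at_gene_start` parameter are hypothetical names for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: initiator codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str, at_gene_start: bool = True) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiator codon at the first position is read as methionine.
        if i == 0 and at_gene_start and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


first_line = "GTGAACGCTC GAATAGACAC AGGGAATGAA GCCTTTTCGG CTGCCGAGCT CGCCACGGCG"
last_line = "TACGACCTCA AGAAGAAGTT GAAATAG"

print(translate(first_line))                       # start of the protein
print(translate(last_line, at_gene_start=False))   # end of the protein
```

The two printed strings can be compared with the first 20 and last 8 residues of the protein sequence above.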