Gene Information |
Locus tag | Rleg_0047 |
Symbol | |
ID | 8011294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 43656 |
End bp | 46382 |
Gene Length | 2727 bp |
Protein Length | 908 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644822637 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_002973897 |
Protein GI | 241202801 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
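The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that contributes no residue. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 43656
end_bp = 46382

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS is a whole number of codons; the last one is the stop codon.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # 2727 0 908
```

Both results agree with the Gene Length (2727 bp) and Protein Length (908 aa) fields.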
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.967716 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACGCTC GAATAGACAC AGGGAATGAA GCCTTTTCGG CTGCCGAGCT CGCCACGGCG GAAAGCCGTG CCTCGGCGAC GCCGATGATG GAGCAATTTA TCGAGATCAA GGCGAACAAT CCGGGTTCGC TGCTGTTCTA TCGCATGGGC GATTTCTACG AGCTGTTCTT CGAGGATGCG CTGGAAGCTT CACGCGCGCT CGGTATCACG CTGACGAAGC GCGGCCAGCA CATGGGCCAG GATATCCCGA TGTGCGGGGT TCCGGTGCAT GCGGCCGACG ATTATCTGCA GAAGCTGATT TCGCTCGGCT TCCGCGTCGC CGTCTGCGAG CAGATCGAAG ATCCGGCCGA AGCAAAAAAA CGCGGCGCGA AATCCGTCGT CAAGCGTGAC GTCGTCCGCC TCGTCACTCC AGGCACGATC ACCGAGGAAA AGCTGCTTTC GCCCTCGGAA TCCAACTATC TGATGGCGCT GACCCGCATT CGCGCCAGCG GCGAGGCCTT GCTGGCGCTT GCCTGGATCG ATATTTCCAC CGGCGTCTTC CGGCTTGCCG AGACCGAAGC CTCGCGCCTG CTTGCCGATA TCCTGCGCAT CGATCCGCGT GAGCTCATCC TTCCCGACAC GATCTTCCAC GATCCCGAAC TCAAACCGGT CTTCGACGTG CTCGGCCGGA CAGCCGTGCC GCAGCCTTCC GTGCTCTTCG ACAGCGCCAG CGCCGAAGGC CGGATCGCGC GGTATTTCGG CGTCGCGACG CTCGACGGCT TCGGCACCTT CTCACGCGCC GAGCTGGCAG CGGCCGCGGC CGCCGTCGCC TATGTCGAGA AGACGCAGAT TGCCGAGCGG CCGCCGCTTG GAAAACCGGA ACGGGAAAGT GCAGCCTCGA CGCTGTTCAT CGATCCCGCC ACCCGCGCCA ATCTGGAGCT TGCCCGCACG CTTTCCGGCG ATCGCAACGG TTCGCTGTTG AAGGCAATCG ACCGCACCGT CACCGGCGGC GGCGCCAGGC TTCTTGCCGA ACGGCTGATG TCGCCGCTGA CCGACCCGGC CCGCATCAAT GCGCGGCTCG ATTCGATCGG TTTCCTGATC GACGAACCCC TGCTTTGCGG CAATCTGCGC GACACGCTGA AACATGTTCC CGACATGCCG CGCGCTCTGT CGCGCCTGGC GCTCGACCGC GGCGGCCCGC GCGATCTCTG GGCGATCCGC CAGGGCCTCG AAGCGGCGGG CGGAATTGCG GCGATGCTCG GCAAGGCGAT GCTGCCCGAA GAACTCGGCC AGGCGCTATC CGGGCTGCAG GCCCTTCCCG CGGCAGTGGA AAAGCTGCTT GCCGAAACGC TCGCCGACGA ATTGCCGCTG TTGAAACGCG ATGGCGGCTT CCTTCGCGAC GGCGCCAGCG CCGAGCTCGA CGAGGTCCGG GCACTGCGCG ACCAGTCCCG TCGCGTCATC GCCGGCCTGC AATTGCAATA TGCCGAGGAG ACCGGCATCC GGTCGCTGAA GATCAAGCAC AACAACGTCC TCGGTTATTT CATCGAAGTC ACCGCCGGCA ATGCCTCGCC GATGACGGAG ACGGCGGAGG CGAAGGCCCG CTTCATCCAC CGCCAGACGA TGGCGAACGC CATGCGCTTC ACCACGACCG AACTTGCCGA TCTCGAAAGC CGCATCGCCA ATGCCGCCGA CCAGGCGCTG ACGATCGAAC TCGAAGCCTT CGACAGGATG ACGGCGGCCG TCGTTGCGGA AGCCGAGGCG ATCAAGTCGG GCGCCCGGGC GCTTGCCGTC ATCGACGTCG CCGCAGGCTT GGCGCTTCTC 
GCCGAGGAGC AGGCCTATTG CCGCCCGCAG GTCGACGGCT CGAAGATGTT TGCCATCGAG GGCGGGCGCC ATCCGGTCGT CGAGCAGGCG CTGCGGCGGC AAGCGGGCGG CCCCTTCGTC GCCAATCATT GCGATCTGTC GCCGAGGACC GGCGACCGGG ACGGGGCGAT CTGGCTGCTG ACCGGCCCGA ACATGGGCGG CAAGTCGACC TTCCTGCGCC AGAACGCGCT GATATCAATC CTGGCGCAGA TGGGCTCCTT CGTGCCGGCG ACATCGGCCC ATATCGGCAT CGTCGACCGG CTTTTCTCGC GCGTCGGCGC CTCCGACGAT CTGGCGCGTG GGCGCTCGAC CTTCATGGTC GAGATGGTCG AAACGGCGGC GATCCTCAAC CAGGCGAGCG ACCGTTCCCT CGTCATCCTC GACGAGATCG GCCGCGGCAC CGCCACGTTC GACGGTCTTT CGATCGCCTG GGCCGCCGTC GAGCACCTGC ATGAGGCCAA TCGCTGCCGC GGCCTCTTCG CCACGCATTT CCACGAACTG ACCGTGCTTT CGGAAAAGCT TGGCCGGCTG TCGAACGCGA CGATGCGCGT CAAAGAATGG GACGGCGACG TCATCTTCCT GCACGAGGTC GGGCCGGGTG CGGCGGACCG CTCCTACGGC ATCCAGGTCG CGCGCCTTGC CGGGCTTCCG GCCTCGGTCG TGGCGCGCGC CCGCGACGTG CTCACCCGGC TGGAAGATGC CGATCGCAAG AACCCGGCGA GCCAGTTGAT CGACGACCTG CCGCTCTTCC AGGTGGCGGT GCGCCGCGAG GAAACCGCCC GCGGGACCTC CAAGGTCGAG GAGGCACTGA AGGCGATGAG CCTTGACGAC ATGACACCGC GCGAGGCAAT GGATGCGCTT TACGACCTCA AGAAGAAGTT GAAATAG
|
Protein sequence | MNARIDTGNE AFSAAELATA ESRASATPMM EQFIEIKANN PGSLLFYRMG DFYELFFEDA LEASRALGIT LTKRGQHMGQ DIPMCGVPVH AADDYLQKLI SLGFRVAVCE QIEDPAEAKK RGAKSVVKRD VVRLVTPGTI TEEKLLSPSE SNYLMALTRI RASGEALLAL AWIDISTGVF RLAETEASRL LADILRIDPR ELILPDTIFH DPELKPVFDV LGRTAVPQPS VLFDSASAEG RIARYFGVAT LDGFGTFSRA ELAAAAAAVA YVEKTQIAER PPLGKPERES AASTLFIDPA TRANLELART LSGDRNGSLL KAIDRTVTGG GARLLAERLM SPLTDPARIN ARLDSIGFLI DEPLLCGNLR DTLKHVPDMP RALSRLALDR GGPRDLWAIR QGLEAAGGIA AMLGKAMLPE ELGQALSGLQ ALPAAVEKLL AETLADELPL LKRDGGFLRD GASAELDEVR ALRDQSRRVI AGLQLQYAEE TGIRSLKIKH NNVLGYFIEV TAGNASPMTE TAEAKARFIH RQTMANAMRF TTTELADLES RIANAADQAL TIELEAFDRM TAAVVAEAEA IKSGARALAV IDVAAGLALL AEEQAYCRPQ VDGSKMFAIE GGRHPVVEQA LRRQAGGPFV ANHCDLSPRT GDRDGAIWLL TGPNMGGKST FLRQNALISI LAQMGSFVPA TSAHIGIVDR LFSRVGASDD LARGRSTFMV EMVETAAILN QASDRSLVIL DEIGRGTATF DGLSIAWAAV EHLHEANRCR GLFATHFHEL TVLSEKLGRL SNATMRVKEW DGDVIFLHEV GPGAADRSYG IQVARLAGLP ASVVARARDV LTRLEDADRK NPASQLIDDL PLFQVAVRRE ETARGTSKVE EALKAMSLDD MTPREAMDAL YDLKKKLK
|
| |
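The gene sequence starts with GTG while the protein starts with M. Translation table 11 accepts GTG as an alternative start codon, and an initiator codon is read as methionine. The following is a minimal sketch of that translation applied to the first 30 bases of the gene sequence as listed. It assumes the sequence is shown 5' to 3' in coding orientation (consistent with it starting at GTG and ending at TAG even though the gene is on the minus strand) and that the first codon is a valid initiator. The function name `translate_cds` is hypothetical, not part of any library.

```python
# Codon table in NCBI order (bases T, C, A, G). The amino acid assignments
# of translation table 11 are the same as those of the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(spaced_dna):
    """Translate a coding sequence, ignoring the spaces used for display."""
    dna = "".join(spaced_dna.split()).upper()
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    # Assumption: the first codon is an initiator (here GTG), read as Met.
    if residues:
        residues[0] = "M"
    return "".join(residues)


# First three 10-base groups of the gene sequence in the record.
prefix = "GTGAACGCTC GAATAGACAC AGGGAATGAA"
print(translate_cds(prefix))  # MNARIDTGNE
```

The output matches the first ten residues of the protein sequence in the record.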