Gene Information |
Locus tag | Rleg2_0032 |
Symbol | |
ID | 6978741 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 30528 |
End bp | 33170 |
Gene Length | 2643 bp |
Protein Length | 880 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643394743 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_002279561 |
Protein GI | 209547644 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
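The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the protein length excludes the terminal stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 30528, End bp 33170.
length = gene_length(30528, 33170)
print(length)                  # 2643, matching "Gene Length"
print(protein_length(length))  # 880, matching "Protein Length"
```

Under these assumptions the 2643 bp gene corresponds to 881 codons, one of which is the stop codon, leaving the 880 residues reported.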
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.317103 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGATGGAGC AATATATCGA GATCAAGGCG AACAATCCGG GTTCGCTGCT CTTCTATCGC ATGGGCGATT TCTACGAGCT GTTCTTCGAG GATGCGCTGG AAGCCTCCCG CGCGCTCGGC ATCACGCTGA CGAAGCGCGG CCAGCACATG GGCCAGGATA TCCCGATGTG CGGCGTTCCG GTGCATGCGG CCGACGATTA CCTGCAGAAA CTGATCTCGC TCGGTTTCCG CGTCGCCGTC TGCGAGCAGA TCGAAGATCC GGCCGAAGCG AAAAAACGCG GCGGCAAATC CGTCGTCAAG CGCGATGTCG TCCGCCTGGT CACGCCGGGC ACGATCACCG AGGAAAAGCT GCTTTCGCCC TCGGAATCCA ACTATCTGAT GGCGCTGACC CGCATTCGCG GCGGGGCCGA ACCGTTGCTG GCGCTTGCCT GGATCGACAT TTCCACCGGC GTCTTCCGGC TGGCCGAAAC CGAAGCCTCG CGGCTGCTTG CCGATATCCT GCGCATCGAT CCGCGCGAAC TGATCCTGCC GGAGACGATC TTCCACGATC CGGAACTCAA GCCGGTCTTC GACGTGCTCG GCCGCACCGC GGTGCCGCAG CCTTCCGTGC TCTTCGACAG CGCCAGCGCC GAAGGCCGGA TCGCGCGGTA TTTCGGCGTC TCGACGCTCG ACGGCTTCGG CACCTTCTCG CGCGCCGAAC TGGCGGCGGC TGCCGCCGCC GTCGCCTATG TCGAGAAGAC CCAGATCGCC GAGCGGCCGC CGCTTGGAAA GCCGGAACGG GAAAGTGCGG CGTCGACACT GTTCATCGAT CCCGCCACCC GCGCCAACCT GGAGCTGGCC CGCACGCTGT CGGGCGACCG CAACGGCTCA TTGCTGAAAG CGATCGACCG CACCGTTACC GGCGGCGGCG CGCGGCTTCT GGCCGAGCGG CTGATGTCGC CGCTGACCGA CCCCGCCCGC ATCAATGCGC GGCTCGATTC GATCGGCTTC CTGATCGACG AACCCTCGCT CTGCGGCAAT CTGCGCGACA CGCTGAAACA TGTGCCCGAC ATGCCGCGCG CCCTATCCCG CCTGGCGCTC GACCGCGGCG GCCCGCGCGA TCTCTCAGCC ATCCGCCAGG GCCTGCAAGC GGCGAACGAC GTGGCAGCGA TGCTTGCAAG CGCGATGCTG CCGGAAGAGC TTGGCCAGGC GCTGTCCGGG CTGCAGGCCC TTCCCGCAGC GCTCGAAACC CTGCTGGCCG AGACGCTCGC CGACGAATTG CCGCTGCTGA AGCGCGACGG CGGCTTCCTG CGCGACGGCG CCAGTGCCGA GCTCGACGAG GTCCGGGCGC TGCGCGACCA GTCGCGCCGG GTGATCGCGG GCCTGCAACT GCAATATGCC GAGGAAATCG GCATCCGGTC GCTGAAGATC AAACACAACA ACATCCTCGG CTATTTCATC GAGGTGACCG CCGGCAATGC CTCGCCGATG ACGGACACGG CTGAGGCCAA GGCCCGCTTC ATCCACCGCC AGACGATGGC GAGCGCGATG CGCTTCACCA CCACCGAACT CGCCGATCTC GAAAGCCGCA TCGCCAATGC CGCCGACCGG GCGCTGACGA TCGAGCTCGC CGCCTTCGAG AGGATGACGG CGGCCGTGGT TGCGGAAGCC GAGGCGATCA AATCCGGCGC GAGGGCGCTT GCCGTCATCG ACGTTGCAGC TAGCCTGGCG CTTCTCGCCG AGGAGCAGGC CTATTGCCGT CCGCAGGTCG ACGGCTCGAA GATGTTTGCC ATCGATGGCG GCCGCCATCC GGTCGTCGAA 
CAGGCGCTGC GGCGGCAGGC GAGCGGCCCC TTCGTCGCCA ACAATTGCGA TCTCTCGCCG AAAGCAGGCG ACAAGGACGG GGCGATCTGG CTCTTGACCG GCCCGAACAT GGGCGGCAAA TCGACCTTCC TGCGGCAGAA CGCGCTGATA GCCATCCTGG CGCAGATGGG CTCCTTCGTG CCGGCGACCT CGGCCCATAT CGGCATCGTC GACCGGCTTT TCTCGCGCGT CGGCGCCTCC GACGACCTGG CGCGCGGGCG CTCCACCTTC ATGGTCGAGA TGGTCGAGAC CGCTGCGATC CTCAACCAGG CGAGCGACCG TTCACTCGTC ATTCTCGACG AGATCGGCCG CGGCACCGCC ACCTTCGACG GCCTGTCGAT CGCCTGGGCC TCCGTCGAGC ACCTGCATGA GGCCAACCGC TGCCGCGGCC TCTTCGCCAC GCATTTCCAT GAGCTGACCG TGCTTTCGGA AAAGCTTGTC CGGCTATCGA ACGCCACGAT GCGCGTCAAG GAATGGGACG GCGACGTCAT CTTCCTACAT GAGGTCGGCC CGGGTGCGGC CGACCGCTCC TACGGCATCC AGGTCGCCCG CCTTGCCGGG CTTCCGGCTT CGGTGGTGAC GCGGGCCCGC GATGTGCTCA CCCGCCTCGA GGATGCCGAC CGCAAGAACC CGGCGAGCCA GCTGATCGAC GACCTGCCGC TCTTCCAGGT GGCGGTGCGC CGCGAGGATA CCGCGCGCGG GCCGTCCAAG GTCGAGGAGA CGCTGAAGGC GATGAGCCTT GACGACATGA CGCCGCGCGA GGCAATGGAC GCGCTTTACG ACCTCAAGAA AAAATTGAAA TAG
|
Protein sequence | MMEQYIEIKA NNPGSLLFYR MGDFYELFFE DALEASRALG ITLTKRGQHM GQDIPMCGVP VHAADDYLQK LISLGFRVAV CEQIEDPAEA KKRGGKSVVK RDVVRLVTPG TITEEKLLSP SESNYLMALT RIRGGAEPLL ALAWIDISTG VFRLAETEAS RLLADILRID PRELILPETI FHDPELKPVF DVLGRTAVPQ PSVLFDSASA EGRIARYFGV STLDGFGTFS RAELAAAAAA VAYVEKTQIA ERPPLGKPER ESAASTLFID PATRANLELA RTLSGDRNGS LLKAIDRTVT GGGARLLAER LMSPLTDPAR INARLDSIGF LIDEPSLCGN LRDTLKHVPD MPRALSRLAL DRGGPRDLSA IRQGLQAAND VAAMLASAML PEELGQALSG LQALPAALET LLAETLADEL PLLKRDGGFL RDGASAELDE VRALRDQSRR VIAGLQLQYA EEIGIRSLKI KHNNILGYFI EVTAGNASPM TDTAEAKARF IHRQTMASAM RFTTTELADL ESRIANAADR ALTIELAAFE RMTAAVVAEA EAIKSGARAL AVIDVAASLA LLAEEQAYCR PQVDGSKMFA IDGGRHPVVE QALRRQASGP FVANNCDLSP KAGDKDGAIW LLTGPNMGGK STFLRQNALI AILAQMGSFV PATSAHIGIV DRLFSRVGAS DDLARGRSTF MVEMVETAAI LNQASDRSLV ILDEIGRGTA TFDGLSIAWA SVEHLHEANR CRGLFATHFH ELTVLSEKLV RLSNATMRVK EWDGDVIFLH EVGPGAADRS YGIQVARLAG LPASVVTRAR DVLTRLEDAD RKNPASQLID DLPLFQVAVR REDTARGPSK VEETLKAMSL DDMTPREAMD ALYDLKKKLK
|
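The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, not the database's own method, of how the gene sequence could be normalized and sanity-checked against the GC content and CDS fields of the record. All function names are hypothetical, and the start codon set lists only the most common bacterial initiators (translation table 11 permits a few others).

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks that separate the printed blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, common start codon, standard stop codon."""
    common_starts = {"ATG", "GTG", "TTG"}  # assumption: common initiators only
    stops = {"TAA", "TAG", "TGA"}          # stop codons in translation table 11
    return len(seq) % 3 == 0 and seq[:3] in common_starts and seq[-3:] in stops


# The first two printed blocks of the gene sequence, used as a small example.
first_blocks = clean_sequence("ATGATGGAGC AATATATCGA")
print(len(first_blocks))          # 20
print(gc_fraction("ATGATGGAGC"))  # 0.5
```

Running `gc_fraction` on the full cleaned gene sequence should give a value close to the 66% reported in the record, and `looks_like_complete_cds` should accept it, since the sequence as printed begins with ATG and ends with TAG.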