Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_0030 |
Symbol | |
ID | 5320857 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 28334 |
End bp | 31081 |
Gene Length | 2748 bp |
Protein Length | 915 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640788961 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001325725 |
Protein GI | 150395258 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0000355609 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATTTCC TGATGGATGC ATCGAACCGA ACGGGCGACG TCCTTTCCGT GTCCGATCTC GCGAGCGAAG AGAGCCGCTC CACCGCCACG CCGATGATGG AACAGTTCAT CGAGATCAAG GCGAACAACC GGGATTCGCT CCTGTTTTAC CGCATGGGTG ATTTCTACGA GCTGTTTTTC CAGGATGCGG TCGAGGCCTC GCGCGCACTC GGTATCACGC TGACGAAACG CGGACAGCAC ATGGGGCAGG AAATCCCCAT GTGCGGCGTG CCGGTGCATG CGGCTGACGA TTACCTGCAG AAGCTGATTG CGCTCGGCTA TCGCGTCGCG GTCTGCGAAC AGGTGGAGGA CCCTGCCGAG GCGAAGAAGC GCGGCGGCAA ATCGGTCGTG CGCCGCGATG TCGTCCGCCT CGTAACGCCG GGAACGATCA CCGAGGACAA GCTGCTCTCG CCCTCGGAAT CAAACTATCT CATGGCGCTC GCGCGCATCA GGAGCGGCTC GGAGCCCGCT TATGCGCTTG CCTGGATCGA TATTTCGACG GGAATCTTCC GCCTCGCCGA GACCGCGGAG AGCCGGCTTC TTGCCGACAT ATTGCGCATC GAACCACGCG AACTGATCCT GCCGGATACC GTCTTCCACG ATCCGGATCT CAGGCCCGTT TTCGACGTGC TCGGGCGGGT CGCGGTACCG CAGCCGGCCA TCCTTTTCGA CAGTGCGACG GCGGAAGGCC GGATATCACG CTACTACGGC GTCGGGACGC TCGACGGCTT CGGCAGTTTC TCGCGCGCGG AGCTCGCCGC CGCATCGGCG GCAGTCTCCT ATGTCGAAAA GACCCAGCTC CAGGAGCGCC CGGCGCTCGG CATACCGGAA AGGGAAAGCG CCGCCTCGAC CCTCTTCATC GATCCGGCAA CCCGTGCCAA TCTGGAGCTC GCCAAGACGC TGTCGGGCTC GCGCGACGGC AGCCTGCTCA AGTCGCTCGA CCGTACGGTG ACGAGCGGTG GCGCCCGGCT GTTGGCCGAA CGGCTGATGT CACCCCTGAC CGACCCGGAA CGGATCAATC GGCGGCTCGA TTCCATCGAA ATGCTGGCCG ACCAGCCGCG CTTCACGGCC GACGCTCGCG ATGCGCTTCG CAGGGCGCCG GACATGCCGC GCGCCCTGTC GCGGCTCGCG CTTGGCCGCG GCGGCCCTCG CGATCTCGGT GCCATACAGG CGGGCATGCG GGCCGCGGTC GCGATCGCGG CGCTTCTCTC GGGTGCCGAG CTTTCGGTGG AACTGGCTGA AGCGCGTGAC GCGATCGCGG GCTTGCCGCG GGACCTCCTC GCGCGCCTCG ACGCGACCCT TGCGGAGGAA TTGCCGCTTT TGAAGCGCGA TGGCGGTTTC GTCCGCGAAG GTGCTAACGC CGAACTCGAC GAGATGCGCG CTCTGCGCGA CCAGTCGCGC CGCGTGGTTG CCGGTCTTCA GCTCCAATAT TGCGAAGAGA CCGGAATCAA GTCGCTGAAA ATCAAGCATA ACAACGTGCT CGGCTACTTC ATCGAGGTGA CCGCCGGAAA TGCCGGCGCC ATGATCGATA CGGATGCGGG CCGTGCCCGC TTCATCCACC GCCAGACCAT GGCGAACGCC ATGCGCTTCA CCACGACCGA GTTGGCGGAG CTCGAAACCA AGATCGCCAA TGCCGCGGAC CGCGTTCTGG CGATCGAACT CGAGACTTTC GAGGTCATGA CGCGCGAGGT GGTCGCCGAG GCCGAAGCGA TCAAAGCGGC GGCGCTGGCG CTGGCGACGA TCGACGTCTC GGCCGGACTG GCGGTGCTTG CGGAGGAGCA GAACTATACG CGCCCCGCCG TCGACCGCTC GCGCATGTTC GCGATCGACG GGGGCCGCCA CCCCGTGGTG GAGCAGGCGT TGAGACGCCA GGCCGCCAAT CCCTTCGTCG CGAATGGCTG CGACCTTTCC CCGCCCGGTG GGGAAGAGGG CGGCGCGATC TGGCTCCTCA CCGGCCCCAA CATGGGCGGC AAGTCGACTT TCCTGCGGCA GAACGCGCTG ATCGCGATCA TGGCGCAGAT GGGGTCCTTC GTGCCTGCAT CCGCCGCGCA TATCGGCGTC GTCGACCGCC TCTTCTCACG CGTCGGGGCA TCGGACGACC TCGCGCGTGG CCGTTCGACC TTCATGGTCG AAATGGTCGA GACGGCTGCG ATTCTCAACC AGGCGACCGA CCGCTCGCTG GTGATCCTCG ACGAGATCGG CCGCGGCACG GCGACCTTCG ACGGCCTGTC GATCGCCTGG GCGGCTGTCG AGCATCTGCA CGAGGTCAAT CGTTGCCGCG GGCTGTTCGC GACCCATTTC CACGAATTGA CGGTGCTTTC CGAAAAACTC GGCCGGCTTT CCAACGCGAC CATGCGCGTC AAGGAGTGGG ACGGCGACGT CATATTCCTG CATGAAGTGG GGCCAGGGGC AGCCGACCGC TCCTACGGAA TCCAGGTCGC CCGGCTTGCC GGCTTGCCGG CGTCGGTCGT CGCCCGCGCG CGGGACGTTC TCGCCAAGCT TGAAGACGCG GACCGCAAAA ATCCGGCGAG CCAGCTGATC GACGACCTGC CGCTGTTCCA GGTCGCGGTC CGGCGCGAGG AGGCGGCGAG GGCACCGGGA CTTTCCAGGG CGGAGGAGGC CCTGAAGGCG CTCAACCCGG ACGACATGAC GCCGCGCGAG GCGCTCGACG CGCTGTACGC GCTCAAGAAG CAGCTTTCCA ACCGCTGA
|
Protein sequence | MNFLMDASNR TGDVLSVSDL ASEESRSTAT PMMEQFIEIK ANNRDSLLFY RMGDFYELFF QDAVEASRAL GITLTKRGQH MGQEIPMCGV PVHAADDYLQ KLIALGYRVA VCEQVEDPAE AKKRGGKSVV RRDVVRLVTP GTITEDKLLS PSESNYLMAL ARIRSGSEPA YALAWIDIST GIFRLAETAE SRLLADILRI EPRELILPDT VFHDPDLRPV FDVLGRVAVP QPAILFDSAT AEGRISRYYG VGTLDGFGSF SRAELAAASA AVSYVEKTQL QERPALGIPE RESAASTLFI DPATRANLEL AKTLSGSRDG SLLKSLDRTV TSGGARLLAE RLMSPLTDPE RINRRLDSIE MLADQPRFTA DARDALRRAP DMPRALSRLA LGRGGPRDLG AIQAGMRAAV AIAALLSGAE LSVELAEARD AIAGLPRDLL ARLDATLAEE LPLLKRDGGF VREGANAELD EMRALRDQSR RVVAGLQLQY CEETGIKSLK IKHNNVLGYF IEVTAGNAGA MIDTDAGRAR FIHRQTMANA MRFTTTELAE LETKIANAAD RVLAIELETF EVMTREVVAE AEAIKAAALA LATIDVSAGL AVLAEEQNYT RPAVDRSRMF AIDGGRHPVV EQALRRQAAN PFVANGCDLS PPGGEEGGAI WLLTGPNMGG KSTFLRQNAL IAIMAQMGSF VPASAAHIGV VDRLFSRVGA SDDLARGRST FMVEMVETAA ILNQATDRSL VILDEIGRGT ATFDGLSIAW AAVEHLHEVN RCRGLFATHF HELTVLSEKL GRLSNATMRV KEWDGDVIFL HEVGPGAADR SYGIQVARLA GLPASVVARA RDVLAKLEDA DRKNPASQLI DDLPLFQVAV RREEAARAPG LSRAEEALKA LNPDDMTPRE ALDALYALKK QLSNR
|
| |