Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_1171 |
Symbol | mutS |
ID | 4206504 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 1317994 |
End bp | 1320726 |
Gene Length | 2733 bp |
Protein Length | 910 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 642565727 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_698493 |
Protein GI | 110803440 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTTAA CGCCTATGAT GCGTCAGTAT TTTGAAATAA AGGAAAATTA TAAGGATTGT ATTCTATTTT TTAGACTTGG AGACTTCTAT GAGATGTTCT TTGAGGATGC AGAAACAGCT GCAAGGGAAT TAGAATTAGT TCTTACTGGA AGAGATTGTG GCTTAGAAAA AAGAGCACCT ATGTGTGGAA TTCCGTTTCA TGCATCTAAT TCATATATAG GAAGATTAGT AGCTAAAGGA TATAAGGTTG CTATTTGTGA ACAAGTTGAA GATCCTAAAG TTGCTAAAGG AATTGTTAAA AGGGATGTTA TAAAGGTTAT AACTCCTGGA ACATATACAG ATTCTTCTTT TGTTGAGGAA ACAAAGAACA ATTATATAAT GACTATATAT TCAGATTTAG AGAGAAATAG ATGTTCACTA GCTATAACAG ATATTTCTAC TGGAGATTTC TTAGCTACAG AGGGAGAGTT AGAAAAGGGA GTTATTTTAG ATGAAATATC TAAGTTTAAT CCTAAAGAAA TAATACTTTT AGATTCTTTA GATCAAGAGC TTATAAAAGA TATAACATTA ACTACTCCAG CCTTAATAAG TAGAAAGCCT ATAGAATATT TTGAAGAAAA ATTTGAGGAA GTATTAAATT CTCAATTTGG AGAAAAATCA AACTCTCTAA GTTTAATGAT TAAGAAATCA AGTAATGCTT TAGTTAAATA CATACTAGAT ACTCAAAAAA TAAGCTTAAC TAATATAAAT GATATAGAAG TATATAGTTT GGTTGATTTT ATGACTATAG ATTTAAGTTC TAGAAGAAAT TTAGAGCTTA CAGAGAATTT AAGAGAAAAA AGCAAGAAAG GTTCTCTTCT TTGGGTATTA GATAAGACAG AAACTTCTAT GGGAAGTAGA ATGCTTAGAA GATGGATTGA AGAACCTCTT GTTAATAAGG AAAAGATAAC TTTAAGATTA AATGCTGTTG AAGAATTATT TAATGATCTT TCTTTAAACG ACTCTCTTAA AGAAGCTTTA CATGATATTT ATGATATAGA GAGAATATTA GGTAAAATTT CAAACAAGAA TGCTAATGCT AAGGATTTAA TAGCATTAAA AACTTCTATA GGTAAAATTC CTAATGTAAA AGGAATTATA GAAAATTGTA CTTCAAGTCT TCTAAAAAAT TATCACCATA ATTTAGATGA TTTAAGAGAT ATTTATGAGC TTTTAGAAAA ATCTATAAAA GAAGATCCCT CTCTAACATT AAAAGACGGT GATTTAATAA AGGATGGATT TAATGGTGAA ATAGATGAAT TAAGACTTGC TAAGACTAAC GGTAAGGATT GGATATCTAG TTTAGAAAAT AGAGAAAGAG AGTTTACTGG CATAAAATCT TTAAAGGTAG GATTTAATAA GGTTTTTGGT TATTATATAG AAATATCTAA GGCTAATTAT AGTTCTATTC CTGAAGGAAG ATATATAAGA AAGCAAACTT TAGCTAATGC AGAGAGATTT ATTACTCCAG AACTTAAGGA AATAGAAGAA AAACTTTTAG GAGCTAGTGA AAAACTTTGC TCTTTAGAAT ATGATATTTT CCTAGATATA AGAAATGAAG TTGAAAATCA TATAGATAGG TTAAAGACTA CTGCTAAGAT AATTGCTGAA CTAGATTGTA TAAGCAATTT AGCCTTTGTA GCTTTAGAAA ATGATTTTAT AAAACCTGAA ATTAATGAAG ATGGGGAAAC TAAAATAGAA AATGGAAGAC ACCCTGTTGT TGAAAAGGTA ATTCCTAAGG GAGAGTTTAT TCCTAATGAC ACTATTATAA ATAAAGATGA TAATCAACTT CTTATAATAA CAGGTCCTAA TATGGCTGGT AAATCAACAT ATATGAGACA GGTAGCTATT ATTACTTTAA TGTGTCAAAT AGGTTCTTTT GTGCCAGCTA GTAAGGCTAA TATAAGTGTA GTAGATAAAA TCTTTACTAG AATAGGAGCT TCTGATGATT TAGCAGGTGG TAAAAGTACA TTTATGGTTG AAATGTGGGA AGTTTCTAAC ATTTTAAAAA ATGCTACAGA AAATAGCTTG GTTCTTTTAG ATGAAGTTGG AAGAGGAACG AGTACATATG ATGGATTAAG TATAGCTTGG TCTGTTATAG AGTATATATG TAAGAATAAA AATTTAAGAT GTAAAACTCT TTTTGCTACT CATTATCATG AGTTAACTAA ATTAGAAGGA GAAATTCACG GAGTTAGAAA CTATTCAGTA GCTGTGAAGG AAGTTGATAA CAATATTATC TTCTTAAGAA AAATAATAGA AGGTGGAGCA GATCAATCTT ATGGTATTGA GGTTGCTAAA CTTGCTGGTA TTCCAGACGA GGTAATAAAT AGAGCTAAGG AAATCTTAGA AACTCTTGAA ATGGAATCTT CAAAGGATAA CTTAGATTTA GCTCTTAAAG AAGTAAATGC TTCAAAAGAA GAAATGAAGG AAGCCTCTAT TGAAGCCTCC TATGAAGTTA AAGAAACCAT AGTAGAAGAG GATAAAATTG AGATTATAGA AGAAATTATT TCAAAATCTT CGGATGCTAA AACACATAAA AAAGAAGATG ATCAAATACA ATTAGATTTT TCTGCCATAG GAAAAGATAA TTTGATAAAA GAACTTTCAG AAGTTGATAT TTTATCTTTA AATCCTATGG AGGCTATGAA TAGATTATAT GCTCTAGTTA AAGAAGCTAA AAATTTAATT TAG
|
Protein sequence | MALTPMMRQY FEIKENYKDC ILFFRLGDFY EMFFEDAETA ARELELVLTG RDCGLEKRAP MCGIPFHASN SYIGRLVAKG YKVAICEQVE DPKVAKGIVK RDVIKVITPG TYTDSSFVEE TKNNYIMTIY SDLERNRCSL AITDISTGDF LATEGELEKG VILDEISKFN PKEIILLDSL DQELIKDITL TTPALISRKP IEYFEEKFEE VLNSQFGEKS NSLSLMIKKS SNALVKYILD TQKISLTNIN DIEVYSLVDF MTIDLSSRRN LELTENLREK SKKGSLLWVL DKTETSMGSR MLRRWIEEPL VNKEKITLRL NAVEELFNDL SLNDSLKEAL HDIYDIERIL GKISNKNANA KDLIALKTSI GKIPNVKGII ENCTSSLLKN YHHNLDDLRD IYELLEKSIK EDPSLTLKDG DLIKDGFNGE IDELRLAKTN GKDWISSLEN REREFTGIKS LKVGFNKVFG YYIEISKANY SSIPEGRYIR KQTLANAERF ITPELKEIEE KLLGASEKLC SLEYDIFLDI RNEVENHIDR LKTTAKIIAE LDCISNLAFV ALENDFIKPE INEDGETKIE NGRHPVVEKV IPKGEFIPND TIINKDDNQL LIITGPNMAG KSTYMRQVAI ITLMCQIGSF VPASKANISV VDKIFTRIGA SDDLAGGKST FMVEMWEVSN ILKNATENSL VLLDEVGRGT STYDGLSIAW SVIEYICKNK NLRCKTLFAT HYHELTKLEG EIHGVRNYSV AVKEVDNNII FLRKIIEGGA DQSYGIEVAK LAGIPDEVIN RAKEILETLE MESSKDNLDL ALKEVNASKE EMKEASIEAS YEVKETIVEE DKIEIIEEII SKSSDAKTHK KEDDQIQLDF SAIGKDNLIK ELSEVDILSL NPMEAMNRLY ALVKEAKNLI
|
| |