Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_0886 |
Symbol | |
ID | 4204994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 1020393 |
End bp | 1022066 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 642565445 |
Product | MutS domain-containing protein |
Protein accession | YP_698211 |
Protein GI | 110803146 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.437767 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATTAT ATATTTTTAT TGGAATAATC ATAGTTATTT TATCTGTGAT TTATTACAAC ATGAAATCCA AAGGTAAGTT TATTAAAAGC TTAAATGAAA GTTTTGGGCA CAAGCCTAAG GATTATCTTG AAGATTTTGA TATGACCTTT CTAAAAAATC ACTATGAAAT TCGTAAGGAA AATGAATCCT CTGGTGAATC AATTGATGAG CTAACTTGGA ATGACCTAGA TATGGATGCA GTCTTTAAGC GTATAAACTA CACAAGGACA AGCTTAGGGG AAGCTTATTT ATACTATAAA CTTAGAGAAA TTAGTTACAA TAAAGATGAG TGGACAAGCT TAGAAAAACT TATAACCCTA TTCACCACCA ATGAAGAATT AAGAAATAAA GTATCACTGC TTTTACTGAA AGTAGGAAAG TTAATTGACC TTAACTTAAC TAATTTCATT TATAATCCTA AGTTTAGCAA AATACCTAGT TATTATAAAT ACCCCTTATT ATCTTTAGGT TTTATATTTT CAATATTCTT ATCCTTTATT TACACAAAGG TTGGTCTTAT ACTTAGCTTT ATCTTTCTAT GCATAAATAT ATTATCATAC CAAAGTGAAA AAATATTTTT GGAAGATAGG TTTAAAGTTA TGATTTATCT ATTAAATAAT ATTAATCTTT GTATAAGTCT CTCTAAAATT AAGGATAAGG ATTTTGAGTT TTTTAAAAAT GAAGTAAACT CTGCCCTTCA TAACTTCAAA GCTTTAAATA TAGTTAAAAT TTATGGTAAT AGCTTTCAAA AGAAGAAAAA TGCCTTTACT GATATTGATA TTATTTTTGA TTACATAAAG ATGTTTTTTA TGGTGGATAT TATAGCTTAC CAAAACTCAG TTAAAATCTT AGAGAAAAAC AAGGAGAATC TTTATAAGAT ATATGATATA GTGGCCAAGT TAGATTTTGC CTTAAGCCTT GCTTACTATA GAAAAAGTCT AAGTGAATAC ACTATCCCAG AATTTATTAA AAGTGATGAT ATAAGTCTAG AAAATCTATA CCATCCTCTT ATTGATAATC CTGTTAAAAA CTCCATACTT ATAAAGAATA ATATTCTCTT TACAGGGTCA AATGCTTCTG GTAAGTCTAC CTTTATAAAA GCAGTAGCTC TAAACTGCAT ACTTGCCCAA AGTTTAAACA CTGCCTTATG TTCAAAATAT AGATGCAAGT TTTCTAAGGT GGTAACCTCT ATGGCAATAA AAGATAATAT ACTAGCTGGG GATAGCTACT TCATTGCAGA AATAAAAAGT TTAAAAAGGC TCCTTGACTC TTTAAATGGA GAAATTAGAG TTTTAGCCTT TATTGATGAA ATATTAAAAG GTACAAACAC CATAGAGAGA ATCTCAGCCT CAGTCTCCAT ATTAAAATAT GCTGAAAACA CCAATGGAAA ATTGCTAGTA GCCACTCATG ATATGGAATT AACTCAAATC CTTGAAACCT ATGAAAACTA TCACTTTAGT GAAACTGTTA CAGAAAATGG AGTAACTTTT GATTACAAAC TAAAAAAAGG ACCTTCCAAT ACAAGAAATG CCCTAAAACT CCTTAAAGCA ATGAACTTTA ATAAGGACGT AGTTTCCTTA TCAAACCAAG TATATAATAA TTTCATAGAA ACTGAAAAAT GGGGAAAGCT TTAA
|
Protein sequence | MQLYIFIGII IVILSVIYYN MKSKGKFIKS LNESFGHKPK DYLEDFDMTF LKNHYEIRKE NESSGESIDE LTWNDLDMDA VFKRINYTRT SLGEAYLYYK LREISYNKDE WTSLEKLITL FTTNEELRNK VSLLLLKVGK LIDLNLTNFI YNPKFSKIPS YYKYPLLSLG FIFSIFLSFI YTKVGLILSF IFLCINILSY QSEKIFLEDR FKVMIYLLNN INLCISLSKI KDKDFEFFKN EVNSALHNFK ALNIVKIYGN SFQKKKNAFT DIDIIFDYIK MFFMVDIIAY QNSVKILEKN KENLYKIYDI VAKLDFALSL AYYRKSLSEY TIPEFIKSDD ISLENLYHPL IDNPVKNSIL IKNNILFTGS NASGKSTFIK AVALNCILAQ SLNTALCSKY RCKFSKVVTS MAIKDNILAG DSYFIAEIKS LKRLLDSLNG EIRVLAFIDE ILKGTNTIER ISASVSILKY AENTNGKLLV ATHDMELTQI LETYENYHFS ETVTENGVTF DYKLKKGPSN TRNALKLLKA MNFNKDVVSL SNQVYNNFIE TEKWGKL
|
| |