Gene CPR_0886 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0886 
Symbol 
ID4204994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1020393 
End bp1022066 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content29% 
IMG OID642565445 
ProductMutS domain-containing protein 
Protein accessionYP_698211 
Protein GI110803146 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.437767 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAATTAT ATATTTTTAT TGGAATAATC ATAGTTATTT TATCTGTGAT TTATTACAAC 
ATGAAATCCA AAGGTAAGTT TATTAAAAGC TTAAATGAAA GTTTTGGGCA CAAGCCTAAG
GATTATCTTG AAGATTTTGA TATGACCTTT CTAAAAAATC ACTATGAAAT TCGTAAGGAA
AATGAATCCT CTGGTGAATC AATTGATGAG CTAACTTGGA ATGACCTAGA TATGGATGCA
GTCTTTAAGC GTATAAACTA CACAAGGACA AGCTTAGGGG AAGCTTATTT ATACTATAAA
CTTAGAGAAA TTAGTTACAA TAAAGATGAG TGGACAAGCT TAGAAAAACT TATAACCCTA
TTCACCACCA ATGAAGAATT AAGAAATAAA GTATCACTGC TTTTACTGAA AGTAGGAAAG
TTAATTGACC TTAACTTAAC TAATTTCATT TATAATCCTA AGTTTAGCAA AATACCTAGT
TATTATAAAT ACCCCTTATT ATCTTTAGGT TTTATATTTT CAATATTCTT ATCCTTTATT
TACACAAAGG TTGGTCTTAT ACTTAGCTTT ATCTTTCTAT GCATAAATAT ATTATCATAC
CAAAGTGAAA AAATATTTTT GGAAGATAGG TTTAAAGTTA TGATTTATCT ATTAAATAAT
ATTAATCTTT GTATAAGTCT CTCTAAAATT AAGGATAAGG ATTTTGAGTT TTTTAAAAAT
GAAGTAAACT CTGCCCTTCA TAACTTCAAA GCTTTAAATA TAGTTAAAAT TTATGGTAAT
AGCTTTCAAA AGAAGAAAAA TGCCTTTACT GATATTGATA TTATTTTTGA TTACATAAAG
ATGTTTTTTA TGGTGGATAT TATAGCTTAC CAAAACTCAG TTAAAATCTT AGAGAAAAAC
AAGGAGAATC TTTATAAGAT ATATGATATA GTGGCCAAGT TAGATTTTGC CTTAAGCCTT
GCTTACTATA GAAAAAGTCT AAGTGAATAC ACTATCCCAG AATTTATTAA AAGTGATGAT
ATAAGTCTAG AAAATCTATA CCATCCTCTT ATTGATAATC CTGTTAAAAA CTCCATACTT
ATAAAGAATA ATATTCTCTT TACAGGGTCA AATGCTTCTG GTAAGTCTAC CTTTATAAAA
GCAGTAGCTC TAAACTGCAT ACTTGCCCAA AGTTTAAACA CTGCCTTATG TTCAAAATAT
AGATGCAAGT TTTCTAAGGT GGTAACCTCT ATGGCAATAA AAGATAATAT ACTAGCTGGG
GATAGCTACT TCATTGCAGA AATAAAAAGT TTAAAAAGGC TCCTTGACTC TTTAAATGGA
GAAATTAGAG TTTTAGCCTT TATTGATGAA ATATTAAAAG GTACAAACAC CATAGAGAGA
ATCTCAGCCT CAGTCTCCAT ATTAAAATAT GCTGAAAACA CCAATGGAAA ATTGCTAGTA
GCCACTCATG ATATGGAATT AACTCAAATC CTTGAAACCT ATGAAAACTA TCACTTTAGT
GAAACTGTTA CAGAAAATGG AGTAACTTTT GATTACAAAC TAAAAAAAGG ACCTTCCAAT
ACAAGAAATG CCCTAAAACT CCTTAAAGCA ATGAACTTTA ATAAGGACGT AGTTTCCTTA
TCAAACCAAG TATATAATAA TTTCATAGAA ACTGAAAAAT GGGGAAAGCT TTAA
 
Protein sequence
MQLYIFIGII IVILSVIYYN MKSKGKFIKS LNESFGHKPK DYLEDFDMTF LKNHYEIRKE 
NESSGESIDE LTWNDLDMDA VFKRINYTRT SLGEAYLYYK LREISYNKDE WTSLEKLITL
FTTNEELRNK VSLLLLKVGK LIDLNLTNFI YNPKFSKIPS YYKYPLLSLG FIFSIFLSFI
YTKVGLILSF IFLCINILSY QSEKIFLEDR FKVMIYLLNN INLCISLSKI KDKDFEFFKN
EVNSALHNFK ALNIVKIYGN SFQKKKNAFT DIDIIFDYIK MFFMVDIIAY QNSVKILEKN
KENLYKIYDI VAKLDFALSL AYYRKSLSEY TIPEFIKSDD ISLENLYHPL IDNPVKNSIL
IKNNILFTGS NASGKSTFIK AVALNCILAQ SLNTALCSKY RCKFSKVVTS MAIKDNILAG
DSYFIAEIKS LKRLLDSLNG EIRVLAFIDE ILKGTNTIER ISASVSILKY AENTNGKLLV
ATHDMELTQI LETYENYHFS ETVTENGVTF DYKLKKGPSN TRNALKLLKA MNFNKDVVSL
SNQVYNNFIE TEKWGKL