Gene CPR_2123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2123 
Symbol 
ID4204405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2351412 
End bp2353199 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content27% 
IMG OID642566673 
ProductMutS domain-containing protein 
Protein accessionYP_699432 
Protein GI110801595 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGAATA GAAAAGATAT TTATGAAAAA AGAATAGAAG AGTATTCAAG TGCTTTAAAG 
AGATTAAAGA GAGACTATAA TATAATCAGC GCATTAAGAT TAGTAGTTTC TTTAAGTATT
TTATTTTTTA TTTACTACGC TTATAGCATA GGCTCAATAA GCTTTGGGAT CATATTATTT
TTGCTAAATT CTATATTATT TTTATATTTA GCAAAGGTTC ATGAAGGTAT AGTGAATAAA
ATAAGTAAAA GAGAAGCTCT TATAGAGGTT AATAAAAAAG AGATTCTTAG GTTAGAGGGA
AAGTGGAGAG AATTTAATGA TTTAGGAGAA GAATATTTAG ATAATAAGCA TCCTTTTATA
AATGACTTAG ATATATTTGG GAAAAACTCT TTATTTCAAT GGATAAATGA AACTGGTACT
GTTTATGGAA GAGAGAAATT AAGCCATTTA TTAAAGTTAG AAGAACTTCC AAATAAAGAA
GAAATTTTAT TAAGACAGGA AGCTTTAAAG GAGCTTTCTA AAAAAGTAGA TTTTAGACAT
GAATTTATAG CTTCATTAAA AGATAAAAAA GGAAAAAAAG AGAAGTATTT AGGGGAATGG
TTAAAAGAAG ACAGTAAAGC CATATCGCCT TTATTAAATA TTCTTAGAAT AATAATGCCA
GTAATAAATA TTGGAATTAC TATTTTAGTT GGTATGAATG TTATTTCATG GCAAATACTA
TTAATTTCTC TTGTTATAAG TTATGGTATT TTGAAGCTTG GCAATAAGGA AGTTATTAAA
GGATTAAATA TATTTGAAGA TTTAAAATAT AGAATAAAAA CCTATGTAGA GGCTTTAGAG
TTAATAGAAA AAGAGAATTT CCAGTCTAAT ATAATAAAAA GTATAAAAAG TAACTTAGAT
ATGAATGGCA AAAGTGCTAG TAAGGAGCTT AAAAGCTTAG AAAAGATAAC TAGCTGGCTT
TATGATAGGG GAAATGCCTT TTATCTTTTA TTAAACTGTT ATTTGCTTTG GGATTATCAA
ATTCTATCAA AGCTTGAAAA GTGGAAGAGT TCTAATAAAG ATGAGTTTTA TAAATGGATG
ATTTCTTTAG GTGATTTTGA GGCTTTAGTT TCTTTAGCTG GATTTACTTA CAATAATCAT
GGATGGGCTA CACCAAAAAT AAATGATGAC TATACTTTAA AGGGAAAAAA TCTTAGCCAT
CCTATGTTAG GAGAAAAAGG CGTTGGAAAC AGTTTTGATA TTAATAAGGA TAAGAGAGTA
ATCTTAATAA CAGGATCTAA TATGTCAGGT AAGAGTACAT TTTTAAGAAC TGTTGGATTT
AATTGTATAT TAGCTTATCT AGGACTTCCT GTAAAAGGAG AAAGTTTTGA AGCTCCAATA
TTAAAAGTTT ATACCTGTAT GAGAACTGGA GATAATCTTG AAGAGAGTAT ATCTTCATTT
TATGCAGAGA TACTTAGAAT AAAGATTATA GTTGAGGGTG TAAAAAGAGG AGAAAAGATT
TTATTTTTGT TAGATGAAAT ATTTAAAGGA ACAAACTCCT TAGATAGACA TGAGGGAGCG
GAGATATTAA TAAATCAGCT TTTAGAAGGA AACACATTAG GATTAGTTTC AACTCATGAT
TTTGAACTTT GCGATATGGA GAAAAAAGAT TCTACTATAC AAAATTATAA TTTTAGAGAA
TATTATGAGG ATAATAAATT AAAGTTTGAT TATATTTTAA GAAAAGGTGT TTCACAAACA
AGAAATGCTA GATATTTAAT GAAGATGGCT GGAATAGATA TTGAATAA
 
Protein sequence
MENRKDIYEK RIEEYSSALK RLKRDYNIIS ALRLVVSLSI LFFIYYAYSI GSISFGIILF 
LLNSILFLYL AKVHEGIVNK ISKREALIEV NKKEILRLEG KWREFNDLGE EYLDNKHPFI
NDLDIFGKNS LFQWINETGT VYGREKLSHL LKLEELPNKE EILLRQEALK ELSKKVDFRH
EFIASLKDKK GKKEKYLGEW LKEDSKAISP LLNILRIIMP VINIGITILV GMNVISWQIL
LISLVISYGI LKLGNKEVIK GLNIFEDLKY RIKTYVEALE LIEKENFQSN IIKSIKSNLD
MNGKSASKEL KSLEKITSWL YDRGNAFYLL LNCYLLWDYQ ILSKLEKWKS SNKDEFYKWM
ISLGDFEALV SLAGFTYNNH GWATPKINDD YTLKGKNLSH PMLGEKGVGN SFDINKDKRV
ILITGSNMSG KSTFLRTVGF NCILAYLGLP VKGESFEAPI LKVYTCMRTG DNLEESISSF
YAEILRIKII VEGVKRGEKI LFLLDEIFKG TNSLDRHEGA EILINQLLEG NTLGLVSTHD
FELCDMEKKD STIQNYNFRE YYEDNKLKFD YILRKGVSQT RNARYLMKMA GIDIE