Gene CPF_1358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1358 
SymbolmutS 
ID4201810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1532731 
End bp1535463 
Gene Length2733 bp 
Protein Length910 aa 
Translation table11 
GC content29% 
IMG OID638082239 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_695804 
Protein GI110799241 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.476923 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCTGA CTCCGATGAT GAGGCAATAT TTTGAGATAA AGGAAAATTA TAAGGATTGT 
ATTCTATTTT TTAGACTTGG AGACTTCTAT GAGATGTTCT TTGAGGATGC AGAAACAGCT
GCAAGGGAAT TAGAATTAGT TCTTACTGGA AGAGATTGTG GCTTAGAAAA AAGAGCACCT
ATGTGTGGAA TTCCGTTTCA TGCATCTAAT TCATATATAG GAAGATTAGT AGCTAAGGGG
TATAAGGTTG CTATTTGTGA ACAAGTTGAA GATCCTAAAT TTGCTAAAGG AATTGTTAAA
AGAGATGTTA TAAAGGTTAT AACTCCTGGA ACATATACAG ATTCTTCTTT TGTTGAGGAA
ACAAAAAATA ATTATATAAT GACTATTTAT GCAGACTTAG AGAGAAATAG ATGTTCACTA
GCTATAACAG ATATTTCTAC TGGAGATTTC TTAGCTACAG AGGGAGAATT AGAAAAGGGA
GTTATTTTAG ATGAAATATC TAAGTTTAAT CCTAAAGAAA TAATACTTTT AGATTCTTTA
GATCAAGAGC TTATAAAAGA TATAACATTA ACTACTCCAG CTTTAATAAG TAGAAAGCCT
ATAGAATATT TTGAGGAAAA CTTTGAGGAA GTATTAAATA ATCAATTTGG AGAAAAATCA
AACTCTCTAA GTTTAATGGT TAAGAAATCA AGTAATGCTT TAGTTAAATA CATACTAGAC
ACTCAAAAAA TAAGCTTAAC CAATATAAAT GATATAGAAG TATATAGTTT GGTTGACTTT
ATGACTATAG ATTTAAGTTC TAGAAGAAAT TTAGAGCTTA CAGAGAATCT AAGAGAAAAA
AGCAAGAAGG GTTCTCTTCT TTGGGTATTA GATAAGACAG AAACTTCTAT GGGAAGTAGA
ATGCTTAGAA GATGGATTGA AGAACCTCTT GTTAATAAGG AAAAGATAAC TTTAAGATTA
AATGCTGTAG AAGAATTATT TAATGATCTT TCTTTAAATG ACTCTCTTAA GGAAGCTTTA
CATGATATTT ATGATATAGA GAGAATATTA GGGAAAATTT CAAATAAAAA TGCTAATGCT
AAGGATTTAA TAGCATTAAA AACATCTATA GGTAAAATTC CTAATGTAAA AGGAATTATA
GAAAATTGCA CTTCAAGTCT TCTAAAGAAT TATCACCATA ATTTAGATGA CTTAAGAGAT
ATTTATGATC TTTTAGAAAA ATCTATAAAA GAAGATCCAT CCCTAACATT AAAAGATGGT
GATTTAATAA AAGATGGATT TAATGGTGAA ATAGATGAAT TAAGACTTGC TAAGACTAAC
GGTAAGGATT GGATATCTAG CTTAGAAAAT AGAGAAAGAG AGTTTACTGG CATAAAATCT
TTAAAGGTAG GATTTAATAA GGTCTTTGGT TATTATATAG AAATATCTAA GGCTAATTAT
AGTTCTATTC CTGAAGGAAG ATATATAAGA AAGCAAACTT TAGCTAATGC AGAGAGATTT
ATTACTCCAG AACTTAAGGA AATAGAAGAA AAACTTTTAG GAGCTAGTGA AAAACTTTGC
TCTTTAGAAT ATGATATTTT CCTAGATATA AGAAATGAAG TTGAAAATCA TATAGATAGA
CTAAAAACTA CTGCTAAGAT AATTGCTGAG TTAGATTGTA TAAGCAATTT AGCCTTTGTA
GCTTTAGAAA ATGATTTTAT AAAACCTGAA ATTAATGAAG ATGGGGAAAC TAAAATAGAA
AATGGAAGAC ACCCTGTTGT TGAAAAGGTA ATTCCTAAGG GAGAGTTTAT TCCTAATGAC
ACTATTATAA ATAAAGATGA TAATCAACTT CTTATAATAA CAGGGCCTAA TATGGCTGGT
AAATCAACAT ATATGAGACA GGTAGCTATT ATTACTTTAA TGTGTCAAAT AGGTTCCTTT
GTGCCAGCTA GTAAGGCTAA TATAAGTGTA GTAGATAAAA TCTTTACTAG AATAGGAGCT
TCTGATGATT TAGCAGGTGG TAAAAGTACA TTTATGGTTG AAATGTGGGA AGTTTCTAAC
ATTTTAAAAA ATGCTACAGA AAATAGCTTG GTTCTTTTAG ATGAAGTTGG AAGAGGAACG
AGTACATATG ATGGATTAAG TATAGCTTGG TCTGTTATAG AGTATATATG TAAAAATAAA
AATTTAAGAT GTAAAACTCT TTTTGCTACT CATTATCATG AGTTAACTAA ATTAGAAGGA
GAAATTCACG GAGTTAGAAA TTATTCAGTA GCTGTGAAGG AAGTTGATAA CAATATTATC
TTCTTAAGAA AAATAATAGA AGGTGGAGCA GATCAATCTT ATGGTATTGA GGTTGCTAAA
CTTGCTGGTA TTCCAGATGA GGTAATAAAT AGAGCTAAGG AAATCTTAGA AACTCTTGAA
ATGGAATCTT CAAAGGATAA CTTAGATTTA GCTCTTAAAG AAGTAAATGC TTCAAAAGAA
GACATAGAAG AAGCCTCTAT TACAACCTCT TATGAAGTTA AAGAAACCCT AGTAGAAGAG
GATAAAATTG AAATTAAAGA AGAAGTTATT TCAAAAGCTT CAGAGGCTAA AACACATAAA
AAAGAAGATG ATCAAATACA ATTAGATTTT TCTGCCATAG GAAAAGATAA TTTGATAAAA
GAACTTTCAG AAGTTGATAT TTTATCTTTA AATCCTATGG AGGCTATGAA TAGATTATAT
GCTCTAGTTA AAGAAGCTAA AAATTTAATT TAG
 
Protein sequence
MKLTPMMRQY FEIKENYKDC ILFFRLGDFY EMFFEDAETA ARELELVLTG RDCGLEKRAP 
MCGIPFHASN SYIGRLVAKG YKVAICEQVE DPKFAKGIVK RDVIKVITPG TYTDSSFVEE
TKNNYIMTIY ADLERNRCSL AITDISTGDF LATEGELEKG VILDEISKFN PKEIILLDSL
DQELIKDITL TTPALISRKP IEYFEENFEE VLNNQFGEKS NSLSLMVKKS SNALVKYILD
TQKISLTNIN DIEVYSLVDF MTIDLSSRRN LELTENLREK SKKGSLLWVL DKTETSMGSR
MLRRWIEEPL VNKEKITLRL NAVEELFNDL SLNDSLKEAL HDIYDIERIL GKISNKNANA
KDLIALKTSI GKIPNVKGII ENCTSSLLKN YHHNLDDLRD IYDLLEKSIK EDPSLTLKDG
DLIKDGFNGE IDELRLAKTN GKDWISSLEN REREFTGIKS LKVGFNKVFG YYIEISKANY
SSIPEGRYIR KQTLANAERF ITPELKEIEE KLLGASEKLC SLEYDIFLDI RNEVENHIDR
LKTTAKIIAE LDCISNLAFV ALENDFIKPE INEDGETKIE NGRHPVVEKV IPKGEFIPND
TIINKDDNQL LIITGPNMAG KSTYMRQVAI ITLMCQIGSF VPASKANISV VDKIFTRIGA
SDDLAGGKST FMVEMWEVSN ILKNATENSL VLLDEVGRGT STYDGLSIAW SVIEYICKNK
NLRCKTLFAT HYHELTKLEG EIHGVRNYSV AVKEVDNNII FLRKIIEGGA DQSYGIEVAK
LAGIPDEVIN RAKEILETLE MESSKDNLDL ALKEVNASKE DIEEASITTS YEVKETLVEE
DKIEIKEEVI SKASEAKTHK KEDDQIQLDF SAIGKDNLIK ELSEVDILSL NPMEAMNRLY
ALVKEAKNLI