Gene CPF_0305 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0305 
Symbol 
ID4203524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp362736 
End bp364301 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content27% 
IMG OID638081192 
ProductMutS domain-containing protein 
Protein accessionYP_694765 
Protein GI110799781 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.667577 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAAGA GGTTGTTAAA AGTATTAAAG CATGACTATG CAGTAGAAAT GGATAAAAAG 
AGAAACCTAA AAGCTATAAG AACACTATAT GATATGAACG AAAAAGAAGA ATACCATATA
GATGAACAAA CATGGAATGA TTTAGATTTA GATAAGGTTT ATTCAAAATT AGATAGAAAT
TATAGTAGTT TAGGAGAAGC TTCCCTTTAT TCAATGCTTA GAAATCCACT AAATGATGAA
AAAAAGTTAA ATAAGAGAAG AGAAGAAATA GAGTACTTTA AAGAAAACGA AGAAGTAAGA
TTTAAGATTA TGTGGATATT TTTTGAGTTA GGAAGAGATA AAAAGAATAG CTTGTTAGAT
ATGATTAATG AAAAGCTAAT AGAATCTAAT AAGTTTAAAT ATTATTTTTA TACAATTACA
GGAAAGATAT TACCTTTAAT ATTAATACTT ACTGCAATCT TTGTAAGCGT AAGTGCAATG
CTTGGCTTAA TGGCTGTAAC CTTCGTTAAT ATTTATATAA ATAGTAAGGA AAGAGACAGA
ATTAAGGCTA ATGGGCTTAT GTATTTAAGA AGAGTAATAA AGGCATCTAA GCAGATAGTT
AAAATTAATG ATAAAAATTT AGAGTCTTTT AATATAAAGA TAAGAGAAAA CTTAAAAGAT
TTAAAATCAA TAGATAGAAA TACTATAATG ATTAGCTTTA TTAATATGTG GGGTGGAGTA
TTTGAATTTA TATCCGTATT ATTTTTACTA GAAGAAACTG CATATTATAA AATTGCTGAT
TCAATAGAGA AAAATAAAGA TTCAATTTTA AGCCTTTATA AAACATTAGG TGAAATGGAA
GCAATAATAT CTATTGGAAG TTATGAAGAA GAAAAAAAGG ATAAGATAAC TAGACCTAAA
TTCATAAGAG AAACAACATT AGAAATAAAA AATGGAATAC ACCCAATTAT AGAAAATCCA
GTGGCTAACT CCATAAACAT GAGTAAGAGA GGAATTGTTC TTACTGGAAC TAATATGTCA
GGTAAATCAA CTTTTTTAAG GATGTTAGGA GTAAATATGC TTTTTGCTCA AGCCTTTAAT
TTTGTTTTAG CAGAAAAATA TGAAGGGCCA ATATTTAATA TAGTTACTTC AATTAGTCCA
AATGATGATT TAAGTGTTGG TAAAAGTTTT TACATGGCTG AAGCAGAATC AATTTTAAGA
ATAATAAGAG CCTTAGATAA AGATTTACCA GTATTTTGTG CAATTGATGA GATTTTTAGA
GGGACAAATC CAATAGAGAG AATATCAGCT TCAGCAGAAA TTCTTACTTA TATCAATAAT
AAAAACAGTA TTTCTATAGT TGCTACTCAT GATAGAGAAT TAGTTGACAT ATTAAAAGAA
TGTTATGAGT TTTATTATTT TAGCGAAAAT GTTGATAGCA AGAATGGATT AAGCTTTGAT
TATAAGCTAA AAAGAGGAGT ATCTAAAACT AGAAATGCTA TAAAATTACT AGAATATATA
GGTTATCCAA AAGATATAAT AAACAAGTCA TATAGAAGAT CAGAAAAGCT AGAAGGCTTT
ATTTAG
 
Protein sequence
MDKRLLKVLK HDYAVEMDKK RNLKAIRTLY DMNEKEEYHI DEQTWNDLDL DKVYSKLDRN 
YSSLGEASLY SMLRNPLNDE KKLNKRREEI EYFKENEEVR FKIMWIFFEL GRDKKNSLLD
MINEKLIESN KFKYYFYTIT GKILPLILIL TAIFVSVSAM LGLMAVTFVN IYINSKERDR
IKANGLMYLR RVIKASKQIV KINDKNLESF NIKIRENLKD LKSIDRNTIM ISFINMWGGV
FEFISVLFLL EETAYYKIAD SIEKNKDSIL SLYKTLGEME AIISIGSYEE EKKDKITRPK
FIRETTLEIK NGIHPIIENP VANSINMSKR GIVLTGTNMS GKSTFLRMLG VNMLFAQAFN
FVLAEKYEGP IFNIVTSISP NDDLSVGKSF YMAEAESILR IIRALDKDLP VFCAIDEIFR
GTNPIERISA SAEILTYINN KNSISIVATH DRELVDILKE CYEFYYFSEN VDSKNGLSFD
YKLKRGVSKT RNAIKLLEYI GYPKDIINKS YRRSEKLEGF I