Gene CPF_1123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1123 
Symbol 
ID4203634 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1281873 
End bp1286585 
Gene Length4713 bp 
Protein Length1570 aa 
Translation table11 
GC content31% 
IMG OID638082004 
Productcell wall-associated serine proteinase 
Protein accessionYP_695569 
Protein GI110799369 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.677567 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGAAC AGAAAAAACA GATGAAGAGG TTTTTATCCT CAACGCTCAA TGGTTTGGTG 
GTATTGGCTC TTATAATGCC TAGTAGTGTA GGAACTAATG TAATGGCAGA GGAAATTCAA
AATGGGACAA GCCATACAGT AAGAAATTTA GAGAATATTG CTAGGGATGA ACTTTATTTT
AAGTATCAAA ATCCAAATGA AGTAGTAAGA GTTATAGTTG AACTTGAAAA GCCAGCAGCT
ATAGAGGAAG CTAAGGCTGA AGGTGAGAAA AAACCATCTG AAGCAAAAAT TCAAGAAGTA
AAAGAAGAAC AAAAAGATGC TAAGGATGAA GCAGAAGAAA TTACAGGAGA AAAGATAAAT
AAAAGCTTTG GAACCTTAAT AAATGGATTC AGTATCGATA CAAAAGTAAA AGATATAGAG
GAATTAAAGA AAATCGATGG TGTAAAAAGC GTAAAAGTTG TAAAGACTTA TTATCCAGCT
ATGAATTCTG CTAAAGATTT AACACAGGCA GTAGAAACTT GGAAAGAGTT AGGCTTAAAA
GGTGAAGGAA TGGTTGTTTC TATTATAGAT TCAGGAATAG ATCCAAATCA TAAAGACATG
AAAATAACAG ATTCATCAAA AGCTAAGCTT AAAAAAGAAA ATTTAAAAGA TGGACCAGGA
AAATATTTTA CAGAAAAAAT TCCATATGGA TATAATTTTG CTGATGAAAA TGAAAATATT
ATAGATACAC ATCCAAAAGT AGATATGCAT GGAATGCATG TAGCAGGAAT AGTTGCTGCC
AATGGAAGTG ATGAAGAGGT TGCTAAAAAT GAGGCAATAA AAGGAGTAGC ACCAGAAGCA
CAATTACTAG CTATGAAAGT TTTTTCAAAT AATCCTAATA GACAAGGGGC TGCTGAGGAT
GATATAGTAG CAGCTATTGA AGAGTCTGTT AATCAAGGAG CAGACATAAT AAATATGAGT
TTAGGATCTT CTGCTGGATT TCAAAAAGAA GATGATCCAG AACAAATAGC AGTTAAAAAG
GCTGTGGATG CTGGGGTAGT CGTTGTCGTG GCTGCTGGAA ATTCACAATA TTCAACGGCT
CCATACAAGG TTCCAGATAT AAAGGATACT GGTTTAGTAG GAGCTCCTGG AACTGCAAAG
GATGCACTTA CAGTAGCAAA CTATCATAAT AGTAAGATGT TATTACCAAC AATAAGCTTT
GAAGAAAATG GGGAAGCAGT TAATATACCA TTTATGTTAT CAGGAGAAGA AAATAGTCTT
AATTTAGATA AAGACTTTAA TTTAGTAGAT TGTGGACTTG GAAAGGTACA AGATTTTAAA
GGAAAAGATT TAAAGGGAAA AGTTGCCTTA ATAAAAAGAG GGGAAATTAC TTTTATAGAT
AAAAATTTAA ATGCACAGGT AGCTGGTGCT GAAGGGGTAA TAATATACAA TGGAGATGGT
GATGAGTCAT TTATAAATAT GGCAACAGAT CCAAAGGTTA AAATTCCATC AGTATTTGTT
AAAAACTCAG ACGGGGAAAA ATTTAAAAAT GCTATTAATA AAAGTTTAAA AATAAAGTTT
ACAAACAATA AAATATTAGT TGCAAGTAGT GATGCTGGTG ATTTTGTTGA ATCATCATCA
TGGGGACCTA CTCCAAGCTT AGACTTTAAA CCACAAATAT CTGCACCAGG TGGAAATATA
TATTCAACTA TAAATGATAA TAAATATGGT ATTAAGACTG GTACATCAAT GGCAGCGCCA
CATGTTGCTG GAGGAGAAAC ATTAATAGTT GAAGGGCTTA AAAAGGAAAA TCCAAATCTT
AAGGGAAGAG ATTTAGTAGA ATTAGCAAAA AATACAGCAA TAAGTACTTC TAAGATAGAG
ATGGATAAAA ATAATCCTAA GATACCTTAT TCACCTAGAA GACAAGGAGC TGGTCTTATG
CAAATAGAGG AAGCTCTTAA AAATAAGGTT GTAGTATTAG ATGAAAATAA TAATTCTACT
GTGGCATTAA AGCAAATAGG AAATGAGAAA GAATTTACAT TAACATTAAA AAATTATGGA
GATAAAGAAG CTGAGTATGA TGTTGAAAAT TTAGGTGGAG TTTTAACAGA AACTAGTGAT
ACTTTAAAGA CTATGTCTCA TGATGTAAGG ATTGAGGGGG CAAATCTTAA GTTTGATAAA
AATAAAGTTA TTGTTCCAGC TAAGGGTACA GAAACTTTAA AAGTGAAATT AACAATACCT
AAAGCCATTT CAGAGGATAG ATTTGTTGAA GGATTTATTA AACTTACAGG AAAAGATGTT
CCATCATTAT CAGTTCCTTT CATAGGATAT TATGGAGACT GGGGAAAAGA TCAAATAATA
GAAGCTATGA ATTGGGATAG TAACAATCAA AAGTTCATAG TTCCATCAGA AGTATTAACA
AATTTAAATG GAGCAATTGG GTACAAGCTA GGTTTAGGAG CAAAGGATGA AAAGGGAAAT
CTTAAAGTAG ATCCTAGTAA AATAGCAATA TCTCCAGATG GAAATGGAAA TGGTGATATC
ATAGCTCCAT ATTTATATTA TTTAAGAAAT GCTAAGGTAA CTGAATTAGA ATTATTAGAT
AAAGATAAAA AATCCTTAGG AGTTATAGGA CATGAAGACT ATATAAGAAA AGAGGAATAT
AGTGAACCAA GTGGAAGTGG AAAAGCTCCA AACTTATTTG AGAACTTAAC TTGGGATGGA
AAGCTATATA ACCAAAGTAC AGGAGAAAAG GAAGTTGTAC CAGAAGGACA ATATTATTTA
AATATAAAAT CAAAAGTTGA TTATGATAAT GCTAAAGATC AAGAGGTAGT TGTTCCAGTA
CAAGTTGACC TTACTGCACC TAATATTGAA ATAACTTCAG GAGACAAAGT ATTAGGCAAT
AAGGATGATA ATGAAGTAGA TTATAAATTA GAATGGACTG CTAAGGACAA TGTTTCTATT
ATACCAGATA TAGCTACAGT ATATGTAAAT GGTAAAAGTG TAAGAGCTAA TATAAGTGAA
AATAATGGCA CTTATAGTTG TGACATAAAG TTAAAAAACA ATGCTTTAAA TGAAGTTAAG
GTAGCTATGA ATGATACAGC ATTTAACTTA GGTGAAGTAT CTAAGAATAT AAAGGTTGAA
TCTTCAGATC CATTAATAAA ATTTGAAGGT AACTTTGGAA CTGCTACTTT AAGTGTTGAT
AATTCTTTAG AATATCTAGT AAAGGGAGTA GTTTTAGGTC CAGTAAAAGA ATTTAAGTTA
AATAATGAAG ATGTTAAGGT AAATGAAGAT GGAACTTTTA TACATAAAGT TTCTTTAAAA
GAAGGTATGA ATAAAGTTAA TATTTATGCT AAAGATGAAA ATGGAAATGT ATTATATAAT
TATGCTAGTA ATATATTATG TGATACTAAA GCTCCTATAA TAAACTTATT ATCTCCAAAG
GTAGAATCAG ATGGTATAGT TATAACTAAT GAAGATAAAG TAAATATAAA AGGTACTGTT
GAGGATAACA CATTAGGATA TAAGTTCTAT AAAAATGATA CTATTCAATT AGAAGTTGAA
GAGAGAGCTA AGCCAGGAAA TGATAGTACA AGAAGAGAGT TTTCATATGA AGTTCCTGTA
AAAGATGGAG ATGTTATAGT ATTAAAGGCC GTTGATGTAT TAGGTCATGA AACTCTTAGA
AAGCTTACTG TTAAGGTTGA TAAAAATGCT CCAGAAGTGA CAATTGGAGG AGTATCAGAT
CAAGGAATAT ACAATAGTGA TGTAGCTCCA AAGGTAGTTT CTAATGAAGA TGTAGAAATT
AGTTACTTAT TAAATGGAAA AGATTATGAT GGAAAAACTC CTATTTCAGA GGATGGAAAC
TATGAGTTAA TTGTAAGGGC TAAAGATAAA GCTGGAAATA AAACAGAAGT AAAAACTAAC
TTTACTATAG ATAAAACACC AGCAAATATT TCTGTTAATA ATATTGAAGA GGGAAAAGTA
TATAATGAAG AAATTATTCC TGAAATAGCT AGTAATGAGG AAGCTACTTT TAAATATACT
TTAAACGGAA AAGAATATGA TGGTAAGTCT AGTATAAAAG AAGATGGTGA CTATGTTTTA
AATATACAAG CAACAGATAA AGCTGGAAAT GTATCAAATA AAGAAGTTAA GTTTTCTATA
GATAGAACAC CTGCTAATAT ATTTGTAACT GGAGTTGAAG AGGGTAAAGT TTATAATGAA
CCTGTTACTC CAATAATTGA GATTGATGAT AAGGATGCAA CTTTAAAATA TACTTTAAAT
GGAAAAGAAT ATGACGGAAA ATCAAGAATA GATGAAGATG GTAAGTATAT CTTAAAGGTT
GAAGCTTTAG ATAAAGCAGG AAACCCATCA GAAAAAGTTA TTAACTTTAC TATAGACAGA
AGTTCCTTAA AAAATTCAGA AAAGGATGAT CCAAATAACA ATAAGAAATA TAATGAACCT
ATTGATGAGG AAATAGTACA AAAGCCTGAA GCTAAAACTG ATTCAAAAGA GGAATTAAAG
GCTAATAAGC TTAAAGAAGA GAATAAAGTT AGTGAAGAAA ATAAAAGTAA TGAAGAGAAC
TCAGTTAAAG ATGAAAAACT TCTTAAGAAA GAAGGAACAT TGCCAACAAC AGGACAAGTT
CTTGGAGGAT CTATGATATC TTTATTAGGA GCTATAATGG CTTCAGTTGG AGCTGTTTTC
TTAAAAAGAA AAAATAAAAA CAAGGAAGAA TAG
 
Protein sequence
MKEQKKQMKR FLSSTLNGLV VLALIMPSSV GTNVMAEEIQ NGTSHTVRNL ENIARDELYF 
KYQNPNEVVR VIVELEKPAA IEEAKAEGEK KPSEAKIQEV KEEQKDAKDE AEEITGEKIN
KSFGTLINGF SIDTKVKDIE ELKKIDGVKS VKVVKTYYPA MNSAKDLTQA VETWKELGLK
GEGMVVSIID SGIDPNHKDM KITDSSKAKL KKENLKDGPG KYFTEKIPYG YNFADENENI
IDTHPKVDMH GMHVAGIVAA NGSDEEVAKN EAIKGVAPEA QLLAMKVFSN NPNRQGAAED
DIVAAIEESV NQGADIINMS LGSSAGFQKE DDPEQIAVKK AVDAGVVVVV AAGNSQYSTA
PYKVPDIKDT GLVGAPGTAK DALTVANYHN SKMLLPTISF EENGEAVNIP FMLSGEENSL
NLDKDFNLVD CGLGKVQDFK GKDLKGKVAL IKRGEITFID KNLNAQVAGA EGVIIYNGDG
DESFINMATD PKVKIPSVFV KNSDGEKFKN AINKSLKIKF TNNKILVASS DAGDFVESSS
WGPTPSLDFK PQISAPGGNI YSTINDNKYG IKTGTSMAAP HVAGGETLIV EGLKKENPNL
KGRDLVELAK NTAISTSKIE MDKNNPKIPY SPRRQGAGLM QIEEALKNKV VVLDENNNST
VALKQIGNEK EFTLTLKNYG DKEAEYDVEN LGGVLTETSD TLKTMSHDVR IEGANLKFDK
NKVIVPAKGT ETLKVKLTIP KAISEDRFVE GFIKLTGKDV PSLSVPFIGY YGDWGKDQII
EAMNWDSNNQ KFIVPSEVLT NLNGAIGYKL GLGAKDEKGN LKVDPSKIAI SPDGNGNGDI
IAPYLYYLRN AKVTELELLD KDKKSLGVIG HEDYIRKEEY SEPSGSGKAP NLFENLTWDG
KLYNQSTGEK EVVPEGQYYL NIKSKVDYDN AKDQEVVVPV QVDLTAPNIE ITSGDKVLGN
KDDNEVDYKL EWTAKDNVSI IPDIATVYVN GKSVRANISE NNGTYSCDIK LKNNALNEVK
VAMNDTAFNL GEVSKNIKVE SSDPLIKFEG NFGTATLSVD NSLEYLVKGV VLGPVKEFKL
NNEDVKVNED GTFIHKVSLK EGMNKVNIYA KDENGNVLYN YASNILCDTK APIINLLSPK
VESDGIVITN EDKVNIKGTV EDNTLGYKFY KNDTIQLEVE ERAKPGNDST RREFSYEVPV
KDGDVIVLKA VDVLGHETLR KLTVKVDKNA PEVTIGGVSD QGIYNSDVAP KVVSNEDVEI
SYLLNGKDYD GKTPISEDGN YELIVRAKDK AGNKTEVKTN FTIDKTPANI SVNNIEEGKV
YNEEIIPEIA SNEEATFKYT LNGKEYDGKS SIKEDGDYVL NIQATDKAGN VSNKEVKFSI
DRTPANIFVT GVEEGKVYNE PVTPIIEIDD KDATLKYTLN GKEYDGKSRI DEDGKYILKV
EALDKAGNPS EKVINFTIDR SSLKNSEKDD PNNNKKYNEP IDEEIVQKPE AKTDSKEELK
ANKLKEENKV SEENKSNEEN SVKDEKLLKK EGTLPTTGQV LGGSMISLLG AIMASVGAVF
LKRKNKNKEE