Gene Information |
Locus tag | CPR_0968 |
Symbol | prtP |
ID | 4204193 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 1099161 |
End bp | 1103873 |
Gene Length | 4713 bp |
Protein Length | 1570 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 642565525 |
Product | cell wall-associated serine proteinase, lactocepin precursor |
Protein accession | YP_698291 |
Protein GI | 110802449 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
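The coordinate and length fields above agree with one another, which a little arithmetic confirms. The following is a minimal Python sketch, assuming that the start and end coordinates are 1-based and inclusive and that the gene length includes a single stop codon. The record states neither convention explicitly, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1099161, 1103873)
print(length)                  # 4713, matching "Gene Length"
print(protein_length(length))  # 1570, matching "Protein Length"
```

Under these assumptions the computed values match the reported 4713 bp and 1570 aa.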
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGAAC AGAAAAAACA GATGAAGAGG TTTTTATCCT CAACGCTCAA TGGTTTGGTG GTATTGGCTC TTATAATGCC TAGTAGTGTA GGAACTAATG TAATGGCAGA GGAAATTCAA AATGGGACAA GCCATACAGT AAGAAATTTA GAGAATATTG CTAGGGATGA ACTTTATTTT AAGTATCAAA ATCCAAATGA AGTAGTAAGA GTTATAGTTG AACTTGAAAA GCCAGCAGCT ATAGAGGAAG CTAAGGCTGA AGGTGAGAAA AAACCATCTG AAGCAAAAAT TCAAGAAGTA AAAGAAGAAC AAAAAGATGC TAAGGATGAA GCAGAAGAAA TTACAGGAGA AAAGATAAAT AAAAGCTTTG GAACCTTAAT AAATGGATTC AGTATCGATA CAAAAGTAAA AGATATAGAG GAATTAAAGA AAATCGATGG TGTAAAAAGC GTAAAAGTTG TAAAGACTTA TTATCCAGCT ATGAATTCTG CTAAAGATTT AACACAGGCA GTAGAAACTT GGAAAGAGTT AGGCTTAAAA GGTGAAGGAA TGGTTGTTTC TATTATAGAT TCAGGAATAG ATCCAAATCA TAAAGATATG AAAATAACAG ATTCATCAAA AGCTAAGCTT AAAAAAGAAA ATTTAAAAGA TGGACCAGGA AAATATTTTA CAGAAAAAAT TCCATATGGA TATAATTTTG CTGATGAAAA TGAAAATATT ATAGATACAC ATCCAAAAGT AGATATGCAT GGAATGCATG TAGCAGGAAT AGTTGCTGCC AATGGAAGTG ATGAAGAGCT TGCTAAAAAT GAGGCAATAA AAGGAGTAGC ACCAGAAGCA CAATTACTAG CTATGAAAGT TTTTTCAAAT AATCCTAATA GACAAGGGGC AGCTGAGGAT GATATAGTAG CTGCCATTGA AGAGTCTGTT AATCAAGGAG CAGACATAAT AAATATGAGT TTAGGATCTT CTGCTGGATT CCAAAAAGAA GATGATCCAG AACAAATAGC CGTTAAAAAG GCTGTGGATG CTGGGGTAGT CGTTGTTGTG GCTGCTGGAA ATTCACAATA TTCAACGGCT CCATACAAGG TTCCGGATAT AAAGGATACT GGTTTAGTAG GAGCTCCTGG AACTGCAAAG GATGCACTTA CAGTAGCAAA CTATCATAAT AGTAAGATGC TATTACCAAC AATAAGCTTT GAAGATAATG GTGAAGCAGT TAATATACCA TTTATGTTAT CAGGAGAAGA AAATAGTCTT AATTTAGATA AAGACTTTGA TTTAGTAGAT TGTGGACTTG GAAAGGTACA AGATTTTAAA GGAAAAGATT TAAAGGGAAA AGTTGCCTTA ATAAAAAGAG GGGAAATTAC TTTTATAGAT AAAAATTTAA ATGCACAGGC AGCTGGTGCT GAAGGGATAA TAATATACAA TGGAGATGGT GATGAGTCAT TTATAAATAT GGCAACAGAT CCAAAGGTTA AAATTCCATC AGTATTTGTT AAAAACTCAG ATGGGGAAAA ATTTAAAAAT GCTATTAATA AGAATTTAAA GATAAAGTTT ACAAACAATA AAATATTAGT TGCAAGTAGT GATGCTGGTG ATTTTGTTGA ATCATCATCA TGGGGACCTA CTCCAAGCTT AGACTTTAAA CCACAAATAT CTGCACCAGG TGGAAATATA TATTCAACTA TAAATGATAA TAAATATGGT ATTAAGACTG GTACATCAAT GGCAGCGCCA CATGTTGCTG GAGGAGAAAC ATTAATAGTT GAAGGGCTTA AAAAGGAAAA TCCAAATCTT 
AAGGGAAGAG ATTTAGTAGA ATTAGCAAAA AATACAGCAA TAAGTACTTC TAAGATAGAG ATGGATAAAA ATAATCCTAA GATACCTTAT TCACCTAGAA GACAAGGAGC TGGTCTTATG CAAATAGAGG AAGCTCTTAA AAATAAGGTT GTAGTATTAG ATGAAAATAA TAATTCTACT GTGGCATTAA AGCAAATAGG AAATGAGAAA GAATTTACAT TAACATTAAA AAATTATGGA GATAAAGAAG CTGAGTATGA TGTTGAAAAT TTAGGTGGAG TTTTAACAGA AACTAGTGAT ACTTTAAAGA CTATGTCTCA TGATGTAAGG ATTGATGGGG CAAATCTTAA GTTTGATAAA AATAAAGTTA TTGTTCCAGC TAAGGGTACA GAAACTTTAA AAGTGAAATT AACAATACCT AAAGCCGTTT CAGAGGATAG ATTTGTTGAA GGATTTATTA AACTTACAGG AAAAGATGTT CCATCATTAT CAGTTCCTTT CATAGGATAT TATGGAGACT GGGGAAAAGA TCAAATAATA GAAGCTATGA ATTGGGATAG TAAAAATCAA AAGTTCATAG TTCCATCAGA AGTATTAACA AATTTAAATG GAGCAATTGG GTACAAGCTA GGTTTAGGAG CAAAGGATGA AAAGGGAAAT CTTAAAGTAG ATCCTAGTAA AATAGCAATA TCTCCAGATG GAAATGGAAA TGGTGATATC ATAGCTCCAT ATTTATATTA TTTAAGAAAT GCTAAGGTAA CTGAATTAGA GTTATTAGAT AAAGATAAAA AATCCTTAGG AGTTATAGGA CATGAAAATT ATATAAGAAA AGAGGAATAT AGTGAACCAA GTGGAAGTGG AAAAGCTCCA AACTTATTTG AGAACTTAAC TTGGGATGGA AAGTTATATA ACCAAAGTAC AGGAGAAAAG GAAGTTGTAC CAGAAGGACA ATATTATTTA AATATAAAAT CAAAAGTTGA TTATGATAAT GCTAAATATC AAGAGGTAGT TGTTCCAGTA CAAGTTGACC TTACTGCACC TAATATTGAA ATAACTTCAG GAGACAAAGT ATTAGGTAAT AAGGACGATA ATGAAGTAGA TTATAAATTA GAATGGACTG CTAAGGATAA TGTTTCTATT ATACCAGATA TAGCTACAGT ATATGTAAAT GGTAAAAGTG TAAGCGCTAA TATAAGTGAA AATAATGGCA CTTATAGTTG TGATATAAAG TTAAAAAACA ATGCTTTAAA TGAAGTTAAG GTAGCTATGA ATGATACAGC ACTTAATTTA GCTGAAGTAT CTAAGAATAT AAAGGTTGAA TCTTCAGATC CATTAATAAA ATTTGAAGGT AACTTTGGAA CTGCTACTTT AAGTGTTGAT AATTCTTTAG AATATCTAGT AAAGGGAGTA GTTTTAGGTC CAGTAAAAGA ATTTAAGTTA AATAATGAAG ATGTTAAGGT AAATGAAGAT GGAACTTTTG TACATAAAGT TGCTTTAAAA GAAGGAATGA ATAAAGTTAA TATTTATGCG AAAGATGAAA ATGGAAATGT GTTATATAAT TATGCTAGTA ATATATTATG TGATACTAAA GCTCCTATAA TAAACTTATC ATCTCCAAAG GTAGAATCAG ATGGTATAGT TATAACTAAT GAAGATAAAA TAAATATAAA AGGTACTGTT GAGGATAACA CATGGGGATA TAAATTCTAT AAAAATGACA CTATTCAGTT AGAAGTTGAA GAGAGAGCTA AGCCAGGAAA TGATAGTACA AGAAGAGAGT TTTCATATGA AGTTCCTGTA AAAGATGGAG 
ATGTTATAGT ATTAAAGGCC GTTGATGTAT TAGGTCATGA AACTCTTAGA AAGCTTACTG TTAAGGTTGA TAAAAATGCT CTAGAAGTGA CAATTGGAGG GGTATCAGAT CAAGGAATAT ACAATAGTGA TGTAACTCCA AAGGTAGTTT CTAATGAAGA TGCAGAAATT AGTTACTTAT TAAATGGAAA AGATTATGAT GGAAAAACTC CTATTTCAGA GGATGGAAAC TATGAGTTAA TTGTAAAGTC TAAAGATAAA GCTGGAAATA AAACAGAAGT AAAAACTAAC TTTACTATAG ATAAAACACC AGCAAATATT TCTGTTAATA ATATTGAAGA TGGAAGAGTA TATAATGAAG AAATTATTCC TGAAATAGCT AGTAATGAAG AAGCTACTTT TAAATATACT TTAAACGGAA AAGAATATGA TGGTAAGTCT AGTATAAAAG AAGATGGTGA CTATGTTTTA AATATACAAG CAACAGATAA AGCTGGAAAT GTATCAAATA AAGAAGTTAA GTTTTCTATA GATAGAACAC CTGCTAATAT ATTTGTAACT GGAGTTGAAG AGGGTAAAGT TTATAATGAA CCTGTTACTC CAATAATTGA GATTGATGAG AAGGATGCAA CTTTAAAATA TACTTTAAAT GGAAAAGAAT ATGACGGAAA ATCAATAATA GATGAAGATG GTAAGTATAT CTTAAAGGTT GAAGCTTTAG ATAAAGCAGG AAATCCATCA GAAAAAGTTA TTAACTTTGC TATAGACAGA AGTTTCTTAA AAAATTCAGA AAAGGATGAT CCAAATAATA ATAAGAAATA TAATGAACCT ATTGATGAGG AAATAGTACA AAAGCCTGAA GCCAAAACTG ATTCAAAAGA GGAATTAAAG GCTAATAAGC TTAAAGAAGA GAATAAAGTT AGTGAAGAAA ATAAAAGTAA TGAAGAGAAC TCAGTTAAAG ATGAAAAACT TCTTAAGAAA GAAGGAACAT TGCCAACAAC AGGACAAGTT CTTGGAGGAT CTATGCTATC TTTATTAGGA GCTATAATGG CTTCAGTTGG AGCTGTTTTC TTAAAAAGAA AAAATAAAAA CAAGGAAGAA TAG
|
Protein sequence | MKEQKKQMKR FLSSTLNGLV VLALIMPSSV GTNVMAEEIQ NGTSHTVRNL ENIARDELYF KYQNPNEVVR VIVELEKPAA IEEAKAEGEK KPSEAKIQEV KEEQKDAKDE AEEITGEKIN KSFGTLINGF SIDTKVKDIE ELKKIDGVKS VKVVKTYYPA MNSAKDLTQA VETWKELGLK GEGMVVSIID SGIDPNHKDM KITDSSKAKL KKENLKDGPG KYFTEKIPYG YNFADENENI IDTHPKVDMH GMHVAGIVAA NGSDEELAKN EAIKGVAPEA QLLAMKVFSN NPNRQGAAED DIVAAIEESV NQGADIINMS LGSSAGFQKE DDPEQIAVKK AVDAGVVVVV AAGNSQYSTA PYKVPDIKDT GLVGAPGTAK DALTVANYHN SKMLLPTISF EDNGEAVNIP FMLSGEENSL NLDKDFDLVD CGLGKVQDFK GKDLKGKVAL IKRGEITFID KNLNAQAAGA EGIIIYNGDG DESFINMATD PKVKIPSVFV KNSDGEKFKN AINKNLKIKF TNNKILVASS DAGDFVESSS WGPTPSLDFK PQISAPGGNI YSTINDNKYG IKTGTSMAAP HVAGGETLIV EGLKKENPNL KGRDLVELAK NTAISTSKIE MDKNNPKIPY SPRRQGAGLM QIEEALKNKV VVLDENNNST VALKQIGNEK EFTLTLKNYG DKEAEYDVEN LGGVLTETSD TLKTMSHDVR IDGANLKFDK NKVIVPAKGT ETLKVKLTIP KAVSEDRFVE GFIKLTGKDV PSLSVPFIGY YGDWGKDQII EAMNWDSKNQ KFIVPSEVLT NLNGAIGYKL GLGAKDEKGN LKVDPSKIAI SPDGNGNGDI IAPYLYYLRN AKVTELELLD KDKKSLGVIG HENYIRKEEY SEPSGSGKAP NLFENLTWDG KLYNQSTGEK EVVPEGQYYL NIKSKVDYDN AKYQEVVVPV QVDLTAPNIE ITSGDKVLGN KDDNEVDYKL EWTAKDNVSI IPDIATVYVN GKSVSANISE NNGTYSCDIK LKNNALNEVK VAMNDTALNL AEVSKNIKVE SSDPLIKFEG NFGTATLSVD NSLEYLVKGV VLGPVKEFKL NNEDVKVNED GTFVHKVALK EGMNKVNIYA KDENGNVLYN YASNILCDTK APIINLSSPK VESDGIVITN EDKINIKGTV EDNTWGYKFY KNDTIQLEVE ERAKPGNDST RREFSYEVPV KDGDVIVLKA VDVLGHETLR KLTVKVDKNA LEVTIGGVSD QGIYNSDVTP KVVSNEDAEI SYLLNGKDYD GKTPISEDGN YELIVKSKDK AGNKTEVKTN FTIDKTPANI SVNNIEDGRV YNEEIIPEIA SNEEATFKYT LNGKEYDGKS SIKEDGDYVL NIQATDKAGN VSNKEVKFSI DRTPANIFVT GVEEGKVYNE PVTPIIEIDE KDATLKYTLN GKEYDGKSII DEDGKYILKV EALDKAGNPS EKVINFAIDR SFLKNSEKDD PNNNKKYNEP IDEEIVQKPE AKTDSKEELK ANKLKEENKV SEENKSNEEN SVKDEKLLKK EGTLPTTGQV LGGSMLSLLG AIMASVGAVF LKRKNKNKEE
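The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any downstream use. The following is a rough Python sketch of that step, with a GC-fraction helper and a coarse sanity check for a coding sequence. All function names are hypothetical and not part of any database tool. The start-codon check is deliberately strict: translation table 11 also permits alternative start codons such as GTG and TTG, which this check would reject.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Coarse check: whole codons, an ATG start, and a TAA/TAG/TGA stop at the end."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in {"TAA", "TAG", "TGA"}
    )


# Toy input in the same grouped layout as the record (not the full gene).
toy = clean_sequence("ATGAAGGAAC AGAAAAAACA\nTAG")
print(toy)
print(looks_like_cds(toy[:3] + toy[-3:]))
```

Applied to the full gene sequence, which begins with ATG and ends with TAG, `gc_fraction` can be compared against the reported GC content of 31%.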