Gene CPR_2567 details


Gene Information

Locus tag: CPR_2567
Symbol: cspB
ID: 4205751
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 2794082
End bp: 2795776
Gene Length: 1695 bp
Protein Length: 564 aa
Translation table: 11
GC content: 30%
IMG OID: 642567117
Product: protease CspB
Protein accession: YP_699814
Protein GI: 110802592
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
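
The length fields above are consistent with the listed coordinates. The following is a minimal sketch of that arithmetic, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and do not come from the source database.

```python
# Values copied from the Gene Information fields above.
start_bp = 2794082
end_bp = 2795776

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1695 564
```

Under these assumptions the result matches the reported Gene Length (1695 bp) and Protein Length (564 aa).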


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGAATA AAGCTAAGGG TGGAATAGAT TTTATAAATA TAATACCGAA GCAAATACTA 
ACTAGATTAA TAGAAAAATA TTCGCCTAAT AATGAAGATA TAGAATTAGT TGTTTTATAT
GGGGATAATT TTTTCAAATT CAAGAATTCA GTAGATGCTA TAGGAGCAAA AGTTGAAGAC
TTAGGATATG GATTTGGAAT ACTTATAATA AAAGTTAATG ACTTAAATAG GATAATTGAG
CTTGATGGCC TTCAGTATAT AGAGTTACCT AAAATATTAT ATACATCAGC TTATGATAGC
AACAGAGCAT CATGTATTCC TTCAGTTTGG AATAATTATA ATTTAACTGG AGAAGGAATA
TTAGTCGGTT TTTTAGATAC TGGGATAGAC TATACTCATA ATGCTTTTAA AGATACTGAT
GGAAATACCA GGATAGAATA TATATACGAT CTTGAAAATG AAGTTGTATA TGACAAGAAT
AAAATAAATG AAGCTTTAAA ATCTGAAGAT CCATTTAGTA TTGTACCTGA AATTGATTTA
TCAGGTCATG GAACTCATGT TGCAGGAATT GCATGTGCTG GAGGAAATAT AAATTTTGAT
AATTATGGGG TTGCTTATAA AAGTAGCATA GCAATGGTAA AAATAACTAG TGGAAATAGT
TTAAGAGAAG CATTAAGTAC ACAACTCATG AAGGGTTTAA AATTTTTAAT GGATAAAAGT
AATGAGATTA ATAAACCTTT AGTTGTGAAT ATAAGTTTAA GTACAAATGA TGGTTCTCAT
AATGGAAATA GTTTGTTAGA AAAGTATATT CAAACTTTTT CTCAATTACA GAAAGCAGTT
ATTGTAGTTG CTGCTGGAAA TGAAGGTAAT AGTGCTCACC ATGTAGGTGG AAATATGAAA
AAAGAAGAAG AATTAGATTT AAATATAGGA GATGGAGAAA AGAGTATAAT ATTAGCTTTT
TTCAAGTCTG TACTAGTTGA TGTATCAGTT GAGGTAATTT CACCAACTGG AGTAAGTACG
GGACCTATGG AATTATCTGA ATCTTATAGA GAAAGATTTG TTGGAAAAGA AAAAATAGTT
GTATATAGTA CTGGTCCTAA ACCTTTTGAT ATACAAGGAC AAACTACAAT AAGTATTTTA
CCATTAGGAG ACACAATAAC TTCTGGTGGA TGGAGAATTA TAGTTAGAAA ATTAAATAAT
TATGATGGAT ATTTTGATAT TTGGTTACCT ATAGCAGAAG GATTAAATGA AAAAACAAGA
TTTTTACAAC CATCTGTTTA TAATACATTA GGAATCCCTG CAACTGTACA AGGGGTTATC
TCTGTTGGAA GTTATAATTT TTTAAATAAT AATTTATCAG CATTTTCTGG TAGGGGCGTT
GTTAGACCTG AATGGTTAAT AAAACCAGAT TTAGTTGCTC CTGGTGAAAA TATATTATCT
ACTGTTCCAG GACAAGGATT TGATACTAAA AGTGGTACAT CAATGGCTGC ACCACAAGTT
TCTGGGATAT GCGCTCTTCT TTTGGAATGG GGTATTATAA GAAACAATGA TCCTTTTTTA
TATGGACAAA AGATTAAATA TTATTTAATT AAAGGAGCAA GGAGGACCAT ATCTGGAGAG
GCTTATCCAA ATCCAGACTT AGGATATGGG TTTGTATGTT TAGAGAGAAC AATGGAATTA
TTAACTAATA GATGA
 
Protein sequence
MENKAKGGID FINIIPKQIL TRLIEKYSPN NEDIELVVLY GDNFFKFKNS VDAIGAKVED 
LGYGFGILII KVNDLNRIIE LDGLQYIELP KILYTSAYDS NRASCIPSVW NNYNLTGEGI
LVGFLDTGID YTHNAFKDTD GNTRIEYIYD LENEVVYDKN KINEALKSED PFSIVPEIDL
SGHGTHVAGI ACAGGNINFD NYGVAYKSSI AMVKITSGNS LREALSTQLM KGLKFLMDKS
NEINKPLVVN ISLSTNDGSH NGNSLLEKYI QTFSQLQKAV IVVAAGNEGN SAHHVGGNMK
KEEELDLNIG DGEKSIILAF FKSVLVDVSV EVISPTGVST GPMELSESYR ERFVGKEKIV
VYSTGPKPFD IQGQTTISIL PLGDTITSGG WRIIVRKLNN YDGYFDIWLP IAEGLNEKTR
FLQPSVYNTL GIPATVQGVI SVGSYNFLNN NLSAFSGRGV VRPEWLIKPD LVAPGENILS
TVPGQGFDTK SGTSMAAPQV SGICALLLEW GIIRNNDPFL YGQKIKYYLI KGARRTISGE
AYPNPDLGYG FVCLERTMEL LTNR
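
The gene and protein sequences above can be cross-checked with a short script. The following is a rough sketch, not the database's own method: it builds a codon table whose amino acid assignments are those shared by the standard code and translation table 11, strips the spacing used in the listing, and provides helpers for GC percentage and translation. The function names `clean`, `gc_percent`, and `translate` are hypothetical. Alternative start codons permitted by table 11 are not handled, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon table in TCAG order; table 11 uses the same amino acid
# assignments as the standard code (it differs only in start codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove spaces and line breaks from a sequence listing and uppercase it."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence listing above.
first_row = "ATGGAGAATA AAGCTAAGGG TGGAATAGAT TTTATAAATA TAATACCGAA GCAAATACTA"
last_row = "TTAACTAATA GATGA"

print(translate(first_row))  # MENKAKGGIDFINIIPKQIL
print(translate(last_row))   # LTNR
```

The two printed fragments match the start and end of the protein sequence listing. Passing the full gene sequence to `gc_percent` gives a value to compare against the reported GC content.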