Gene CPF_2887 details


Gene Information

Locus tag: CPF_2887
Symbol: cspB
ID: 4201610
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 3156394
End bp: 3158091
Gene Length: 1698 bp
Protein Length: 565 aa
Translation table: 11
GC content: 31%
IMG OID: 638083754
Product: protease CspB
Protein accession: YP_697251
Protein GI: 110800758
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
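
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3156394
end_bp = 3158091

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # 1698 bp
print(protein_length, "aa")   # 565 aa
```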


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAATA AAGCTAAGGT TGGCATAGAT TTTATAAATA CAATACCCAA GCAAATACTA 
ACTAGCTTAA TAGAACAATA TTCCCCTAAT AATGGAGAGA TAGAGTTAGT TGTTTTATAT
GGAGATAATT TTTTAAGATT TAAAAATTCA GTAGATGTCA TAGGTGCTAA AGTTGAAGAT
TTAGGATATG GATTTGGAAT ACTTATAATA AAGGTTAATG ACTTAAATAG GATAATTGAG
CTTGAAGGCC TTCAATATAT AGAGCTACCC AAAATTTTAT ATACATCAGC TTATGATAGT
AATAGAGCAT CATGCATTCC ATCAGTTTGG AATAATTATA ATTTAACTGG AGAAGGAATA
TTAGTTGGTT TTTTAGATAC TGGAATAGAC TACACTCATA ATGCTTTTAA AGATGCTGAG
GGCAATACCA GGATAGAGTA TATATATGAT CTTGAAAATG GAGTTGTATA TGATAAAAAT
AAAATAAATG AAGCTTTAAA ATCTGAAGAT CCATTTAGCA TTGTACCTGA AATTGATTTA
TCAGGTCATG GAACTCATGT TGCAGGAATT GCATGTGCCG GAGGAAATAT AAACTTTGAT
AATTATGGAG TTGCATATAA AAGTAGCATA GCAATGGTAA AGATAACTGG TGAGAATAGT
TTAAGGGCGG CCTTAAGTAC ACAGCTTATG AGAGGTTTAA AATTTTTAAT GGATAAAAGT
AATGAAATTA ATAAACCTTT AGTTGTAAAT ATAAGTTTAA GTACAAATGA TGGTTCTCAT
AATGGAAGTA GTTTACTAGA AAAGTATATT CAAACTTTTA CGCAATTACA AAAGGCAGTT
ATTGTAGTAG CTGCTGGGAA TGAAGGTAAT AGTGCTCATC ATGTAGGGGG CAAGATGAAA
AAAGAAGAAG ATTTAGACTT AAATATAGGA GATGGAGAAA AAGGTATTAT ATTAGATTTT
TTCAAGCCTG TATTAGTTGA TGTATCAGTT GAAGTAATTT CACCAACTGG AATAAGTACA
GGACCAATAG AATTATCTGA GTCTTATAAA GAAAGATTTG TTGGAAGAGA AAAAATAGTT
GTATATAGTA CTGGTCCTAA GCCTTTTGAT ATACAAGGAC AAACTACCAT AAGTATTTTG
CCCTTAGGAG ATACAATAAC TTCTGGTGGA TGGAGAATTA TAGTTAGAAA ATTAAATAAT
TATGAGGGAT ATTTTGATAT TTGGTTACCT ATAGCAGAAG GATTAAATGA AAGAACAAGA
TTTTTACAAC CATCTGTTTA TAATACCTTA GGAATCCCTG CAACTGTAGA AGGGGTTATC
TCCGTTGGAA GTTATAATTT TTTAAACAAT AATTTATCAG CTTTTTCTGG AAGAGGAGTT
GTTAGACCTG AGTGGTTAAT AAAACCAGAT TTAGTTGCTC CAGGTGAAAA CATATTATCC
ACTGTTGAGG AGCAAGGATT TGATACTAAA AGTGGTACAT CAATGGCTGC GCCACAAGTT
TCTGGAATAT GTGCTCTTCT TTTTGAATGG GGGATTATAA GAAATAATGA TCCTTTTTTA
TATGGAGAAA GAATTAAATA TTATTTGATT AAAGGAGCAA AGAGGACGAT CTTTGGTGAG
GCATATCCAA ATCCAGACTT GGGATATGGA TTTGTATGTT TAGATAGAAC AATGGAATTA
TTAATTAATA GGAGATAG
 
Protein sequence
MENKAKVGID FINTIPKQIL TSLIEQYSPN NGEIELVVLY GDNFLRFKNS VDVIGAKVED 
LGYGFGILII KVNDLNRIIE LEGLQYIELP KILYTSAYDS NRASCIPSVW NNYNLTGEGI
LVGFLDTGID YTHNAFKDAE GNTRIEYIYD LENGVVYDKN KINEALKSED PFSIVPEIDL
SGHGTHVAGI ACAGGNINFD NYGVAYKSSI AMVKITGENS LRAALSTQLM RGLKFLMDKS
NEINKPLVVN ISLSTNDGSH NGSSLLEKYI QTFTQLQKAV IVVAAGNEGN SAHHVGGKMK
KEEDLDLNIG DGEKGIILDF FKPVLVDVSV EVISPTGIST GPIELSESYK ERFVGREKIV
VYSTGPKPFD IQGQTTISIL PLGDTITSGG WRIIVRKLNN YEGYFDIWLP IAEGLNERTR
FLQPSVYNTL GIPATVEGVI SVGSYNFLNN NLSAFSGRGV VRPEWLIKPD LVAPGENILS
TVEEQGFDTK SGTSMAAPQV SGICALLFEW GIIRNNDPFL YGERIKYYLI KGAKRTIFGE
AYPNPDLGYG FVCLDRTMEL LINRR