Gene CPF_0298 details

Gene Information

Locus tag: CPF_0298
Symbol:
ID: 4203821
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 353027
End bp: 354313
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 32%
IMG OID: 638081185
Product: carboxyl-terminal protease
Protein accession: YP_694758
Protein GI: 110800692
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
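
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the start and end coordinates are 1-based and inclusive, and that the gene length counts one terminal stop codon that is not part of the protein. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 353027
end_bp = 354313
gene_length_bp = 1287
protein_length_aa = 428

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # True
print(gene_length_bp % 3 == 0)                   # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 354313 - 353027 + 1 = 1287 bp, and 1287 / 3 - 1 = 428 aa.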


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.201307
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCGCA ATAAGGATGA AAAAAATAAA TATAATAATA TTATTAAAAA TCTTAAGAAG 
AAAAAAGGAT TCAGAGCTTT AGCCATAGGT GTTGCCTTAA CTTTAATAAT TGGTGTTTCT
GTATATGCAG GGAATAGATT AACTGCCTTT GGTATTTTAC CTATAACTAG TGTAAGTGCA
GTTCAATCTT CCTTAGAAAA GGTAAATGAT ACAGAAAATT TTAAAAAGGT ATTAGAAGTA
AGGGAAATGC TTTATAGATG GTATGATGGA GATATTGATG ATAGTAAATT AGCTGAAGGT
GCTATAAAGG GTATGGTTTC ATCTTTAGGT GATCAATACA CATATTATAT GAATGAAAAA
GAATTTTCAG ATTTTAAAGA AAAAAGTCAA GGAAACTACA TGGGAATTGG GATTCAAGTA
GCTGTTAAGG ATGGTAAGAT AGTAGTAATT TCTCCTATTC AAGGAGGGCC AGCTGAAAAA
GCAGGAATAA AAACTGGAGA TATTATCTTA AAAGTAAATG GAGAACCAGT TTCAGGAAAT
GAATTGGATA AAGCCGTTTC AATGATGAAG GGTACTACAA AAGAAAATAT AAAATTAACA
TTATATAGAG AAGGTAAAGG CGAATTTGAC GTTGATGTTA TGAGAGATGT AATTAAAACA
GTTAACGTTA AAAGTGAAAT GATTGATGGA GATATTGGAT ATATAGAAGT TTTAGCTTTT
GATGAGGGAA CAGCTAAAGA CTTTGAAACT CAATTAAAAG CTTTAGAAGA GAAAGGTATG
AAGGGGTTAA TCCTTGATTT AAGAGGAAAT CCAGGAGGAT TTATGAAAGA ATGTGTAGAC
CTAGTATCAA ACTTTGTTCC AAAGGATAAG GTTATAGTGT CTACAATAGA TAAGTATGGT
AATAAAGAGG AAAGTGTATC TAAGGGCGGA ATTGCACAGG GAATGCCTTT AGTAGTTTTA
ATTGATGGAG GTACTGCTTC TGCCTCAGAA ATAGTTGCGG GAGCTATTAG AGATTATGAT
TTAGGAACTC TAGTTGGAAC AACATCCTTT GGTAAAGGAA TAGTTCAAGT TGTTTTAGAT
AAGATAGGTC AAGAAAAAGA TGGTACAGCT TTAAAGGTAA CTATTTCAAA ATACTATACT
CCAAATGGAG AAAATATTCA CAAAAAGGGT ATAGGACCAG ATGTTACAGT TGAATATTCA
AAAGAGCTTA AAGAAAAGAC ATATTCAAGA AGTACTGATC CTCAATTTGA AAAAGCCTTA
GAAATCATAC AAGAAAAGAT AAAATAA
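
The gene sequence above is printed in space-separated groups of ten bases, so it has to be joined before any computation. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers written for this example, not part of any database tool, and the demo input is only the first line of the sequence, so its GC fraction is not expected to match the whole-gene GC content field.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence (spaces, line breaks) into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence shown above, used as a short demo input.
first_line = "ATGAGTCGCA ATAAGGATGA AAAAAATAAA TATAATAATA TTATTAAAAA TCTTAAGAAG"
seq = clean_sequence(first_line)

print(len(seq))  # 60
print(f"GC fraction of the first line: {gc_fraction(seq):.3f}")
```

Passing the full 1287 bp sequence through the same two functions gives the value to compare against the GC content field.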
 
Protein sequence
MSRNKDEKNK YNNIIKNLKK KKGFRALAIG VALTLIIGVS VYAGNRLTAF GILPITSVSA 
VQSSLEKVND TENFKKVLEV REMLYRWYDG DIDDSKLAEG AIKGMVSSLG DQYTYYMNEK
EFSDFKEKSQ GNYMGIGIQV AVKDGKIVVI SPIQGGPAEK AGIKTGDIIL KVNGEPVSGN
ELDKAVSMMK GTTKENIKLT LYREGKGEFD VDVMRDVIKT VNVKSEMIDG DIGYIEVLAF
DEGTAKDFET QLKALEEKGM KGLILDLRGN PGGFMKECVD LVSNFVPKDK VIVSTIDKYG
NKEESVSKGG IAQGMPLVVL IDGGTASASE IVAGAIRDYD LGTLVGTTSF GKGIVQVVLD
KIGQEKDGTA LKVTISKYYT PNGENIHKKG IGPDVTVEYS KELKEKTYSR STDPQFEKAL
EIIQEKIK
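
As a sanity check on how the two sequences relate, the sketch below translates the first 60 bases of the gene sequence and compares the result with the first 20 residues of the protein sequence. It is an illustration, not the database's own pipeline. It builds the codon table from the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in their permitted start codons, which this sketch ignores). The `translate` function is a hypothetical helper, and in practice a library such as Biopython would normally do this job.

```python
from itertools import product

# Codon table in the conventional T, C, A, G base order.
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the
# standard code; alternative start codons are not handled here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate dna codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence and first two blocks of the protein sequence.
gene_start = "ATGAGTCGCA ATAAGGATGA AAAAAATAAA TATAATAATA TTATTAAAAA TCTTAAGAAG"
protein_start = "MSRNKDEKNKYNNIIKNLKK"

print(translate(gene_start))                   # MSRNKDEKNKYNNIIKNLKK
print(translate(gene_start) == protein_start)  # True
```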