Gene CPR_0631 details


Gene Information

Locus tag: CPR_0631
Symbol: bcn
ID: 4205789
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 752746
End bp: 753816
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 31%
IMG OID: 642565191
Product: bacteriocin
Protein accession: YP_697958
Protein GI: 110802571
COG category: [T] Signal transduction mechanisms
COG ID: [COG3103] SH3 domain protein
TIGRFAM ID:
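
The coordinate and length fields above are consistent with each other, and a short check makes the arithmetic explicit. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(752746, 753816)
print(length)                     # 1071
print(protein_length_aa(length))  # 356
```

Both values agree with the Gene Length and Protein Length fields of this record.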


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.420339
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAGAT GTTTCCCATA TAACTTTTCA CCTCAATATA CTTCAAGAAA CTATACAGGA 
CCTAATCCAT TAGGAGCACC AGAGGCAGTT GCACTTAAAA ATTTAGTTGA AAGAATAAAT
TCTAGTTCTA GTGAAATGGT AGTTTTAGAT TTCCATGGAT GGATGAATTT CACTCAAGAT
AATGCAGAAT TAGGAAGATA TTTTGGAAAT CAATTTGGAT TTGGACATAA TAATGGATAT
AGCTCAGGTT TTTTCTCAAG TTGGGCAACA ACTTTAAGGA ATACAAAAGC AGTTTTAATT
GAGTATCCTA CAAATACTTA TAGCTATAAT GATGTTATAA ATAAAAATTA TATAGGAAAA
ACATTCAATG GAATAATAAA TATCATTAAG AATAATCCTA ATGGTGGAGA TGTAGATAAT
GGTGGAAATT CAGGAGGAAG TTCATCTTCG GATGTGAGAT ATATAGCAGC TGGAGAAGTA
ATAAATGTAC AATCATTCCT AAATGTTAGA AAAGGACCAG GGACAAATTA TGATTCTATA
GGACAACTTC ATCAAGGCGA AAAAGTTAGT ATAGTAGCTA CAAATAAAGA GTGGAATAAG
ATAGAATATG GTACTGGGTA TGGATACGTT CATAAAGATT TTGTAAATAT ATTATATAGA
GATATAAATG AGGAACTGAG AGGTTTAATG GTTAGGTATG AGTATATGTA TGGACCACAA
TGGAATGGAA TAACTTCCGG AGTTGCTAAT TTAGCTAAAT TTTATAATTT GGTTAGGAAT
GGTTCTATAG TTGATTTGAA AAATCAAGGT TGGGATGAAA ATCAATATTA TTTTAATGGT
AAAATTTACA GAAAAGATGC TCCAGGAAAT ATTCTTTATG GATATTTAGG AAAGGTTTTT
GGTTTTACAG ATGAATTATT ATTGAGAGCT GCAGGATTTG CACAAAAAGA AGCTGGAACA
AGCAAACCAG AATTTGGAGA TCCTTTTGGA AATCCACCAT ATGGAGATGA TCCTTATGAT
CAAGAATGTA TAAAAGACGG TATTGATTAT TTTAATAAAT ACAGAAAATA G
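
The GC content reported for this gene (31%) can be recomputed from the sequence above. The following is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks shown here; `clean_sequence` and `gc_percent` are hypothetical helper names.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the bases."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# Usage: paste the full gene sequence between the triple quotes.
first_line = """
ATGAATAGAT GTTTCCCATA TAACTTTTCA CCTCAATATA CTTCAAGAAA CTATACAGGA
"""
print(round(gc_percent(first_line), 1))
```

Only the first line of the sequence is used in the example, so the printed value describes that fragment rather than the whole gene.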
 
Protein sequence
MNRCFPYNFS PQYTSRNYTG PNPLGAPEAV ALKNLVERIN SSSSEMVVLD FHGWMNFTQD 
NAELGRYFGN QFGFGHNNGY SSGFFSSWAT TLRNTKAVLI EYPTNTYSYN DVINKNYIGK
TFNGIINIIK NNPNGGDVDN GGNSGGSSSS DVRYIAAGEV INVQSFLNVR KGPGTNYDSI
GQLHQGEKVS IVATNKEWNK IEYGTGYGYV HKDFVNILYR DINEELRGLM VRYEYMYGPQ
WNGITSGVAN LAKFYNLVRN GSIVDLKNQG WDENQYYFNG KIYRKDAPGN ILYGYLGKVF
GFTDELLLRA AGFAQKEAGT SKPEFGDPFG NPPYGDDPYD QECIKDGIDY FNKYRK
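
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below builds the codon table by hand instead of relying on a library; it assumes the codon-to-amino-acid assignments of table 11 match the standard code (the tables differ in which codons may initiate translation, which does not matter here because the gene starts with ATG). The function name `translate` is hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, ordered by first, second, third base over TCAG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA sequence in frame 1; unknown codons become 'X'."""
    seq = "".join(dna.split()).upper()  # drop display spacing
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*" and to_stop:
            break  # stop codon ends the protein
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAATAGAT GTTTCCCATA TAACTTTTCA"))  # MNRCFPYNFS
```

The output matches the first ten residues of the protein sequence listed above; running the function on the full gene sequence gives a direct comparison with the whole protein.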