Gene CPR_1395 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1395 
Symbol 
ID4206182 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1566617 
End bp1569538 
Gene Length2922 bp 
Protein Length973 aa 
Translation table11 
GC content28% 
IMG OID642565949 
Productpeptidase, putative 
Protein accessionYP_698714 
Protein GI110803737 
COG category[R] General function prediction only 
COG ID[COG1026] Predicted Zn-dependent peptidases, insulinase-like 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.131852 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTTA AAGAGAATAA TATTTATAGT GGATTTAAAC TTTTAAAAAT AGAAAATTTA 
AATGAAATAG GTGGATTAGG TTTAAGGTTT GAACATGAAA AAACTAAGGC TAAACTTATA
AAAATCTTAA GTGAAGATGA TAATAAGTGC TTTGCAATAG GTTTTAGAAC ACCACCTGAA
AATAGTACAG GAGTTCCTCA TATTTTAGAG CATTCAGTTT TATGTGGTTC TAGAAAATTT
AATACTAAGG AACCCTTTGT AGAGCTTTTA AAAGGGTCTT TAAATACATT CTTAAATGCT
ATGACATACC CAGATAAAAC AATATATCCA GTAGCATCAA GAAATGAAAA AGACTTTATG
AATCTTATGG ATGTTTACTT AGATGCTGTA TTATATCCAA ATATATATAA GCATAAGGAA
ATATTCATGC AAGAGGGATG GCATTATTAT ATAGAAAATA AGGAAGATGA ATTAAAGTAT
AATGGTGTTG TTTATAATGA GATGAAAGGG GCATACTCAT CTCCAGATTC TATACTTTAT
AGAAAGATTC CTCAAACAAT ATACCCAGAT ACTTGTTATG CCTTATCTTC AGGAGGAGAT
CCTGATGAAA TACCAAATTT AACTTATGAA GAGTTTGTAG AATTTCATAA GAAATATTAT
CATCCATCAA ACTCATATAT TTTCTTATAT GGTAATGGAG ATACTGAAAA AGAATTAGAA
TTTATAAATG AAGAATATTT AAAGAATTTT GAATATAAAG AGATAGATTC AGAAATAAAA
GAACAAAAAT CCTTTGAAAG TATGAAAGAA GAAAGTTTTA CTTATGGAAT AGCTGAAAGT
GAAGATTTAA ATCATAAAAG TTATTATAGT TTAAACTTTG TAATTGGAGA TGCCACAGAT
GGAGAAAAAG GCTTAGCTTT TGATGTTTTA GCATATCTTC TAACAAGAAG TACAGCAGCC
CCATTAAAGA AAGCATTAAT AGATGCAGGT ATAGGGAAAG CTGTATCAGG AGACTTTGAT
AACTCAACTA AACAATCAGC CTTTACTGTT TTAGTTAAGA ATGCAGAGCT AAACAAAGAA
GAAGAATTTA AAAAAGTAGT AATGGATACT TTAAAGGATT TGGTTGAAAA TGGAATAGAT
AAAGAACTTA TAGAAGCTTC CATAAATAGA GTTGAATTTG AATTAAGAGA AGGGGATTAT
GGTTCTTATC CTAATGGGTT AATTTATTAT TTAAAAGTTA TGGATAGTTG GCTTTATGAT
GGGGATCCAT ATGTTCATTT AGAATATGAA AAAAATCTTG AAAAAATAAA ATCTGCTTTA
ACAAGCAATT ACTTTGAAGA TTTAATTGAA AGATATATGA TAAATAATAC TCACTCTTCA
CTTGTTTCTC TTCATCCTGA AAAAGGAATA AATGAGAAAA AGTCAGCTGA ATTAAAGAAA
AAGTTAGAAG AGATTAAAAA TAGTTTTGAT GAAAAGACTT TAAATGAAAT AATTGATAAT
TGTAAAAAGT TAAAAGAAAG ACAAAGTACA CCTGATAAAA AAGAAGATTT AGAAAGCATT
CCTATGTTAT CTTTAGAGGA TATAGATAAA GAAGCAACTA AAATTCCTAC AGAAGAGAAA
GAGATAGATG GAATTACAAC ATTACACCAT GATTTCCATA CTAATAAAAT AGACTATGTT
AATTTCTTCT TTAATACAAA TAGTGTTCCT CAAGATTTAA TACCTTATGT TGGATTGCTA
TGTGATATAT TAGGTAAGTG TGGAACAGAA AATTATGATT ATTCTAAGTT ATCAAATGCC
ATAAATATAA GCACAGGTGG AATAAGCTTT GGGGCTATAA CTTTTGCTAA TTTAAAGAAA
AATAATGAGT TTAGACCATA TTTGGAAATT TCATATAAAG CATTAAGCAG TAAGACTAAT
AAAGCTATAG AATTAGTTTC TGAAATTGTA AATCACACTG ACCTAGATGA TATGGACAGA
ATTATGCAAA TAATTAGAGA GAAGAGAGCT AGATTAGAAG GTGCTATATT TGATAGTGGT
CATAGAATAG CTATGAAAAA AGTTTTATCA TACTCTACAA ATAGAGGAGC TTATGATGAA
AAAATAAGTG GATTAGATTA TTATGATTTT CTAGTAAATA TAGAGAAGGA AAATAAAAAA
TCAATAATAT CAGATAGCTT AAAAAAGGTA AGAGACTTAA TCTTTAATAA GGGAAATATG
CTTATAAGTT ATTCAGGAAA AGAAGAGGAA TATGAAAACT TTAAGGAAAA AGTAAAATAT
TTAATAAGCA AAACAAGTAA TAATGATTTT GAAAAAGAAG AGTATAATTT TGAGTTAGGA
AAGAAAAATG AAGGACTTTT AACTCAAGGA AATGTACAAT ATGTAGCTAA GGGTGGAAAT
TATAAAACTC ATGGATATAA GTATTCTGGG GCCCTATCTT TATTAGAAAG TATTCTAGGC
TTTGACTACT TATGGAATGC CGTAAGGGTT AAAGGTGGAG CTTATGGAGT GTTCTCTAAC
TTTAGAAGAG ATGGCGGAGC ATATATAGTT TCATATAGAG ACCCTAATAT AAAAAGCACT
TTAGAAGCTT ATGATAATAT ACCTAAGTAT TTAAATGATT TTGAAGCTGA CGAAAGAGAA
ATGACTAAAT ACATCATAGG TACAATAAGA AAATATGATC AACCTATAAG CAATGGAATA
AAAGGAGATA TAGCAGTTTC ATATTACTTG AGTAACTTTA CTTATGAAGA TCTTCAAAAG
GAAAGAGAAG AAATCATAAA TGCAGATGTA GAAAAAATTA AGAGTTTTGC ACCTATGATT
AAAGATTTAA TGAAGGAAGA TTACATCTGT GTACTAGGTA ATGAAGAAAA GATAAAAGAA
AATAAAGAGT TATTTAATAA TATTAAAAGT GTAATTAAAT AG
 
Protein sequence
MNFKENNIYS GFKLLKIENL NEIGGLGLRF EHEKTKAKLI KILSEDDNKC FAIGFRTPPE 
NSTGVPHILE HSVLCGSRKF NTKEPFVELL KGSLNTFLNA MTYPDKTIYP VASRNEKDFM
NLMDVYLDAV LYPNIYKHKE IFMQEGWHYY IENKEDELKY NGVVYNEMKG AYSSPDSILY
RKIPQTIYPD TCYALSSGGD PDEIPNLTYE EFVEFHKKYY HPSNSYIFLY GNGDTEKELE
FINEEYLKNF EYKEIDSEIK EQKSFESMKE ESFTYGIAES EDLNHKSYYS LNFVIGDATD
GEKGLAFDVL AYLLTRSTAA PLKKALIDAG IGKAVSGDFD NSTKQSAFTV LVKNAELNKE
EEFKKVVMDT LKDLVENGID KELIEASINR VEFELREGDY GSYPNGLIYY LKVMDSWLYD
GDPYVHLEYE KNLEKIKSAL TSNYFEDLIE RYMINNTHSS LVSLHPEKGI NEKKSAELKK
KLEEIKNSFD EKTLNEIIDN CKKLKERQST PDKKEDLESI PMLSLEDIDK EATKIPTEEK
EIDGITTLHH DFHTNKIDYV NFFFNTNSVP QDLIPYVGLL CDILGKCGTE NYDYSKLSNA
INISTGGISF GAITFANLKK NNEFRPYLEI SYKALSSKTN KAIELVSEIV NHTDLDDMDR
IMQIIREKRA RLEGAIFDSG HRIAMKKVLS YSTNRGAYDE KISGLDYYDF LVNIEKENKK
SIISDSLKKV RDLIFNKGNM LISYSGKEEE YENFKEKVKY LISKTSNNDF EKEEYNFELG
KKNEGLLTQG NVQYVAKGGN YKTHGYKYSG ALSLLESILG FDYLWNAVRV KGGAYGVFSN
FRRDGGAYIV SYRDPNIKST LEAYDNIPKY LNDFEADERE MTKYIIGTIR KYDQPISNGI
KGDIAVSYYL SNFTYEDLQK EREEIINADV EKIKSFAPMI KDLMKEDYIC VLGNEEKIKE
NKELFNNIKS VIK