Gene CPR_2071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2071 
Symbol 
ID4205081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2294303 
End bp2295964 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content31% 
IMG OID642566621 
Productsubtilisin like protease 
Protein accessionYP_699380 
Protein GI110803117 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAACAAC TAAGAAAAGA TGTTCCATCA ATTCGTTATA TACAGACAAG AGTGCCTTAT 
GTTTTACAGG AAGTATCTCC ACAAGAAACT GATAATATAT CCACAATTGT AAATAATCCA
TATTTAGATC TTGATGGAAA TGGAGTTATA GTTGGAATAG TTGATACCGG AATAGATTAT
CTTAATAAGG AATTTATGAG AGAAGATGAT ACCTCTAGAA TACTTACTAT TTGGGATCAA
AAGAGTACTA AAAAACCTAA TGATTCAGTT TATGTAGGAT CTATATTTAA TAATGAAGAC
ATAAATAATG CTATTAAAAG TAAGGCAGAT GGGAGAGATC CATATGATAT TGTTGATAGT
AGAGATGAAA TATGGCATGG AACTAAATTA GCTAGCATAA TAGGAGCACG TGGATATAAT
AGAAAAATTA AAGGTATTGC ACCTAATTGT GATTTTGCTA TTGTAAAACT ATTAAATTCA
TTTAGTTATG AAAAAGCTTT TAGAGAAAAT GGAATAGAAA ATGTTCCAGT TTATGATGAA
GTTGAAATTG TAGCAGGAGT AGAATATCTC AAAAATTATG CCCTCAGTCT TAAGAGACCA
TTAGTAATTT GTTTAGCGAT AGGGTGTACT GAAGCAAGTC ATGATGGAAG AGGACTTTTT
CCTAGGTATT TAACTACAGT TGCTTCAATA AGGGGAATTG CTATAGTTGC TGGAGTAGGA
AATGAAGGAA GTGCACAAGG GCATGCCTCT GGAGTTATAG AACTTGAAAA TAGTGTTGAA
AAAATTGAAT TATCAATACC TAGAGAGATA AAAAATTTTA ACTTAGCTTT CTGGATTCAA
AGACCTAATA TAATGTCTTT AAATATAAAA TCTCCTAGTG GAGAAGAATC TTCATTCATA
GATGCTAAAA TTTTTCTAGA GAGATCATTT AAATTTATAT TAACTGATAC AAGTGTTAAT
ATTAACTATT ATGTTCCTGA TACCTTTACA GGAAATGAGC TTATTTATGT AGAGTTTAAA
GATATTAAGC CTGGAATATG GACTTTTGAG TTAAGAGGAG ATTATATAAC TAATGGAAGA
TATGATGTAT GGTTAGCTCC AAGTTCTTTA TTGCCTGAAA ATACAAAATT TCTTAGCCCT
AATCCATTAA ATACTCTTAT GGATCCATCT ACTGCTAAAT ATATAATAAC CGTAGCCTAT
TATAATAGTC AAACTCAATC TTTATTAGCG GAATCTGGAA AAGGTTTTAA TGTTAATGGA
TGGATAAATC CAGATATAAC CACTGCAGGT AAAGATATTT TGACCATATT TCCTGGAGAT
AGAGTTGGAA GAATGTCAGG GAGTTCTCCA GCAACAGCAA TAACTGTAGG AGTATGTGCC
TTATTATTTC AATGGGGAAT TATAAATAGA AATGATAGAA CAATGAATTC TAGTAAATTA
AGAAGCTATT TAATATATGG GGCTACTAGA ATTCCTGGGC AAACATATCC TAATGAATAT
ACTGGATATG GGTATTTAGA TTTATATGAA ATATTTAGAA ATATAATAGG AGTTCCTTTA
CCACCTTATA GAAGTACTAA GATTAATGAA GATTATACTG AGTATTATTG TGGAAGGATG
TTAGTTAGCG TGCCTTCTGA TTTTTATTAT GGAGGGAAAT AA
 
Protein sequence
MKQLRKDVPS IRYIQTRVPY VLQEVSPQET DNISTIVNNP YLDLDGNGVI VGIVDTGIDY 
LNKEFMREDD TSRILTIWDQ KSTKKPNDSV YVGSIFNNED INNAIKSKAD GRDPYDIVDS
RDEIWHGTKL ASIIGARGYN RKIKGIAPNC DFAIVKLLNS FSYEKAFREN GIENVPVYDE
VEIVAGVEYL KNYALSLKRP LVICLAIGCT EASHDGRGLF PRYLTTVASI RGIAIVAGVG
NEGSAQGHAS GVIELENSVE KIELSIPREI KNFNLAFWIQ RPNIMSLNIK SPSGEESSFI
DAKIFLERSF KFILTDTSVN INYYVPDTFT GNELIYVEFK DIKPGIWTFE LRGDYITNGR
YDVWLAPSSL LPENTKFLSP NPLNTLMDPS TAKYIITVAY YNSQTQSLLA ESGKGFNVNG
WINPDITTAG KDILTIFPGD RVGRMSGSSP ATAITVGVCA LLFQWGIINR NDRTMNSSKL
RSYLIYGATR IPGQTYPNEY TGYGYLDLYE IFRNIIGVPL PPYRSTKINE DYTEYYCGRM
LVSVPSDFYY GGK