Gene CPR_1741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1741 
Symbol 
ID4205093 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1938577 
End bp1939806 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content32% 
IMG OID642566291 
ProductU32 family peptidase 
Protein accessionYP_699056 
Protein GI110802005 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAAC CAGAAATATT AGCACCAGCA GGTAGCTTAG AAAAATTAAA GACAGCAATT 
GATTTTGGAG CAGATGCAGT TTATATTGGA GGAAGCAAAC TTAACCTAAG AGCTTTCGCT
GATAACTTTA CAAATGAACA GATTGCAGAA GGTGTAAAGT ATGCTCATGA CAGAGGAAGA
AAAGTTTATG TTACTATGAA TGTATTTCCT CACAATGCAG ATTTAGAAGG GTTAGAAGAA
TACATTGTAG GACTTAATGA TTTAAATGTA GATGCAATAA TAGTTTCAGA TCCATCAATA
ATAATGACTG CAAAGGAAGT TGCACCAAAT TTAGAAATAC ACTTAAGTAC TCAAGCAAAC
AATGTAAACT GGAAGTCAGC TAAGTTTTGG CATAGCTTAG GAGTTAAGAG AATTGTTTTA
GCTAGAGAGC TTAGTTTTAA AGAAATTGAA AAAATACACG AGAATTTACC AGAAGATTGT
GATTTAGAAG CATTTGTTCA TGGTTCAATG TGTATGGCAT ACTCAGGAAG ATGTTTAATA
TCAAACTACA TGACAGGAAG AGACTCAAAT AGAGGAGCTT GTTCACAAGC ATGTAGATAT
AAGTATTATT TAATGGAAGA AAAGAGACCT GGAGAGTATT TCCAAGTTAT AGAGGATGAT
AAAGGTACAT ACATAATGAA CTCTAAGGAT TTATGTATGA TAGAATATAT ACCAGAGCTT
GTTAAGAGTG GTATATACTC ATTTAAAATA GAAGGAAGAA TGAAGAGCCC ATACTATGTT
GCTGCAATTG TTAAAGCTTA CAGAGAAGCT TTAGATAAGT ATTGGGATGA TCCAGAAGGA
TATGAATTTG ATCAAAAATT AATGGATAAT CTTTTAAAGG TTAGTCATAG AAGATATCAC
ACAGGATTTT ACTTTGGAAA ATCAGGAGAG CAGGTTTATG AATCATCATC TTATATAAGA
GATTATGATA TTGTAGGAGT TGTTAGAGAT TATAACGAAG AAACTAAGGT AGCAACTATA
GAACAAAGAA ACAGATTATT CGAAGGGGAT AAAGTAGAGG TATTAACTCC AGTAGGAGAT
TACTATGAAA TCCAAATGAA TGATATGAAG GATGAAAAAG ATGAAAAAAT AGATGTTGCT
AACAAAGCTC AAATGATATT CAAGGTTAAA ATAGATAAGC CTGTAAAGGT AAATGATATG
TTAATTAAGT GCAAGGAGGC AAACAGCTAA
 
Protein sequence
MRKPEILAPA GSLEKLKTAI DFGADAVYIG GSKLNLRAFA DNFTNEQIAE GVKYAHDRGR 
KVYVTMNVFP HNADLEGLEE YIVGLNDLNV DAIIVSDPSI IMTAKEVAPN LEIHLSTQAN
NVNWKSAKFW HSLGVKRIVL ARELSFKEIE KIHENLPEDC DLEAFVHGSM CMAYSGRCLI
SNYMTGRDSN RGACSQACRY KYYLMEEKRP GEYFQVIEDD KGTYIMNSKD LCMIEYIPEL
VKSGIYSFKI EGRMKSPYYV AAIVKAYREA LDKYWDDPEG YEFDQKLMDN LLKVSHRRYH
TGFYFGKSGE QVYESSSYIR DYDIVGVVRD YNEETKVATI EQRNRLFEGD KVEVLTPVGD
YYEIQMNDMK DEKDEKIDVA NKAQMIFKVK IDKPVKVNDM LIKCKEANS