Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_1741 |
Symbol | |
ID | 4205093 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 1938577 |
End bp | 1939806 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 642566291 |
Product | U32 family peptidase |
Protein accession | YP_699056 |
Protein GI | 110802005 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAAC CAGAAATATT AGCACCAGCA GGTAGCTTAG AAAAATTAAA GACAGCAATT GATTTTGGAG CAGATGCAGT TTATATTGGA GGAAGCAAAC TTAACCTAAG AGCTTTCGCT GATAACTTTA CAAATGAACA GATTGCAGAA GGTGTAAAGT ATGCTCATGA CAGAGGAAGA AAAGTTTATG TTACTATGAA TGTATTTCCT CACAATGCAG ATTTAGAAGG GTTAGAAGAA TACATTGTAG GACTTAATGA TTTAAATGTA GATGCAATAA TAGTTTCAGA TCCATCAATA ATAATGACTG CAAAGGAAGT TGCACCAAAT TTAGAAATAC ACTTAAGTAC TCAAGCAAAC AATGTAAACT GGAAGTCAGC TAAGTTTTGG CATAGCTTAG GAGTTAAGAG AATTGTTTTA GCTAGAGAGC TTAGTTTTAA AGAAATTGAA AAAATACACG AGAATTTACC AGAAGATTGT GATTTAGAAG CATTTGTTCA TGGTTCAATG TGTATGGCAT ACTCAGGAAG ATGTTTAATA TCAAACTACA TGACAGGAAG AGACTCAAAT AGAGGAGCTT GTTCACAAGC ATGTAGATAT AAGTATTATT TAATGGAAGA AAAGAGACCT GGAGAGTATT TCCAAGTTAT AGAGGATGAT AAAGGTACAT ACATAATGAA CTCTAAGGAT TTATGTATGA TAGAATATAT ACCAGAGCTT GTTAAGAGTG GTATATACTC ATTTAAAATA GAAGGAAGAA TGAAGAGCCC ATACTATGTT GCTGCAATTG TTAAAGCTTA CAGAGAAGCT TTAGATAAGT ATTGGGATGA TCCAGAAGGA TATGAATTTG ATCAAAAATT AATGGATAAT CTTTTAAAGG TTAGTCATAG AAGATATCAC ACAGGATTTT ACTTTGGAAA ATCAGGAGAG CAGGTTTATG AATCATCATC TTATATAAGA GATTATGATA TTGTAGGAGT TGTTAGAGAT TATAACGAAG AAACTAAGGT AGCAACTATA GAACAAAGAA ACAGATTATT CGAAGGGGAT AAAGTAGAGG TATTAACTCC AGTAGGAGAT TACTATGAAA TCCAAATGAA TGATATGAAG GATGAAAAAG ATGAAAAAAT AGATGTTGCT AACAAAGCTC AAATGATATT CAAGGTTAAA ATAGATAAGC CTGTAAAGGT AAATGATATG TTAATTAAGT GCAAGGAGGC AAACAGCTAA
|
Protein sequence | MRKPEILAPA GSLEKLKTAI DFGADAVYIG GSKLNLRAFA DNFTNEQIAE GVKYAHDRGR KVYVTMNVFP HNADLEGLEE YIVGLNDLNV DAIIVSDPSI IMTAKEVAPN LEIHLSTQAN NVNWKSAKFW HSLGVKRIVL ARELSFKEIE KIHENLPEDC DLEAFVHGSM CMAYSGRCLI SNYMTGRDSN RGACSQACRY KYYLMEEKRP GEYFQVIEDD KGTYIMNSKD LCMIEYIPEL VKSGIYSFKI EGRMKSPYYV AAIVKAYREA LDKYWDDPEG YEFDQKLMDN LLKVSHRRYH TGFYFGKSGE QVYESSSYIR DYDIVGVVRD YNEETKVATI EQRNRLFEGD KVEVLTPVGD YYEIQMNDMK DEKDEKIDVA NKAQMIFKVK IDKPVKVNDM LIKCKEANS
|
| |