Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_2024 |
Symbol | |
ID | 4203066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | - |
Start bp | 2264591 |
End bp | 2265820 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 638082893 |
Product | U32 family peptidase |
Protein accession | YP_696457 |
Protein GI | 110799271 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAAC CAGAAATATT AGCACCAGCA GGTAGCTTAG AAAAATTAAA GACAGCAATT GATTTTGGAG CAGATGCAGT TTATATTGGA GGAAGCAAAC TTAACCTAAG AGCTTTCGCT GATAACTTTA CAAATGAACA GATTGCAGAA GGTGTAAAGT ATGCTCATGA TAGAGGAAGA AAAGTTTATG TTACTATGAA TGTATTTCCT CACAATGCAG ATTTAGAAGG ATTAGAAGAA TACATTGTAG GACTTAATGA TTTAAATGTA GATGCAATAA TAGTTTCAGA TCCATCAATA ATAATGACTG CAAAGGAAGT TGCACCAGAT TTAGAAATAC ACTTAAGTAC TCAAGCAAAC AATGTAAACT GGAAGTCAGC TAAGTTTTGG CATAGCTTAG GAGTTAAGAG AATTGTTTTA GCTAGAGAGC TTAGTTTTAA AGAAATTGAA AAAATACACG AGAATTTACC AGAAGATTGT GATTTAGAAG CATTTGTTCA TGGTTCAATG TGTATGGCAT ACTCAGGAAG ATGTTTAATA TCAAACTACA TGACAGGAAG AGACTCAAAT AGAGGAGCTT GTTCACAAGC ATGTAGATAT AAGTATTATT TAATGGAAGA AAAGAGACCT GGAGAGTATT TCCAAGTTAT AGAAGATGAT AAAGGTACAT ACATAATGAA CTCTAAGGAT TTATGTATGA TAGAATATAT ACCAGAGCTT GTTAAGAGTG GTATATACTC ATTTAAAATA GAAGGAAGAA TGAAGAGCCC ATACTATGTT GCTGCAATTG TTAAAGCTTA CAGAGAAGCT TTAGATAAGT ATTGGGATGA TCCAGAAGGA TATGAATTTG ATCAAAAATT AATGGATAAT CTTTTAAAGG TTAGTCATAG AAGATATCAC ACAGGATTTT ACTTTGGAAA ATCAGGAGAG CAAGTTTATG AATCATCATC TTATATAAGA GATTATGATA TCGTAGGAGT TGTTAGAGAT TATAACGAAG AAACTAAGGT AGCAACTATA GAACAAAGAA ACAGATTATT CGAAGGGGAT ACAGTAGAGG TATTAACTCC AGTAGGAGAT TACTATGAAA TCCAAATGAA TGATATGAAG GATGAAAAAG ACGAAAAAAT AGATGTTGCT AACAAAGCTC AAATGATATT CAAGGTTAAA ATAGATAAGC CTGTAAAGGT AAATGATATG TTAATTAAGT GCAAGGAGGC AAACAGCTAA
|
Protein sequence | MRKPEILAPA GSLEKLKTAI DFGADAVYIG GSKLNLRAFA DNFTNEQIAE GVKYAHDRGR KVYVTMNVFP HNADLEGLEE YIVGLNDLNV DAIIVSDPSI IMTAKEVAPD LEIHLSTQAN NVNWKSAKFW HSLGVKRIVL ARELSFKEIE KIHENLPEDC DLEAFVHGSM CMAYSGRCLI SNYMTGRDSN RGACSQACRY KYYLMEEKRP GEYFQVIEDD KGTYIMNSKD LCMIEYIPEL VKSGIYSFKI EGRMKSPYYV AAIVKAYREA LDKYWDDPEG YEFDQKLMDN LLKVSHRRYH TGFYFGKSGE QVYESSSYIR DYDIVGVVRD YNEETKVATI EQRNRLFEGD TVEVLTPVGD YYEIQMNDMK DEKDEKIDVA NKAQMIFKVK IDKPVKVNDM LIKCKEANS
|
| |