Gene CPF_2024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2024 
Symbol 
ID4203066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2264591 
End bp2265820 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content32% 
IMG OID638082893 
ProductU32 family peptidase 
Protein accessionYP_696457 
Protein GI110799271 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAAC CAGAAATATT AGCACCAGCA GGTAGCTTAG AAAAATTAAA GACAGCAATT 
GATTTTGGAG CAGATGCAGT TTATATTGGA GGAAGCAAAC TTAACCTAAG AGCTTTCGCT
GATAACTTTA CAAATGAACA GATTGCAGAA GGTGTAAAGT ATGCTCATGA TAGAGGAAGA
AAAGTTTATG TTACTATGAA TGTATTTCCT CACAATGCAG ATTTAGAAGG ATTAGAAGAA
TACATTGTAG GACTTAATGA TTTAAATGTA GATGCAATAA TAGTTTCAGA TCCATCAATA
ATAATGACTG CAAAGGAAGT TGCACCAGAT TTAGAAATAC ACTTAAGTAC TCAAGCAAAC
AATGTAAACT GGAAGTCAGC TAAGTTTTGG CATAGCTTAG GAGTTAAGAG AATTGTTTTA
GCTAGAGAGC TTAGTTTTAA AGAAATTGAA AAAATACACG AGAATTTACC AGAAGATTGT
GATTTAGAAG CATTTGTTCA TGGTTCAATG TGTATGGCAT ACTCAGGAAG ATGTTTAATA
TCAAACTACA TGACAGGAAG AGACTCAAAT AGAGGAGCTT GTTCACAAGC ATGTAGATAT
AAGTATTATT TAATGGAAGA AAAGAGACCT GGAGAGTATT TCCAAGTTAT AGAAGATGAT
AAAGGTACAT ACATAATGAA CTCTAAGGAT TTATGTATGA TAGAATATAT ACCAGAGCTT
GTTAAGAGTG GTATATACTC ATTTAAAATA GAAGGAAGAA TGAAGAGCCC ATACTATGTT
GCTGCAATTG TTAAAGCTTA CAGAGAAGCT TTAGATAAGT ATTGGGATGA TCCAGAAGGA
TATGAATTTG ATCAAAAATT AATGGATAAT CTTTTAAAGG TTAGTCATAG AAGATATCAC
ACAGGATTTT ACTTTGGAAA ATCAGGAGAG CAAGTTTATG AATCATCATC TTATATAAGA
GATTATGATA TCGTAGGAGT TGTTAGAGAT TATAACGAAG AAACTAAGGT AGCAACTATA
GAACAAAGAA ACAGATTATT CGAAGGGGAT ACAGTAGAGG TATTAACTCC AGTAGGAGAT
TACTATGAAA TCCAAATGAA TGATATGAAG GATGAAAAAG ACGAAAAAAT AGATGTTGCT
AACAAAGCTC AAATGATATT CAAGGTTAAA ATAGATAAGC CTGTAAAGGT AAATGATATG
TTAATTAAGT GCAAGGAGGC AAACAGCTAA
 
Protein sequence
MRKPEILAPA GSLEKLKTAI DFGADAVYIG GSKLNLRAFA DNFTNEQIAE GVKYAHDRGR 
KVYVTMNVFP HNADLEGLEE YIVGLNDLNV DAIIVSDPSI IMTAKEVAPD LEIHLSTQAN
NVNWKSAKFW HSLGVKRIVL ARELSFKEIE KIHENLPEDC DLEAFVHGSM CMAYSGRCLI
SNYMTGRDSN RGACSQACRY KYYLMEEKRP GEYFQVIEDD KGTYIMNSKD LCMIEYIPEL
VKSGIYSFKI EGRMKSPYYV AAIVKAYREA LDKYWDDPEG YEFDQKLMDN LLKVSHRRYH
TGFYFGKSGE QVYESSSYIR DYDIVGVVRD YNEETKVATI EQRNRLFEGD TVEVLTPVGD
YYEIQMNDMK DEKDEKIDVA NKAQMIFKVK IDKPVKVNDM LIKCKEANS