Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_2202 |
Symbol | gcp |
ID | 4204126 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 2431480 |
End bp | 2432499 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 642566752 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_699502 |
Protein GI | 110801551 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAA AAATTATATT AGCAATAGAA AGTAGTTGTG ACGAAACAGC GGCAGCTGTA GTAGTCAATG GTAGAGAAGT TTTATCAAAT ATAATATCTT CTCAGATAGA TATACATACA AAATTTGGAG GAGTAGTTCC AGAGGTTGCA TCAAGAAAAC ACATAGAAGC TATAAATGCA GTGGTAGAGG AAGCCTTAGA AGTTGCTGGA GTAACATTTG ATGACATAGA TGCAATAGCA GTTACATATG GTCCAGGTTT AGTTGGAGCA CTTTTAGTAG GACTTCAATA TGCTAAAGGA TTAGCATACT CTTTAGATAA ACCATTAATA GGAGTTAATC ATATAGAAGG GCATATAAGT GCTAACTTTA TAGATCATAA GGACTTAGAG CCACCTTTTG TTTGCTTAGT TGTTTCAGGA GGACATACTT TTGTAGTCCA TGTTGAAGAC TATGGAAAGT TTGAAATAAT AGGCGAAACA AGAGATGATG CAGCAGGAGA AGCTTTTGAT AAGGTAGCAA GAGCCGTAGG ATTAGGATAT CCAGGAGGTC CTAAAATAGA TAAATTAGCT AAGGAAGGAA ATAGTGATGC TATAAAATTC CCAAAAGCTA ATTTCCATGA TGATAACTTA GATTTTTCAT TTAGTGGAGT TAAATCAGCT GTCTTAAATT ATCTAAATAA GATGGAAATG AAAAATGAAG AAATAAATAA AGCTGATGTT GTAGCTAGTT TCCAAAAGGC CGTAGTTGAA GTGTTAACTG ATAATGCAAT AAAAACTTGT AAAATGAGAA AGGCAGATAA AATAGCCATT GCAGGTGGAG TTGCTTCTAA TAGTGCTTTA AGAGAAAACC TTCTTAGAGA AGGAGAAAAG AGAGGAATAA AGGTTTTATT CCCATCACCA ATACTTTGTA CAGATAATGC TGCCATGATA GGAAGTGCTG CATATTTTGA ATTATTAAAG GGAAATATAT CTAAAATGAG TCTTAACGCA AAACCTAATT TAAGATTAGG AGAAAGATAG
|
Protein sequence | MNKKIILAIE SSCDETAAAV VVNGREVLSN IISSQIDIHT KFGGVVPEVA SRKHIEAINA VVEEALEVAG VTFDDIDAIA VTYGPGLVGA LLVGLQYAKG LAYSLDKPLI GVNHIEGHIS ANFIDHKDLE PPFVCLVVSG GHTFVVHVED YGKFEIIGET RDDAAGEAFD KVARAVGLGY PGGPKIDKLA KEGNSDAIKF PKANFHDDNL DFSFSGVKSA VLNYLNKMEM KNEEINKADV VASFQKAVVE VLTDNAIKTC KMRKADKIAI AGGVASNSAL RENLLREGEK RGIKVLFPSP ILCTDNAAMI GSAAYFELLK GNISKMSLNA KPNLRLGER
|
| |