Gene CPR_2202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2202 
Symbolgcp 
ID4204126 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2431480 
End bp2432499 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content34% 
IMG OID642566752 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_699502 
Protein GI110801551 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAA AAATTATATT AGCAATAGAA AGTAGTTGTG ACGAAACAGC GGCAGCTGTA 
GTAGTCAATG GTAGAGAAGT TTTATCAAAT ATAATATCTT CTCAGATAGA TATACATACA
AAATTTGGAG GAGTAGTTCC AGAGGTTGCA TCAAGAAAAC ACATAGAAGC TATAAATGCA
GTGGTAGAGG AAGCCTTAGA AGTTGCTGGA GTAACATTTG ATGACATAGA TGCAATAGCA
GTTACATATG GTCCAGGTTT AGTTGGAGCA CTTTTAGTAG GACTTCAATA TGCTAAAGGA
TTAGCATACT CTTTAGATAA ACCATTAATA GGAGTTAATC ATATAGAAGG GCATATAAGT
GCTAACTTTA TAGATCATAA GGACTTAGAG CCACCTTTTG TTTGCTTAGT TGTTTCAGGA
GGACATACTT TTGTAGTCCA TGTTGAAGAC TATGGAAAGT TTGAAATAAT AGGCGAAACA
AGAGATGATG CAGCAGGAGA AGCTTTTGAT AAGGTAGCAA GAGCCGTAGG ATTAGGATAT
CCAGGAGGTC CTAAAATAGA TAAATTAGCT AAGGAAGGAA ATAGTGATGC TATAAAATTC
CCAAAAGCTA ATTTCCATGA TGATAACTTA GATTTTTCAT TTAGTGGAGT TAAATCAGCT
GTCTTAAATT ATCTAAATAA GATGGAAATG AAAAATGAAG AAATAAATAA AGCTGATGTT
GTAGCTAGTT TCCAAAAGGC CGTAGTTGAA GTGTTAACTG ATAATGCAAT AAAAACTTGT
AAAATGAGAA AGGCAGATAA AATAGCCATT GCAGGTGGAG TTGCTTCTAA TAGTGCTTTA
AGAGAAAACC TTCTTAGAGA AGGAGAAAAG AGAGGAATAA AGGTTTTATT CCCATCACCA
ATACTTTGTA CAGATAATGC TGCCATGATA GGAAGTGCTG CATATTTTGA ATTATTAAAG
GGAAATATAT CTAAAATGAG TCTTAACGCA AAACCTAATT TAAGATTAGG AGAAAGATAG
 
Protein sequence
MNKKIILAIE SSCDETAAAV VVNGREVLSN IISSQIDIHT KFGGVVPEVA SRKHIEAINA 
VVEEALEVAG VTFDDIDAIA VTYGPGLVGA LLVGLQYAKG LAYSLDKPLI GVNHIEGHIS
ANFIDHKDLE PPFVCLVVSG GHTFVVHVED YGKFEIIGET RDDAAGEAFD KVARAVGLGY
PGGPKIDKLA KEGNSDAIKF PKANFHDDNL DFSFSGVKSA VLNYLNKMEM KNEEINKADV
VASFQKAVVE VLTDNAIKTC KMRKADKIAI AGGVASNSAL RENLLREGEK RGIKVLFPSP
ILCTDNAAMI GSAAYFELLK GNISKMSLNA KPNLRLGER