Gene CPR_2472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2472 
Symbol 
ID4205012 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2691376 
End bp2693181 
Gene Length1806 bp 
Protein Length601 aa 
Translation table11 
GC content34% 
IMG OID642567022 
ProductATP-dependent metalloprotease FtsH 
Protein accessionYP_699726 
Protein GI110803261 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAT TACAAAGTAT GATTACTTAT ATAGCTATTT TTGCTGTAGT GCTTATGTTT 
GCATTTGCTT TTTATAGAAA TGGAACTCAA GGAAAAGTAA TATCATATAC TGAATTTAAA
GAAGCATATG TAGGAAATAA AATTGAAACA ATGACCATAA AAGAAGATAA AATGTCAGTT
GATGGGGTTT TTAAAGATGG TAAGAGGTTT ACTTCATATG TTTCAAATAA TATGTTAGAC
AATCTTTTGC AAGAGACTAA AGGGGTAGAA ACTGTAATAA AGTATACTCC GCCTAATAAT
ATGGGTATTT GGATTAGTTT CCTTCCAACA ATACTTATAA TAGGTGTGAT ATTCTTTGGT
TTATTTATGT TCACACAGCA AGCTCAAAAC AGTGGTGGAA ATAGAGGGGT AATGAATTTC
GGTAAAAGTA AAGCTAAGAT GGCTAATTTA GACGGAAAAA AAGTTACGTT CAAAGATGTC
GCTGGAGCTG ATGAAGAAAA AGGTGAATTA GAAGAAATTG TTGATTTCTT AAAACAACCT
AAGAGATATA TAGAAATGGG AGCAAGAATA CCTAAAGGAG TTCTTTTAGT AGGGCCTCCA
GGAACAGGTA AGACACTTCT TGCAAAAGCT ATAGCAGGAG AAGCTGGTGT TCCTTTCTTT
AGTATATCAG GTTCAGATTT CGTTGAAATG TTTGTTGGAG TTGGTGCTTC AAGAGTAAGA
GATTTATTTG AGCAAGCTAA GAAAAATGCT CCATGTATTA TATTCATAGA CGAAATCGAT
GCTGTTGGTA GACAAAGAGG TGCTGGACTT GGTGGCGGTC ATGATGAGAG AGAACAAACA
TTAAACCAAT TACTAGTTGA AATGGACGGT TTTGGAGTTA ATGAAGGAAT AATTATGATA
GCTGCTACAA ATAGACCAGA TATCTTAGAT CCAGCATTAC TAAGACCAGG AAGATTTGAT
AGAAGAATTT TAGTAGGTGC ACCTGATGTT AAGGGTAGAG AAGAAGTACT AAAAGTTCAT
ACAAGAAATA AACATCTATC AGAAGATGTA GATTTAAAAG TTTTAGCTAA AATGACACCA
GGATTTAGTG GTGCAGATCT TGAAAACTTA ACTAACGAAG CTGCATTATT AGCGGTTAGA
GGTGGTAAAA GTAGCATAGA TATGTCAGAC ATTGAAGAGG CTATAACAAG AGTGATAGCT
GGGCCAGAGA AGAAGAGTAG AGTTGTTAGT GAATATGATA GAAGAATCAC TGCAGTTCAC
GAATCTGGAC ACGCTGTTGT AAGTAATGTT CTAGAGTATG CAGATCCAGT TCATGAAATA
AGTATAATTC AAAGAGGAAT GGCTGCAGGA TACACAATGA ATTTACCAGA GGAAGATAGA
ACTCACACAT CTAAGAAACA ACTTAAAGAT AAGATGGTTG AACTTTTAGG TGGAAGAGTG
GCTGAGAAAT TAGTTATTGG AGATATAAGT GCCGGTGCTA AAAACGATAT AGATAGAGCT
AGTCACATTG CTAGAAGTAT GGTTATGGAA TATGGAATGA GTGATGTTAT TGGACCTATA
TCATTTGGTA ATAGCGATGG TGGCGAAGTA TTCTTAGGTA GAGACATTGG AAAGAGTAGT
AACATAAGTG AAGAAACTAG CGCTAAAATA GATGAAGAAA TCAAGAAATT AATTGATGAA
GCTTATAATA GAGCAGAATC TATATTAAGA GAAAATATAA GTAAATTAAA TGCAGTAACT
GATGTGTTAC TTCAAAAAGA AAAAATTGAT GGTGATGAGT TTAGAGAAAT ATTTAAAAAC
TCATAG
 
Protein sequence
MKKLQSMITY IAIFAVVLMF AFAFYRNGTQ GKVISYTEFK EAYVGNKIET MTIKEDKMSV 
DGVFKDGKRF TSYVSNNMLD NLLQETKGVE TVIKYTPPNN MGIWISFLPT ILIIGVIFFG
LFMFTQQAQN SGGNRGVMNF GKSKAKMANL DGKKVTFKDV AGADEEKGEL EEIVDFLKQP
KRYIEMGARI PKGVLLVGPP GTGKTLLAKA IAGEAGVPFF SISGSDFVEM FVGVGASRVR
DLFEQAKKNA PCIIFIDEID AVGRQRGAGL GGGHDEREQT LNQLLVEMDG FGVNEGIIMI
AATNRPDILD PALLRPGRFD RRILVGAPDV KGREEVLKVH TRNKHLSEDV DLKVLAKMTP
GFSGADLENL TNEAALLAVR GGKSSIDMSD IEEAITRVIA GPEKKSRVVS EYDRRITAVH
ESGHAVVSNV LEYADPVHEI SIIQRGMAAG YTMNLPEEDR THTSKKQLKD KMVELLGGRV
AEKLVIGDIS AGAKNDIDRA SHIARSMVME YGMSDVIGPI SFGNSDGGEV FLGRDIGKSS
NISEETSAKI DEEIKKLIDE AYNRAESILR ENISKLNAVT DVLLQKEKID GDEFREIFKN
S