Gene CPF_0866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0866 
Symbol 
ID4202019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1029967 
End bp1031742 
Gene Length1776 bp 
Protein Length591 aa 
Translation table11 
GC content31% 
IMG OID638081749 
ProductM24 family metallopeptidase 
Protein accessionYP_695316 
Protein GI110800352 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.23062 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGTGA CTGAAAGATT AGAAAAATTA AGAAAAATTA TGAAGGATAA AGGAATTGAC 
TATTATATAA TTCCTAGTGA GGATGCTCAT CAAAGTGAAT ATGTATGCGA ACATTATAGG
GGGAGAGCAT ATATGTCAGG TTTTACAGGG TCAGCAGGAA CTTTACTTGT TGGTTTAGAA
AATGATATTT TGTGGACTGA CGGTAGATAT TTCATACAAG CTTTAGAGGA GTTAAAGGGT
TCTGGAATTG AAATGTTTAA AATGAGAATT CCAGGATGGC CAAGCTTATT AGAATGGCTT
AAAGAAAATG CAAAGGCAGG AGAAACTATT GCTTTTGATG GAAAGGTATT TTCCGTAGGA
GAATATAAAG ATTTTAAAAA ATTAGAGAAA GAAAATAATA TTAATATAAA AATAGATGAG
GACCTTTTAG ATGAGGTTTG GAAAGAGAGA CCATCTCTTC CTAAGGAAAA GGCATTTTTA
CATGAAGTTA AGTACTGTGG AAAAAGTGCG AAAGAAAAAT TAAGAGAAGT TAGAGAGGAA
ATGAAAAAGC TAGGCGCTAA TAATTATATT ATAGCTTCTT TAGATGACAT AGCTTGGCTT
TATAATATTA GAGGAAATGA TGTTAAATGC AATCCTGTAG TTTTAAGCTA TGCCTTAGTT
AAAGAAAATG AAGCATATCT TTATGTAGAT AAATCAAAGT TCACTTCTAA AATGGAGGAA
GAACTTTTAA ATGAAGGGGT AACTTTAAAA TCATATGAGA AAATTGGAGA GGATATTAGT
AATTTAGAAG GAAAGATTTT AATTGATCCA AATAAGATAA GTGCTTATTT ATACGAGTGC
ATTAAGGATA AAAATAATAT TGTGGAATTT GGAAACATAA CAACTAAGTT TAAGGCTATT
AAGAATGAAG TTGAATTAGA TAACTTAAGA AAGTGTCAAG TTAGAGATGG ATTAGCTATG
GTTAAGTTTA TGAAATGGCT TAAGGATAAC ATTGGAAAGA TAGAAATAAG TGAAATATCA
GCTTCAGATA AGTTAGAAGA GCTTAGAAGT TTAGATAAGT TATTTAAAGG AATTAGTTTT
GAAACTATAG CAGGGCATAA AGAACATGGT GCTATGATGC ATTATTCAGC AACTAAAGAG
AGTGATTACA CTTTAGAACC AAGAGGATTT TTATTAATAG ATTCAGGTGG ACAATACTTA
GATGGAACTA CAGATATAAC AAGAACTTTT GTTTTAGGAG AATTAACTGA GGAAGAGAGA
AAAGATTATA CTCTAGTTTT AAAAGGGCAT ATAGGCCTTA TGAGAGCTAA ATTCTTAAAG
GGAACAACTG GATCAGCCCT TGATATAAAA GCTAGAGAAC CATTATGGAA TGAAGGAATT
GATTATAAAT GTGGAACAGG TCATGGAGTT GGATTTTTCT TAAATGTTCA TGAAGGACCA
CAAAGCATAA GTCCAGTACC AAATAAGGTT GCCTTAGAGC CAGGAATGAT TATAACTAAT
GAACCTGGAG TTTATAGAGA AGGAAAACAT GGAATAAGAA CAGAGAATAC AATGGTAGTT
GTTAAAGATA CTTATTCAGA AGAGTTTGGA GAATTTTATA AGTTTGATAC TATTTCACTT
TGTCCAATAG ATTTAGAAGG ATTAGATATA AGCTTATTAA ATGAAGAGGA AAAGGATTGG
CTAAATAATT ATCATAAAAA GGTTTATGAT TTATTATCAC CATATTTAGA TGAAGAGGAA
AAAGAATTAT TAAAGAATGA AACAAGGGAA ATATAA
 
Protein sequence
MKVTERLEKL RKIMKDKGID YYIIPSEDAH QSEYVCEHYR GRAYMSGFTG SAGTLLVGLE 
NDILWTDGRY FIQALEELKG SGIEMFKMRI PGWPSLLEWL KENAKAGETI AFDGKVFSVG
EYKDFKKLEK ENNINIKIDE DLLDEVWKER PSLPKEKAFL HEVKYCGKSA KEKLREVREE
MKKLGANNYI IASLDDIAWL YNIRGNDVKC NPVVLSYALV KENEAYLYVD KSKFTSKMEE
ELLNEGVTLK SYEKIGEDIS NLEGKILIDP NKISAYLYEC IKDKNNIVEF GNITTKFKAI
KNEVELDNLR KCQVRDGLAM VKFMKWLKDN IGKIEISEIS ASDKLEELRS LDKLFKGISF
ETIAGHKEHG AMMHYSATKE SDYTLEPRGF LLIDSGGQYL DGTTDITRTF VLGELTEEER
KDYTLVLKGH IGLMRAKFLK GTTGSALDIK AREPLWNEGI DYKCGTGHGV GFFLNVHEGP
QSISPVPNKV ALEPGMIITN EPGVYREGKH GIRTENTMVV VKDTYSEEFG EFYKFDTISL
CPIDLEGLDI SLLNEEEKDW LNNYHKKVYD LLSPYLDEEE KELLKNETRE I