Gene CPF_1656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1656 
Symbol 
ID4202674 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1871497 
End bp1874418 
Gene Length2922 bp 
Protein Length973 aa 
Translation table11 
GC content28% 
IMG OID638082533 
Productputative peptidase 
Protein accessionYP_696097 
Protein GI110799577 
COG category[R] General function prediction only 
COG ID[COG1026] Predicted Zn-dependent peptidases, insulinase-like 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0884161 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTTA AAGAGAATAA TATTTATAGT GGATTTAAAC TTTTAAACAT AGAAAATTTA 
AATGAAATAG GTGGAGTAGG TTTAAGGTTT GAGCATGAAA AAACTAAGGC TAAACTTATA
AAAATCCTAA GTGAAGATGA CAATAAGTGC TTTGCAATAG GATTTAGAAC ACCACCTGAA
AATAGTACAG GAGTTCCTCA TATTTTAGAG CATTCAGTTT TATGTGGTTC TAGAAAATTT
AATACTAAGG AACCCTTTGT AGAGCTTTTA AAAGGGTCTT TAAATACATT CTTAAATGCT
ATGACATATC CAGATAAAAC AATATATCCA GTAGCATCAA GAAATGAAAA AGACTTTATG
AATCTTATGG ATGTTTACTT AGATGCTGTA TTATATCCAA ATATATATAA GCATAAGGAA
ATATTCATGC AAGAGGGATG GCATTATTAT ATAGAAAATA AGGAAGATGA ATTAAAGTAT
AATGGTGTTG TTTATAATGA GATGAAAGGG GCATACTCAT CTCCAGATTC TATACTTTAT
AGAAAGATTC CTCAAACAAT ATACCCAGAT ACTTGTTATG CCTTATCTTC AGGAGGAGAT
CCTGATGAAA TACCAAATTT AACTTATGAA GAGTTTGTGG AATTTCATAA GAAATATTAT
CATCCATCGA ACTCATATAT TTTCTTATAT GGTAATGGAG ATACTGAAAA AGAATTAGAA
TTTATAAATG AAGAGTATTT AAAGAATTTT GAATATAAAG AGATAGATTC AGAAATAAAA
GAACAAAAAT CCTTTGAAAG TATGAAAGAA GAAAGTTTTA CTTATGGAAT AGCTGAAAGT
GAAGATTTAA ATCATAAAAG TTATTATAGT TTAAACTTTG TAATTGGAGA TGCCACAGAC
GGAGAAAAAG GCTTAGCTTT TGATGTTTTA GCATATCTTC TAACAAGAAG CACAGCAGCA
CCATTAAAGA AAGCATTAAT AGATGCAGGT ATAGGGAAAG CTGTATCAGG AGACTTTGAT
AACTCAACTA AACAATCAGC CTTTACTGTT TTAGTTAAGA ATGCAGAGCT AAACAAAGAA
GAAGAATTTA AAAAAGTAGT AATGGATACT TTAAAGGATT TAGTTGAAAA TGGAATAGAT
AAAGAACTTA TAGAAGCTTC CATAAATAGA GTTGAATTTG AATTAAGAGA AGGAGATTAT
GGTTCTTATC CTAATGGATT AATTTATTAT TTAAAAGTTA TGGATAGTTG GCTTTATGAT
GGGGATCCAT ATGTTCATTT AGAATATGAA AAAAATCTTG AAAAAATAAA ATCTGCTTTA
ACAAGCAATT ACTTTGAAGA TTTAATAGAA AAATATATGA TAAATAATAC TCACTCTTCA
CTTGTTTCTC TTCATCCTGA AAAAGGAATA AATGAGAAAA AGTCAGCTGA ATTAAAGAAA
AAGTTAGAAG AGATTAAAAA TAGTTTTGAT GAAAAGACTT TAAATGAAAT AATTGATAAT
TGTAAAAAGT TAAAAGAAAG ACAAAGTACA CCTGATAAAA AAGAAGATTT AGAAAGTATT
CCTATGTTAT CTTTAGAGGA TATAGATAAA GAAGCAACTA AAATTCCTAC AGAAGAGAAA
GAGATAGATG GAATTACAAC ATTACACCAT GATTTCCATA CTAATAAAAT AGACTATGTT
AATTTCTTCT TTAATACAAA TAGTGTTCCT GAAGATTTAA TACCTTATGT TGGATTACTA
TGCGATATAT TAGGTAAGTG TGGAACAGAA AATTATGATT ATTCTAAGTT ATCAAATGCC
ATAAATATAA GTACAGGTGG AATAAGTTTT GGAGCTATAA CTTTTGCTAA TCTAAAGAAA
AATAATGAGT TTAGACCATA TTTAGAAATT TCATATAAAG CATTAAGCAG CAAGACTAAT
AAAGCTATAG AATTAGTTGA TGAGATTGTA AATCACACTG ACTTAGATGA TATGGACAGA
ATTATGCAAA TAATTAGAGA AAAGAGAGCT AGATTAGAAG GTGCTATATT CGATAGTGGT
CATAGAATAG CTATGAAAAA AGTTTTATCA TACTCTACAA ATAGAGGAGC TTATGATGAA
AAAATAAGTG GATTAGATTA TTATGATTTT CTAGTAAATA TAGAGAAGGA AGATAAAAAA
TCAAAGATAT CAGATAGCTT AAAAAAGGTG AGAGACTTAA TCTTTAATAA GGGAAATATG
CTTATAAGTT ATTCAGGAAA AGAAGAGGAA TATGAAAACT TTAAGGAAAA AGTAAAATAT
TTAATAAGCA AAACAAATAA TAATGATTTT GAAAAAGAAG AATATAATTT TGAGTTAGGA
AAGAAAAATG AAGGGCTTTT AACTCAAGGA AATGTACAAT ATGTAGCTAA GGGTGGAAAT
TATAAAACTC ATGGATATAA GTATTCTGGT GCACTATCTT TATTAGAAAG TATTCTAGGA
TTTGACTACT TATGGAATGC CGTAAGGGTT AAAGGTGGAG CTTATGGAGT GTTCTCTAAC
TTTAGAAGAG ATGGCGGAGC ATATATAGTT TCATATAGAG ATCCTAATAT AAAAAGCACT
TTAGAAGCTT ATGATAATAT ACCTAAGTAT TTAAATGATT TTGAAGCTGA CGAAAGAGAA
ATGACTAAAT ACATCATAGG TACAATAAGA AAATATGATC AACCTATAAG CAATGGAATA
AAAGGTGATA TAGCAGTTTC ATACTACTTA AGTAACTTTA CTTATGAAGA TCTTCAAAAG
GAAAGAGAAG AAATCATAAA TGCAGATGTA GAAAAAATTA AGAGTTTTGC ACCTATGATT
AAAGATTTAA TGAAGGAAGA CTACATCTGT GTACTAGGCA ATGAAGAAAA GATAAAAGAA
AATAAAGACC TATTTAATAA TATTAAAAGT GTAATTAAAT AG
 
Protein sequence
MNFKENNIYS GFKLLNIENL NEIGGVGLRF EHEKTKAKLI KILSEDDNKC FAIGFRTPPE 
NSTGVPHILE HSVLCGSRKF NTKEPFVELL KGSLNTFLNA MTYPDKTIYP VASRNEKDFM
NLMDVYLDAV LYPNIYKHKE IFMQEGWHYY IENKEDELKY NGVVYNEMKG AYSSPDSILY
RKIPQTIYPD TCYALSSGGD PDEIPNLTYE EFVEFHKKYY HPSNSYIFLY GNGDTEKELE
FINEEYLKNF EYKEIDSEIK EQKSFESMKE ESFTYGIAES EDLNHKSYYS LNFVIGDATD
GEKGLAFDVL AYLLTRSTAA PLKKALIDAG IGKAVSGDFD NSTKQSAFTV LVKNAELNKE
EEFKKVVMDT LKDLVENGID KELIEASINR VEFELREGDY GSYPNGLIYY LKVMDSWLYD
GDPYVHLEYE KNLEKIKSAL TSNYFEDLIE KYMINNTHSS LVSLHPEKGI NEKKSAELKK
KLEEIKNSFD EKTLNEIIDN CKKLKERQST PDKKEDLESI PMLSLEDIDK EATKIPTEEK
EIDGITTLHH DFHTNKIDYV NFFFNTNSVP EDLIPYVGLL CDILGKCGTE NYDYSKLSNA
INISTGGISF GAITFANLKK NNEFRPYLEI SYKALSSKTN KAIELVDEIV NHTDLDDMDR
IMQIIREKRA RLEGAIFDSG HRIAMKKVLS YSTNRGAYDE KISGLDYYDF LVNIEKEDKK
SKISDSLKKV RDLIFNKGNM LISYSGKEEE YENFKEKVKY LISKTNNNDF EKEEYNFELG
KKNEGLLTQG NVQYVAKGGN YKTHGYKYSG ALSLLESILG FDYLWNAVRV KGGAYGVFSN
FRRDGGAYIV SYRDPNIKST LEAYDNIPKY LNDFEADERE MTKYIIGTIR KYDQPISNGI
KGDIAVSYYL SNFTYEDLQK EREEIINADV EKIKSFAPMI KDLMKEDYIC VLGNEEKIKE
NKDLFNNIKS VIK