Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_1656 |
Symbol | |
ID | 4202674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | - |
Start bp | 1871497 |
End bp | 1874418 |
Gene Length | 2922 bp |
Protein Length | 973 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 638082533 |
Product | putative peptidase |
Protein accession | YP_696097 |
Protein GI | 110799577 |
COG category | [R] General function prediction only |
COG ID | [COG1026] Predicted Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0884161 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTTA AAGAGAATAA TATTTATAGT GGATTTAAAC TTTTAAACAT AGAAAATTTA AATGAAATAG GTGGAGTAGG TTTAAGGTTT GAGCATGAAA AAACTAAGGC TAAACTTATA AAAATCCTAA GTGAAGATGA CAATAAGTGC TTTGCAATAG GATTTAGAAC ACCACCTGAA AATAGTACAG GAGTTCCTCA TATTTTAGAG CATTCAGTTT TATGTGGTTC TAGAAAATTT AATACTAAGG AACCCTTTGT AGAGCTTTTA AAAGGGTCTT TAAATACATT CTTAAATGCT ATGACATATC CAGATAAAAC AATATATCCA GTAGCATCAA GAAATGAAAA AGACTTTATG AATCTTATGG ATGTTTACTT AGATGCTGTA TTATATCCAA ATATATATAA GCATAAGGAA ATATTCATGC AAGAGGGATG GCATTATTAT ATAGAAAATA AGGAAGATGA ATTAAAGTAT AATGGTGTTG TTTATAATGA GATGAAAGGG GCATACTCAT CTCCAGATTC TATACTTTAT AGAAAGATTC CTCAAACAAT ATACCCAGAT ACTTGTTATG CCTTATCTTC AGGAGGAGAT CCTGATGAAA TACCAAATTT AACTTATGAA GAGTTTGTGG AATTTCATAA GAAATATTAT CATCCATCGA ACTCATATAT TTTCTTATAT GGTAATGGAG ATACTGAAAA AGAATTAGAA TTTATAAATG AAGAGTATTT AAAGAATTTT GAATATAAAG AGATAGATTC AGAAATAAAA GAACAAAAAT CCTTTGAAAG TATGAAAGAA GAAAGTTTTA CTTATGGAAT AGCTGAAAGT GAAGATTTAA ATCATAAAAG TTATTATAGT TTAAACTTTG TAATTGGAGA TGCCACAGAC GGAGAAAAAG GCTTAGCTTT TGATGTTTTA GCATATCTTC TAACAAGAAG CACAGCAGCA CCATTAAAGA AAGCATTAAT AGATGCAGGT ATAGGGAAAG CTGTATCAGG AGACTTTGAT AACTCAACTA AACAATCAGC CTTTACTGTT TTAGTTAAGA ATGCAGAGCT AAACAAAGAA GAAGAATTTA AAAAAGTAGT AATGGATACT TTAAAGGATT TAGTTGAAAA TGGAATAGAT AAAGAACTTA TAGAAGCTTC CATAAATAGA GTTGAATTTG AATTAAGAGA AGGAGATTAT GGTTCTTATC CTAATGGATT AATTTATTAT TTAAAAGTTA TGGATAGTTG GCTTTATGAT GGGGATCCAT ATGTTCATTT AGAATATGAA AAAAATCTTG AAAAAATAAA ATCTGCTTTA ACAAGCAATT ACTTTGAAGA TTTAATAGAA AAATATATGA TAAATAATAC TCACTCTTCA CTTGTTTCTC TTCATCCTGA AAAAGGAATA AATGAGAAAA AGTCAGCTGA ATTAAAGAAA AAGTTAGAAG AGATTAAAAA TAGTTTTGAT GAAAAGACTT TAAATGAAAT AATTGATAAT TGTAAAAAGT TAAAAGAAAG ACAAAGTACA CCTGATAAAA AAGAAGATTT AGAAAGTATT CCTATGTTAT CTTTAGAGGA TATAGATAAA GAAGCAACTA AAATTCCTAC AGAAGAGAAA GAGATAGATG GAATTACAAC ATTACACCAT GATTTCCATA CTAATAAAAT AGACTATGTT AATTTCTTCT TTAATACAAA TAGTGTTCCT GAAGATTTAA TACCTTATGT TGGATTACTA TGCGATATAT TAGGTAAGTG TGGAACAGAA AATTATGATT ATTCTAAGTT ATCAAATGCC ATAAATATAA GTACAGGTGG AATAAGTTTT GGAGCTATAA CTTTTGCTAA TCTAAAGAAA AATAATGAGT TTAGACCATA TTTAGAAATT TCATATAAAG CATTAAGCAG CAAGACTAAT AAAGCTATAG AATTAGTTGA TGAGATTGTA AATCACACTG ACTTAGATGA TATGGACAGA ATTATGCAAA TAATTAGAGA AAAGAGAGCT AGATTAGAAG GTGCTATATT CGATAGTGGT CATAGAATAG CTATGAAAAA AGTTTTATCA TACTCTACAA ATAGAGGAGC TTATGATGAA AAAATAAGTG GATTAGATTA TTATGATTTT CTAGTAAATA TAGAGAAGGA AGATAAAAAA TCAAAGATAT CAGATAGCTT AAAAAAGGTG AGAGACTTAA TCTTTAATAA GGGAAATATG CTTATAAGTT ATTCAGGAAA AGAAGAGGAA TATGAAAACT TTAAGGAAAA AGTAAAATAT TTAATAAGCA AAACAAATAA TAATGATTTT GAAAAAGAAG AATATAATTT TGAGTTAGGA AAGAAAAATG AAGGGCTTTT AACTCAAGGA AATGTACAAT ATGTAGCTAA GGGTGGAAAT TATAAAACTC ATGGATATAA GTATTCTGGT GCACTATCTT TATTAGAAAG TATTCTAGGA TTTGACTACT TATGGAATGC CGTAAGGGTT AAAGGTGGAG CTTATGGAGT GTTCTCTAAC TTTAGAAGAG ATGGCGGAGC ATATATAGTT TCATATAGAG ATCCTAATAT AAAAAGCACT TTAGAAGCTT ATGATAATAT ACCTAAGTAT TTAAATGATT TTGAAGCTGA CGAAAGAGAA ATGACTAAAT ACATCATAGG TACAATAAGA AAATATGATC AACCTATAAG CAATGGAATA AAAGGTGATA TAGCAGTTTC ATACTACTTA AGTAACTTTA CTTATGAAGA TCTTCAAAAG GAAAGAGAAG AAATCATAAA TGCAGATGTA GAAAAAATTA AGAGTTTTGC ACCTATGATT AAAGATTTAA TGAAGGAAGA CTACATCTGT GTACTAGGCA ATGAAGAAAA GATAAAAGAA AATAAAGACC TATTTAATAA TATTAAAAGT GTAATTAAAT AG
|
Protein sequence | MNFKENNIYS GFKLLNIENL NEIGGVGLRF EHEKTKAKLI KILSEDDNKC FAIGFRTPPE NSTGVPHILE HSVLCGSRKF NTKEPFVELL KGSLNTFLNA MTYPDKTIYP VASRNEKDFM NLMDVYLDAV LYPNIYKHKE IFMQEGWHYY IENKEDELKY NGVVYNEMKG AYSSPDSILY RKIPQTIYPD TCYALSSGGD PDEIPNLTYE EFVEFHKKYY HPSNSYIFLY GNGDTEKELE FINEEYLKNF EYKEIDSEIK EQKSFESMKE ESFTYGIAES EDLNHKSYYS LNFVIGDATD GEKGLAFDVL AYLLTRSTAA PLKKALIDAG IGKAVSGDFD NSTKQSAFTV LVKNAELNKE EEFKKVVMDT LKDLVENGID KELIEASINR VEFELREGDY GSYPNGLIYY LKVMDSWLYD GDPYVHLEYE KNLEKIKSAL TSNYFEDLIE KYMINNTHSS LVSLHPEKGI NEKKSAELKK KLEEIKNSFD EKTLNEIIDN CKKLKERQST PDKKEDLESI PMLSLEDIDK EATKIPTEEK EIDGITTLHH DFHTNKIDYV NFFFNTNSVP EDLIPYVGLL CDILGKCGTE NYDYSKLSNA INISTGGISF GAITFANLKK NNEFRPYLEI SYKALSSKTN KAIELVDEIV NHTDLDDMDR IMQIIREKRA RLEGAIFDSG HRIAMKKVLS YSTNRGAYDE KISGLDYYDF LVNIEKEDKK SKISDSLKKV RDLIFNKGNM LISYSGKEEE YENFKEKVKY LISKTNNNDF EKEEYNFELG KKNEGLLTQG NVQYVAKGGN YKTHGYKYSG ALSLLESILG FDYLWNAVRV KGGAYGVFSN FRRDGGAYIV SYRDPNIKST LEAYDNIPKY LNDFEADERE MTKYIIGTIR KYDQPISNGI KGDIAVSYYL SNFTYEDLQK EREEIINADV EKIKSFAPMI KDLMKEDYIC VLGNEEKIKE NKDLFNNIKS VIK
|
| |