Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_0840 |
Symbol | cloSI |
ID | 4201384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | + |
Start bp | 993403 |
End bp | 994971 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 638081724 |
Product | alpha-clostripain |
Protein accession | YP_695291 |
Protein GI | 110798971 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02806] clostripain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.832986 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTAAGA AAAAGTTATC ATTATTGATG GCTACTATTA CCATTGGTAG CGTTCTATTA GGAGGAGTTT CAACAGTAAG TGCTGCTCCA AGACAAAAGC ATAAAACTGT ATCAGAGAAA ATTAAAGAGG CCGAAAAGAC TGAAGGTGAT AAAAAACTAA CGGTTATGGT ATATGCTGAT TGTGATAATA ACCTAGAAGA GTACATACTT AATGATATTG AAGAAATGAA AGAGGGTTAT AAAAATAATC CTAATCTAAA TATTATAGTA TTAGTTGATA GAATTCCTGG TTATTCTAAT GATTCAAAAG TGTTAGGTTC TAATTTTGAA GACACAAGAC TTTATAAAAT AGGTGAAAAT TCTGCTGAAA GAATTAGTGG GAAAAGTGAA TTTCCAGAGA TAACTACTAC AAGTAATTAT GAAGCTAACA TGGGAGATGC TAATACTTTA AAAAAATTCA TTAAATTCTG TAAAAAGAAC TATGAAGCTG ATAAGTATAT GTTAATCATG TCTAACCATG GTGGTGGGGC TAAGGATGAT AAAGATAGAG CTAGTACTGT AAATAAGGCA ATATGCTGGG ATGATAGTAA CAACAAAGAT TGTCTTTATA CTGGTGAAAT TTCAGATGTT TTAACTAAGG ATGAATCTGT TGATGTTTTA GTATTTGATG CTTGTCTAAT GGGAAATTCA GAGGTAGCTT ATCAATATAG ACCTAATAAT GGAAGTTTTG AAGCTAAAAC ACTTGTAGCA TCAGCACCAG TTGTTTGGGG TTTTGGATAT CCTTATGACA AAATATTTTC TCGTTTAAGA TCTACTAAAG GTGACAATGG AGAAGTTGAT TCTACTTTAG GAGGAAAAGA AAAAATATTT GATCCATCTA CAGTTACAAA TAATGAACTT GGTGCCTTAT TTGTAGAAGA GCAGCGTGAT TCTGTAAATA GCTGTGGAGT AACAGATCAA CAACTAAGTT GCTACGATCT TTCAAAAATT GAAAAAGTTA AAAAATCAGT TGATACTTTA GCTAGAAATT TAAGTAAAAA TAATAAAAAA GATGCTATTG AAAGCTTAAG AGGTACAGGA AAAAATGCAC CAACTATGCA CTATTTTAAG AATTATGATG AATATGAATG GATTGAATAT CCATACTTTG ATTTATATGA TTTATGTGAA AAAATTAGTT TAAGTGATGA ATTTAATGAA ACTACTAAAA AATTATCAAA GAATGTTATG AAAAATGTTG ACCAATTAAT TTTATATTCA TTTGCAGGTA ATGACTTTAA AGGCTTCAAA GAAGGGAAAA ATGGTATAAG TATTTTCCTT CCTGATGGTA ATAGAAATTA TTATGATCAA TATTCTGGAC AAGCGATACC ACATTGGGCT ATTCAAAGAT GGTACAATCC TTTAGACACT AATGCTTATA GATTAAGAAG TGGATATGGA AAACTATCAT GGTGTAAAGA TGGATTAGAT CCAAAAATAA ATAAAGTTGG TAACTGGTTT GAACTTTTAG ATTCTTGGTT CGATAAAGAT AATACTTCAC TTGGTGGATA TAATAGATAT AGATATTAA
|
Protein sequence | MFKKKLSLLM ATITIGSVLL GGVSTVSAAP RQKHKTVSEK IKEAEKTEGD KKLTVMVYAD CDNNLEEYIL NDIEEMKEGY KNNPNLNIIV LVDRIPGYSN DSKVLGSNFE DTRLYKIGEN SAERISGKSE FPEITTTSNY EANMGDANTL KKFIKFCKKN YEADKYMLIM SNHGGGAKDD KDRASTVNKA ICWDDSNNKD CLYTGEISDV LTKDESVDVL VFDACLMGNS EVAYQYRPNN GSFEAKTLVA SAPVVWGFGY PYDKIFSRLR STKGDNGEVD STLGGKEKIF DPSTVTNNEL GALFVEEQRD SVNSCGVTDQ QLSCYDLSKI EKVKKSVDTL ARNLSKNNKK DAIESLRGTG KNAPTMHYFK NYDEYEWIEY PYFDLYDLCE KISLSDEFNE TTKKLSKNVM KNVDQLILYS FAGNDFKGFK EGKNGISIFL PDGNRNYYDQ YSGQAIPHWA IQRWYNPLDT NAYRLRSGYG KLSWCKDGLD PKINKVGNWF ELLDSWFDKD NTSLGGYNRY RY
|
| |