Gene Information |
Locus tag | CPR_0833 |
Symbol | cloSI |
ID | 4206086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 962323 |
End bp | 963891 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 642565392 |
Product | alpha-clostripain |
Protein accession | YP_698158 |
Protein GI | 110803836 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02806] clostripain |
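The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes 1-based, inclusive coordinates and an unspliced CDS whose last codon is a stop codon; the function names are hypothetical and not part of any database API.

```python
# Illustrative check of the coordinate arithmetic in the record above.
# Function names are hypothetical; coordinates are assumed 1-based and inclusive.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # subtract the terminal stop codon


length = gene_length_bp(962323, 963891)
print(length, protein_length_aa(length))  # prints "1569 522"
```

Both values match the Gene Length (1569 bp) and Protein Length (522 aa) fields.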
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00458898 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTTTAAGA AAAAGTTATC ATTATTGATG GCTACTATTA CCATTGGTAG CGTTCTATTA GGAGGAGCTT CAACAGTAAG TGCTGCTCCA AGACAAAAGC ATAAAACTGT ATCAGAGAAA ATTAAAGAGG CTGAAAAGAC TGAAGGTGAT AAAAAACTAA CGGTTATGGT ATATGCTGAT TGTGATAATA ACCTAGAAGA GTACATACTT AATGATATTG AAGAAATGAA AGAGGGTTAT AAAAATAATC CTAATCTAAA TATTGTAGTA TTAGTTGATA GAATTCCTGG TTATTCTAAT GATTCAAAAG TGTTAGGTTC TAATTTTGAA GACACAAGAC TTTATAAAAT AGGTGAAAAT TCTGCTGAAA GAATTAGTGG GAAAAGTGAA TTCCCAGAGA TAACTACTAC AAGTAATTAT GAAGCTAACA TGGGAGATGC AAATACTTTA AAAAAATTCA TTAAATTCTG TAAAAAGAAC TATGAAGCTG ATAAGTATAT GTTAATCATG TCTAACCATG GTGGTGGGGC TAAGGATGAT AAAGATAGAG CTAGTACTGT AAATAAGGCA ATATGCTGGG ATGATAGTAA CAACAAAGAT TGTCTTTATA CTGGAGAAAT TTCAGATGTT TTAACTAAGG ATGAATCTGT TGATGTTTTA GTATTTGATG CTTGTTTAAT GGGAACTTCA GAGGTAGCTT ATCAATATAG ACCTAATAAT GGAAGTTTTG AAGCTAAAAC ACTTGTAGCA TCAGCACCAG TTGTTTGGGG AAATGGATAT CCTTATGATA AAATATTTTC TCGTTTAAAA TCTACTAAAG GTGACAATGG AGAAGTTGAT TCTACTTTAG GAGGAAAGGA AAAAATATTT GAGCCATCAC TTGTTACAAA TAATGAACTT GGTGCCTTAT TTGTAGAAGA ACAACGTGAT TCTGTAAATA GCTATGGAGT AACAGATCAA CAATTAAGTT GCTATGATCT TTCAAAAATT GAAACGGTTA AAAAATCAGT TGATGCTTTA GCTAGAAATT TAAGCAAAAA TAATAAAAAA GATGCTATTG AAAACTTAAG AGGTACAGGA AAAAATGCAC CAACTATGCA CTATTTTAAG AATTATGATG AATATGAATG GATTGAGTAT CCATACTTTG ATTTATATGA TTTATGTGAA AAAATTAGCT TAAGTAATGA ATTTGATGAA ACAACTAAAA AATTATCAAA GAATGTTATG AAAAATGTTG ACCAATTAAT TTTATATTCA TTTGCAGGTA ATGACTTTAA AGGCTTCAAA GAAGGAAAAA ATGGTATAAG TATTTTCCTT CCTGATGGTA ATAGAAATTA TTATGATCAA TATTCAGGAC AAGTAATACC ACATTGGGCT ATTCAAAGAT GGTACAATCC TCTAGACACT AATGCTTATA GATTAAGAAG TGGATATGGA AAACTAGCAT GGTGTAAAGA TGGATTAGAT CCAAAAATAA ATAAAGTTGG TAACTGGTTT GAACTTTTAG ATTCTTGGTT TGATAAAGAT AATACTTCAC TTGGTGGATA TAATAGATAT AGATATTAA
Protein sequence | MFKKKLSLLM ATITIGSVLL GGASTVSAAP RQKHKTVSEK IKEAEKTEGD KKLTVMVYAD CDNNLEEYIL NDIEEMKEGY KNNPNLNIVV LVDRIPGYSN DSKVLGSNFE DTRLYKIGEN SAERISGKSE FPEITTTSNY EANMGDANTL KKFIKFCKKN YEADKYMLIM SNHGGGAKDD KDRASTVNKA ICWDDSNNKD CLYTGEISDV LTKDESVDVL VFDACLMGTS EVAYQYRPNN GSFEAKTLVA SAPVVWGNGY PYDKIFSRLK STKGDNGEVD STLGGKEKIF EPSLVTNNEL GALFVEEQRD SVNSYGVTDQ QLSCYDLSKI ETVKKSVDAL ARNLSKNNKK DAIENLRGTG KNAPTMHYFK NYDEYEWIEY PYFDLYDLCE KISLSNEFDE TTKKLSKNVM KNVDQLILYS FAGNDFKGFK EGKNGISIFL PDGNRNYYDQ YSGQVIPHWA IQRWYNPLDT NAYRLRSGYG KLAWCKDGLD PKINKVGNWF ELLDSWFDKD NTSLGGYNRY RY
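The gene and protein sequences are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate the DNA so the result can be compared with the listed protein. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons); the helper names are hypothetical, and only the first three blocks of the gene sequence are used as input.

```python
# Hypothetical helpers for working with the block-formatted sequences above.

BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces between ten-character blocks and uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
fragment = clean_sequence("ATGTTTAAGA AAAAGTTATC ATTATTGATG")
print(translate(fragment))  # prints "MFKKKLSLLM"
```

The translated fragment matches the first block of the listed protein sequence. Passing the full gene sequence through the same functions would allow the GC content and Protein Length fields to be checked as well.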