Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_2200 |
Symbol | |
ID | 4205206 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 2429001 |
End bp | 2430329 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 642566750 |
Product | serine protease |
Protein accession | YP_699500 |
Protein GI | 110803298 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.550757 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGATT TCAATAAAAA AGATGAAGGT ATAGATAACT ACTTTGGTAT GGAAGACAAA GAAAACATAG AATCAAACAA TTATACTGAG CAAACTAATA TAGATGAAAC TAATAAATTT AATATAGATA ATGAAATTAA TTCAAAAGAT GAAGTTGAAA AAGAGGATGA TAAAAACTTC TCTGACATAA AATCAAAAAA TTCTAATGAT AATATTAAAA GTAAAAAAGT AAAGAAAAAG AGTGGGTTTA AAAGGGTAAT AGCTCTAGTA GCTGGTGCTG TCATAGTTGC TATACTAGGA GGATCTATAG GAGCTAGTGG AGTTTATTAT GCTTTTAAAA ATAGCATACC AGTAAGTACA CTAGAGAATA ATAGTAATAC CCAAGTTAAT CCACCAGCCT TTAAAGTGGA AGATGGAGCA TTAACTGTTC CTCAAGTAGT TGAAAAAGTT ACACCTGCTG TTGTAGGAGT ATCCACAAAG AGCTTAGTAA GAGATCAATT CTTTAATGTA AAAGAACAAG AAGGATTAGG ATCTGGATTT ATAATAAATG AAGATGGATA TGTAGTTACA AACTACCATG TTATAAATGG AGCTCAAGAA GTTAAAGTAA TATTCTCTGA TGGAAAAGAA GTAAATGCTA AGGTTGTAAA TTATGATGCT GAAAGAGATA TTGCAGTAAT AAAAATAACA GACAATGTTA AAATGCCTGG AATAGCACAA TTAGGAGATT CATCTATAGT TAAAGCTGGT GAAGAAGTAA TTGCTATAGG AAATCCCCTA GGAAAAGAAT TTAGCTCAAC AGTAACTAAG GGTATAATAA GTTCTCCAAA TAGAAAAATG AAGACTGAAA ATGGAAATGT ATTAGATTAT ATACAAACAG ATGCAGCTAT CAACCCAGGT AATAGTGGGG GTCCATTAAT AAACTCTAAG GGAGAAGTTA TTGGAATAAA TACGGCTAAA AAAGTTGGTG AAGATATTGA AGGTATCGGA TTTGCAATTC CTATAAATGA AGTAAAAACT AGATTAGGTT CTTTATCAAA ACCAATATTA AAACTTGGTA TTACGGCTAG AACTGTCACT CCAGAATTAG CAAAAGAAAA TAATATAGAA GAAGGAATTT ATGTTGTAGG TGTACAAGAA TTTAGTCCAG CAGAAAAATC AGGATTAAAA ATAGGTGATT TAATAATTGC TTTTGGTGGA AAAAGAGTAA AAACTTTAGA AGAATTAAAT CAGATTAAAA GTCAATATAA TGATGGAGAT TCAGTACCGA TTGAAATAAT TCGAGATGGT AAAAAAGTAA ACTTAAATTT AACATTAGTT GCTAATTAA
|
Protein sequence | MSDFNKKDEG IDNYFGMEDK ENIESNNYTE QTNIDETNKF NIDNEINSKD EVEKEDDKNF SDIKSKNSND NIKSKKVKKK SGFKRVIALV AGAVIVAILG GSIGASGVYY AFKNSIPVST LENNSNTQVN PPAFKVEDGA LTVPQVVEKV TPAVVGVSTK SLVRDQFFNV KEQEGLGSGF IINEDGYVVT NYHVINGAQE VKVIFSDGKE VNAKVVNYDA ERDIAVIKIT DNVKMPGIAQ LGDSSIVKAG EEVIAIGNPL GKEFSSTVTK GIISSPNRKM KTENGNVLDY IQTDAAINPG NSGGPLINSK GEVIGINTAK KVGEDIEGIG FAIPINEVKT RLGSLSKPIL KLGITARTVT PELAKENNIE EGIYVVGVQE FSPAEKSGLK IGDLIIAFGG KRVKTLEELN QIKSQYNDGD SVPIEIIRDG KKVNLNLTLV AN
|
| |