Gene Information |
Locus tag | CPR_2620 |
Symbol | |
ID | 4205233 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 2854089 |
End bp | 2855285 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 642567170 |
Product | subtilase family protein |
Protein accession | YP_699867 |
Protein GI | 110803033 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
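The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS includes its terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Protein length in aa for an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for locus CPR_2620.
length_bp = gene_length(2854089, 2855285)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the sketch yields 1197 bp and 398 aa, matching the Gene Length and Protein Length fields.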
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTTCAA TAAAACAAAA ATTGGATAGC AATCTAAAAG TTTATATAAA TCGCTCTTAT TATACAAATT ATAGAGTTCT TATAAAATGC AAAAAATTTA TGGAAGATAT AACAAAGAAA ATACCCAAGC TTAGAGGTAT TGTTATAAGA GAAATTAAGT CATTAAATTT GATTTGCGCT ATTCTTACAC CAAAAGCTAT TAATAGATTA ATTGAATATC CTGAGATAGA ATTTATTTCT TTTGATGACC ATGCTATACT TTGTGGGCTT AGTATAGGTA CTGCTAATAG AATTGCCACC AATAAATCTT TTAATTTTAC TGGCAAGAAT GTATCTATTG GTTTAATTGA TAGTGGAGTC TATCCTCATC AAGATCTAAC AAATCCTACA AATAAAATAG ATATGTTTTT AGATTTATTA AATAACTACT CTTATCCTTA TGATGATAAT GGACATGGAA CTGCTTTAAG TGGGATTATA TGTGGAAGTG GATACTCCTC AAAGCTTGTT TTTAGAGGCA TTGCAGAAAA CACCAAAATA TCTTGTGTAA AGGCCTTTGA TGCTAATGGA AAGGGCTATG TTTCAGATAT ACTTTTCGCC ATTGAAACTC TTATAAATCA AGAGAATAAT CCTATAAGAG TTTTATGTTT ACCTTTTGAA CTTACCAGCC ATAATATTAA AATATCAGAT TACTTTAACG AACTTTTTAA ATTAGCAGTT AGTAAAAATA TAATTCCTGT TGTCCCTTCC GGAAGCATAG AAGGAGATAA CACTATTCAG GGTTTAGCAT TATCTCCTTG GTGTATAACA GTTGGTGGTA TAGATTCTAC AAAGACACCA ACAACAACTT TTAAATTTTC TTCATCTGGA AATTCTAATG TGAAAAAACC AGATTTTTGT GCAGCCTGTG CTAATATAAT GTGCTTAAAC TCAGATAAAA AATATATTTC TGAAAGAAAT GGAATAAAAC TATATCCTCA TAAATTAGAT AGTAGTTACA CAGTCTTTCA AGGAACCTCC TTAGCTTGTG CCTACATATC TGGAGTATGT GCACTACTTT TAGAGGCCAA ACCAGAACTA AATTACAAAG ATTTATGTTC TTTATTAAAA ATAGCTTCTA ATAATAAATA TGAACTACCT TCTGATTCTG TTGGAGAAGG AGTCATAGAT TTATCATTTT TACTTGAAAA TATTTAA
|
Protein sequence | MFSIKQKLDS NLKVYINRSY YTNYRVLIKC KKFMEDITKK IPKLRGIVIR EIKSLNLICA ILTPKAINRL IEYPEIEFIS FDDHAILCGL SIGTANRIAT NKSFNFTGKN VSIGLIDSGV YPHQDLTNPT NKIDMFLDLL NNYSYPYDDN GHGTALSGII CGSGYSSKLV FRGIAENTKI SCVKAFDANG KGYVSDILFA IETLINQENN PIRVLCLPFE LTSHNIKISD YFNELFKLAV SKNIIPVVPS GSIEGDNTIQ GLALSPWCIT VGGIDSTKTP TTTFKFSSSG NSNVKKPDFC AACANIMCLN SDKKYISERN GIKLYPHKLD SSYTVFQGTS LACAYISGVC ALLLEAKPEL NYKDLCSLLK IASNNKYELP SDSVGEGVID LSFLLENI
|
| |
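The sequences above are displayed in space-separated blocks of 10 residues, so the spaces need to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names `normalize_sequence` and `gc_percent` are hypothetical, and how the database itself rounds or computes its GC value is not stated in the record.

```python
def normalize_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence in this record.
first_blocks = "ATGTTTTCAA TAAAACAAAA"
seq = normalize_sequence(first_blocks)
print(len(seq), gc_percent(seq))
```

Applying the same two functions to the full gene sequence would allow a check against the reported 1197 bp length and 29% GC content.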