Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_2649 |
Symbol | |
ID | 4206208 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 2874915 |
End bp | 2876774 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 642567197 |
Product | ATP-dependent protease |
Protein accession | YP_699884 |
Protein GI | 110803724 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1067] Predicted ATP-dependent protease |
TIGRFAM ID | [TIGR00764] lon-related putative ATP-dependent protease [TIGR02903] ATP-dependent protease, Lon family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAGTG AACTTGGAGT AGAATCTCAA GTTGAAGCAC TAAAAGATAT AATAAATAAT ATATTAGACG AAGGTGCATT TAGAGCGAGA GTTATAAGAT TTAAAGTACA AAATTATATA AATTCAACTG ATCCTTATGA AAGACTTTAT GGATTAAGTA AAATTGTTTC TGAAGGAAAG GGATTAAGTG AAGTTCCAAC TGAAGAAACT ATAAATGAAG CTTTAGAAGA TGTTTGTGCT ATGATATCAG ATGCTATTGC TAGAAGATAT GTTCAAAATA AAATAGAAAA AGAAGTTGAA CAATTCTTAA TGGAAAAGCA AGAAAAGTAT GTTGATGAAC TTAGAGTAAA CATAATGAAA AAGAAAAAAG GTCCAGAAAA TGCTAAGACA GAGAAAAAGC TTGAGGAACT TGAAGAACTA GATGAGAGAG TTCCAAATAA GAATATAATG TCTTTATTAA GACCTGATTC ATTTGATGAG GTAGTTGGTC AAGAGAGAGC TGTTAAGTCA CTTCTTTCAA AACTAGCTTC ACCATATCCT CAACATATAA TACTTTATGG ACCTCCAGGG GTTGGTAAAA CAACAGCTGC TAGAATTGCT CTAGAAACAG CTAAGAAATT AAAATCAACT CCATTTGATG ATAGATCAAA ATTCATAGAG GTTAATGGTA CAACTTTAAG ATGGGATCCA AGAGAAATCA CAAACCCACT TTTAGGTTCA GTACATGATC CAATATATCA AGGTAGCAAA AGAGACTTAG CTGAAATAGG AGTTCCAGAA CCAAAACCAG GTTTAGTTAC TGAAGCTCAC GGTGGTATAT TATTTATAGA TGAAATTGGA GAATTAGATG AAATACTTCA AAATAAACTT TTAAAAGTTT TAGAAGATAA GAGAGTTGAA TTCTCATCAT CTTACTATGA TCCAGATGAT GAAAATACAC CTAAATATAT AAAATATCTT TTTGATAAGG GAGCTCCAGC AGATTTTGTT CTAATAGGAG CAACTACTAG AGAGCCGGGA GAAATAAATC CTGCTTTACG TTCAAGATGT ACAGAGGTTT ATTTTGAACC ACTATCATCA AGAGATATTG AAAAGATAGT ATTAAATGCA GCTAAGAAGC TTAATGTTAA GCTTGAAGAA GGTTTAGAAA AGAAAATAGC TTCTTACACT ATAGAAGGTA GAAGAGCTGT AAATATATTA GCAGATGCTT ATGGTCATGC TATTTATGGC TTAGAGGGAG AAGTTCCAGA AGACTTAGAA ATAACTTCAA AGGATTTAAA TGAAGTTGTA AGCATAGGAA GATTTACTCC GTATGAAATA CTAGAAAATT TAGAGGAAAA AGAAGTAGGT CATGTTTATG GACTTGGAGT TTCAGGATTC TTAGGCTCAA CAATAGAGAT TGAAGCCACT GCTTTTAAAG CTAAGAAAAA GGGTGCTGGA AAAATAAGAT TCAATGATAC TGCTGGTTCA ATGGCTAAGG ATTCTGTATT TAATGCTGCA TCTGTAATAA AAAGGCTAAC TGATAAGGAT ATAAATGATT ATGATATACA TGTTAATGTA ATTGGTGGAG GAAAGATAGA TGGACCATCT GCTGGGGCCG CTATTACAAT ATGTATAATG AGTGCTTTAT TAGAAAAGCC AATAAGACAA GACTTAGCTA TAACTGGAGA GATTTCTTTA AGAGGAAAGA TTAAGCCAGT TGGAGGTATA TTTGAAAAAA TATACGGAGC TAGAAGAAAG GGAATTAAGT TAGTAACTGT TCCTAAAGAT AATGAAAATG AAATCCCTAA AGGATTAGAA GATATAGAAG TTAAAGCTAT AAGTTCTATA GAAGAGCTTA TGGAAATAGC TTTTAATTAA
|
Protein sequence | MNSELGVESQ VEALKDIINN ILDEGAFRAR VIRFKVQNYI NSTDPYERLY GLSKIVSEGK GLSEVPTEET INEALEDVCA MISDAIARRY VQNKIEKEVE QFLMEKQEKY VDELRVNIMK KKKGPENAKT EKKLEELEEL DERVPNKNIM SLLRPDSFDE VVGQERAVKS LLSKLASPYP QHIILYGPPG VGKTTAARIA LETAKKLKST PFDDRSKFIE VNGTTLRWDP REITNPLLGS VHDPIYQGSK RDLAEIGVPE PKPGLVTEAH GGILFIDEIG ELDEILQNKL LKVLEDKRVE FSSSYYDPDD ENTPKYIKYL FDKGAPADFV LIGATTREPG EINPALRSRC TEVYFEPLSS RDIEKIVLNA AKKLNVKLEE GLEKKIASYT IEGRRAVNIL ADAYGHAIYG LEGEVPEDLE ITSKDLNEVV SIGRFTPYEI LENLEEKEVG HVYGLGVSGF LGSTIEIEAT AFKAKKKGAG KIRFNDTAGS MAKDSVFNAA SVIKRLTDKD INDYDIHVNV IGGGKIDGPS AGAAITICIM SALLEKPIRQ DLAITGEISL RGKIKPVGGI FEKIYGARRK GIKLVTVPKD NENEIPKGLE DIEVKAISSI EELMEIAFN
|
| |