Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_2971 |
Symbol | |
ID | 4201430 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | - |
Start bp | 3234588 |
End bp | 3236447 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 638083838 |
Product | ATP-dependent protease |
Protein accession | YP_697325 |
Protein GI | 110800560 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1067] Predicted ATP-dependent protease |
TIGRFAM ID | [TIGR00764] lon-related putative ATP-dependent protease [TIGR02903] ATP-dependent protease, Lon family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAGTG AACTTGGAGT AGAATCTCAA GTTGAAGCAC TAAAAGATAT AATAAATAAT ATATTAGACG AAGGTGCATT TAGAGCGAGA GTTATAAGAT TTAAAGTACA AAATTATATA AATTCAACTG ATCCTTATGA AAGACTTTAT GGATTAAGTA AAATTGTTTC TGAAGGAAAG GGTTTAAGTG AAGTTCCAAC TGAAGAAACT ATAAATGAAG CTTTAGAAGA TGTTTGTGCT ATGATATCAG ATGCTATTGC TAGAAGATAT GTCCAAAATA AAATAGAAAA AGAAGTTGAA CAATTCTTAA TGGAAAAGCA AGAAAAGTAT GTTGATGAAC TTAGAGTAAA CATAATGAAA AAGAAAAAAG GTCCAGAAAA TGCTAAGACA GAGAAAAAGC TTGAGGAACT TGAAGAACTA GATGAAAGAG TTCCAAATAA GAATATAATG TCTTTATTAA GACCTGATTC ATTTGATGAG GTAGTTGGTC AAGAGAGAGC TGTTAAGTCA CTTCTTTCAA AACTAGCTTC ACCATATCCT CAACATATAA TACTTTATGG ACCTCCAGGG GTTGGTAAAA CAACAGCTGC TAGAATTGCT TTAGAAACAG CTAAGAAATT AAAATCAACT CCATTTGATG ATAGATCAAA ATTCATAGAG GTTAATGGTA CAACTTTAAG ATGGGATCCA AGAGAAATCA CAAACCCACT TTTAGGTTCA GTACATGATC CAATATATCA AGGTAGCAAA AGAGACTTAG CTGAAATAGG AGTTCCAGAA CCAAAACCAG GTTTAGTTAC TGAAGCTCAT GGTGGTATAT TATTCATAGA TGAAATTGGA GAATTAGATG AAATACTTCA AAATAAACTT TTAAAAGTTT TAGAAGATAA GAGAGTTGAA TTCTCATCAT CTTACTATGA TCCAGATGAT GAAAATACAC CTAAATATAT AAAATATCTT TTTGATAAGG GAGCTCCAGC AGACTTTGTT CTAATAGGAG CAACTACTAG AGAACCAGGA GAAATCAATC CTGCTTTACG TTCAAGATGT ACAGAGGTTT ATTTTGAACC ACTATCATCA AGAGATATTG AAAAGATAGT ATTAAATGCA GCTAAGAAGC TTAATGTTAA GCTTGAAGAA GGTTTAGAAA AGAAAATAGC TTCTTATACT ATAGAAGGTA GAAGAGCTGT AAATATATTA GCAGATGCTT ATGGTCATGC CATTTATGGC TTAGAGGGAG AAGTTCCAGA AGACTTAGAA ATAACTTCAA AGGATTTAAA TGAAGTTGTA AGCATAGGAA GATTTACTCC GTATGAAATA CTAGAAAATT TAGAGGAAAA AGAAGTAGGT CATGTTTATG GACTTGGAGT TTCAGGATTC TTAGGCTCAA CAATAGAGAT TGAAGCCACT GCTTTTAAAG CTAAGAAAAA GGGTGCTGGA AAAATAAGAT TCAATGATAC TGCTGGTTCA ATGGCTAAGG ATTCTGTATT TAATGCTGCA TCTGTAATAA AAAGGCTAAC TGATAAGGAT ATAAATGACT ATGATATACA TGTTAACGTA ATTGGTGGAG GAAAGATAGA TGGACCATCT GCTGGAGCTG CCATTACCAT ATGTATAATG AGTGCTTTAT TAGAAAAGCC AATAAGACAA GACTTAGCTA TAACTGGAGA GATTTCTTTA AGAGGAAAGA TTAAACCAGT TGGAGGTATA TTTGAAAAAA TATACGGGGC TAGAAGAAAG GGAATTAAGT TAGTAACTGT TCCTAAAGAT AATGAAAATG AAATCCCTAA AGGATTAGAA GATATAGAAG TTAAAGCTAT AAGTTCTATA GAAGAGCTTA TGGAAATTGC TTTTAATTAA
|
Protein sequence | MNSELGVESQ VEALKDIINN ILDEGAFRAR VIRFKVQNYI NSTDPYERLY GLSKIVSEGK GLSEVPTEET INEALEDVCA MISDAIARRY VQNKIEKEVE QFLMEKQEKY VDELRVNIMK KKKGPENAKT EKKLEELEEL DERVPNKNIM SLLRPDSFDE VVGQERAVKS LLSKLASPYP QHIILYGPPG VGKTTAARIA LETAKKLKST PFDDRSKFIE VNGTTLRWDP REITNPLLGS VHDPIYQGSK RDLAEIGVPE PKPGLVTEAH GGILFIDEIG ELDEILQNKL LKVLEDKRVE FSSSYYDPDD ENTPKYIKYL FDKGAPADFV LIGATTREPG EINPALRSRC TEVYFEPLSS RDIEKIVLNA AKKLNVKLEE GLEKKIASYT IEGRRAVNIL ADAYGHAIYG LEGEVPEDLE ITSKDLNEVV SIGRFTPYEI LENLEEKEVG HVYGLGVSGF LGSTIEIEAT AFKAKKKGAG KIRFNDTAGS MAKDSVFNAA SVIKRLTDKD INDYDIHVNV IGGGKIDGPS AGAAITICIM SALLEKPIRQ DLAITGEISL RGKIKPVGGI FEKIYGARRK GIKLVTVPKD NENEIPKGLE DIEVKAISSI EELMEIAFN
|
| |