Gene Information |
Locus tag | Cphy_1747 |
Symbol | |
ID | 5741421 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 2151434 |
End bp | 2156131 |
Gene Length | 4698 bp |
Protein Length | 1565 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641292847 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001558858 |
Protein GI | 160879890 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
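The coordinate and length fields in this table can be cross-checked against each other. The sketch below is a minimal consistency check, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2151434
end_bp = 2156131
gene_length_bp = 4698
protein_length_aa = 1565

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons; the final
# codon is assumed to be the stop codon and encodes no residue.
codons, remainder = divmod(span_bp, 3)

print(span_bp)     # 4698
print(remainder)   # 0
print(codons - 1)  # 1565
```

Under these assumptions the three reported values agree: the span equals the gene length, and the codon count minus the stop codon equals the protein length.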
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0213398 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAATA AGAGTATGAA ACGTATGCTG AGCCTGTTAG TCAGCATACT GATGATCTTC GGATCAATCT GGTTGACGCC GGGAGTGGCA AGTGCGGCGG ATATTGGTTC TCAGAAGCTG CAGACCTCAG AGGCAAATAG TGCTTATCAG GAAAAGAATC TAAACGGTAT GCCCGTTGTG GATAATAAGA ACTTTACGAG TGACAAAACA CAGATTACAG CTAAACTGAG TACAGAACAA TTTAATCTCA GTGATGAAGA AGTCGACTAT GTAAACAAGC TGGCAGACTC TTTGCAAACA CCGGTCGGTG CGGTTGGATT TACGGACCTT GGTTCCACAA ACTCACAAAA CGTTATCGTA CAGTTTGATG TGCTGCCAAG CAGAATACTG AATATATACA ATAAAATACA TAACATAAAA GGCATAAACA GCGAAGCAAC AGCGGCAGCA TCACTGTCAA AGTTCAAAAC GTCGCTTAAA AACTCAGGCA TTAGCGTTAA ATTCGGTTAT GAATTCCACG ACGTATTTAA CGGTGTCGCT GTTACGATGC CAGCAAACAT GATTGAAAAA GTCGCTAAGC TACCTGGTGT TTACTCTGTA TCTCCGGACT ATATGATGTA TGCAGCAGAC GACTCGAATG GATATGTTAA TTTTAACGGC GTCGGTATGC AGGAAAGCCG CGAGGTGCTT CGTTCAGCAG AGCTTAACGA TATGGGATTT GACGGCAAGG GCATCAAAGT AGGTGTTCTT GACACCGGTA TTGATTACAA CCACCCAGAT CTGAAAGACA ACTATAGGGG CGGACACGAT TACGTCGGTG GGGCAGTAGC AAAAGTGGAT TCGTCGGGTA ATGTCAGCTT TATTGTTCCG GTCAGTGAAG ACAACGATCC GATGGAAACA ACTTATAAAG ACTGGCAAGC TGCAGCTGCG GCTAATGGAA CTACACTCTG CCCGGAAGTC AGCGCAAAGG GTAGTGAATA TTATACAAGC CACGGTTCGC ATGTCAGCGG CACAGTTGCG GCGGATGGTC AGAACACATC GTCTCCGTTT AATACATTAG GTATCGCTCC GAAGGCAGAC CTGTATGTAT ACAGAGTGTT GGGACCTTAC GGTTCCGGCC CGTCTTCGGG TATTATAGCC GCTATTAACC AGTCTGTTAT TGATGGCATG CAGGTCATTA ACCTGTCGCT TGGTGCTAAT CAAAATACAG CTTACGGCGC GGACGTAATT GCGCTTAATA ACGCATGTAT CGCGGGTACA ATCGTATGTG TATCAGCGGG TAACAACGCC GAGCCAAACA ACACTCCTCC ACGTCTTGGC TTATCGCTTG GTACACCAGG CACGGCTTAC CTTCCAATTA CCGTAGCAGC TTCAAACTAC GGCGGTGGTG CAACAACATT TTATATGGAT GCAGTTGCAA ATTCATCTGC GAACGTGACA GCATCAGTTC CGTTTAATGT AGTTGGTCGA GACGTTACGA ATGTATTCGC GGATAACCAG ATTACAGCGG CAAACCTAAA CTATGTGGAT GGCCTAGGAT ACCAGTATAC ACTTGTTGTC GGACCGATGA CTGGCAAGCT ACCTGGTCCT ACCCAGCTTG TGCAGCTGCA GGCGCTCGCG GACAACAGCC TAAAGGGGCA GATTCTTGTT GTTAAACGAG GTTCTCTGAC CTTTACGGAT CTTCCGCCGC AGGCCAAACG CCTTGGCGCG GGAGCAATCT TAATTATTGA CAAAGATGCA GAGGAAGGCT TCATCTCTGG CATTACCATC GGTGGAGAAA AGATAAATCA CCTACCGGTT 
CTCTCTACAA CACACCAAGC GGGGGACGCG TTAGCTGCTT CCTATAATGC AGCAGTAGTA TCATCTGTAC CGGCTTATAT CCGGATCGGC AATATGTCAA AGGTGCAGCT TGATAAGACA CCAGCAAGCT TTAGCTCAAT TGGTCCTGTT ACCGAAACCG CTGGCATTAA GCCTGATATC ATCGCTCCGG GTGTTGACAT CATGTCGACC CAGCCAGCTT TTATTACGAA TGCTGATCAC AACAGTATCG ACTATTCCTA CGCGTATGCG CGTATGAACG GAACCTCGAT GTCCTGTCCA CATATGACTG GTATCAGTGT TTTGATGCGT CAGGCTTTCC CGAATGCCAG CGTCGCTGAA ATTAAGGCTC GTTTGATGAA CACAGCTGAT CCGACGCTCA TTACTTCTGG TGTTCCGGCT ACGCCACAGG CCAGCGTATT TGAAGTTGGA GCAGGCTTTG TGAATCCGTA CCGTGCTATT GTAACCGACA CCTCGACCTA TGTGACGGTG CAGGATGACA TCCCTGGACA GAAGTCTGGC GAGATCCTCG TGAATCAGAC GCTATCGTCC TTGAGTTTCG GTACTATTAA GCCAAGTTCA ACAACAACCG TTAATTCCAG AATACTACCT GTGACCGTCC ACAATACTTC TGTGAACGAA CAGACTTACT ATATCACCGG TGTTTATAAC GACCATACCG GATATTCGCG TTCGAGCGTG GATGGCGTAA CGATTAGTTG CACATCGACC GATGTTACAG TACCAGCGGG TCAGACCGGA ACATTCGAAG TTTATACAAC GGTTCCTGTG AACTGCCCGA AAGGTTACTA TGAAGGCTAT GTACATGTAA CCTCGAATAG CGGTAACGAT TATGTGCTTC CATTTGCATT TGCTACCGGC GATGCGCCGA AGCCATTTAT AATCGATGAC GCTTGGATTA TTAAGCCTGT CATTACGGCA AACACGAATA CAAACATCCG TTGCACAACC TACTCGAACA CAACACCGTT CGCTTTGGCT TATGAGGGTG ATTATCCGGG CGGTGCAATG GACATTTTGC TACTTGACTT AAATGACACA CCGATTGGCT ACTTTGGTAC CTACTCCGGT ATGGGCAAGG GAGACGGTAC AGCGAAGCTT TACATGAACG CAATCACATA CCAGTGTTAT CCAATCGATA AGGATGGCAA CATCGGAACG AAACAGAACA TTCCAGAAGG AGCGTATCAT ATAGCTTTCG CGGATTCGGA TTACTATTAC CCATTCGGTG GGCTAGTTGT AGACAACCAA AGACCAGTTC TAACGTTTAA CCCGACTCCA AACTATGAGT ATAGCAATGG TGCAACTACA GTTCACATCA CAGGTAGAAT CTGGAGCCAC GCTGGCCAGC TGCTTGTTGA CAATAAGATT ACTAGCGATA ATTTGGCTGG CAGTCCGCTG ATTGGACAGG AACTTAACGG TTTAATCATC GGCAGCACAA TTTATCAATA CTGCGACAAA GACGGTTACT TCGATATTCC GCTGAATCCT TCAACAGCTA GCAAGACCGG TATCCAGACA TCTACCGCTT ACGCACAGGA CTACTATGCG ATAACGTACT ACTCACTGTT AGGCTCAGTA GTCTCCACAA AGACTGGTAA CAACCGCACA AACGGAACGG TTAATTTTAA CTACACGCAG TCTCCAGTAT TGAGCGCTGT CGCAGCAACA AATAGCACTG CTGAAGTGAA ACTCAGCTAC AATCCGAGAC TAATTGCTCC GGTTGCGGAC GACTTTAAGG TACAATATTC CGTTAACGGC GGCGCATTTA 
CCGATCTGAT GACGACATCG TTTAACTATA CCGCAGCAAC CGCGATAGCA AACTTCACAT TCGCTCCGTT CCCAGTTATA TCGGAGGAGC AGAGAATTGT TGTCAGCGTT AGCTACAAGG GTGGCACAGC TGTGGAAGCA GCGGGGGTTG TTGTGCCAGC TGCTAAACTT GTCGGATTAA CAGGTGATAA TGGCTCGTTA ATTGCTACAC TTGATATGGT TCCGACTGAT GGAACAGCAT ATCCGTTCGA GGTATCCTAC ACGATTAACG GTGGAGATAG ACAGCTACTT GGTAACGCAA CCTATCAAAA TGATGGTACG GCAACATTTA ACTTTACGCC GTTCGAGAAA TCTTCAATCG CACAGAATAT TGTAGTTTAT GTCAATTACA AAGACACAGA GCTGAGTTAT TCGTTCGGCC TCACAGTGGG TATGGGCACA CTTGTACTTA ATTACAACCT GTACCAGAAT AAGAGCATAT CAGCGAATGA ATTTACAACT AGCGGCATGA AGTTCTTCCA GACTGGTAAC TTTTTTGAAT GTACGAAGAT CAACGTAGCT GAAGCGAAGA CCGAGTACGG TTACCATGGC TTATTGATGG CAGGCCCACA TAGTGATCAG GTTGGTACAT TCACAGCAAA AATCGATGAG CAAAACAATC TTACGTTTAC TTATTACCTA TATGATGGCT TAAAGACAAC CGATACAGTT ACTTTTGTTT ATTCGTCAAC TTTATTCACC TCGAAGAATC AGGGGCAGTT GTACAAGGAT TATGCGAATA CAATCCAGAT ACAAGAGAAA GGTGCTTCGA CCGGTACGTT CATAGTCAAT AATGTTACCG GTACTGAGTT CTATGTATAC CTGAGAAGTG CGAACTCATC TGGAACCACG AAGGTTCTCG CACCGCCAGA TCCGAATAAG GTCATTACAC TGCAGCTTAC GAACGATCTG ACACCGCAGA CTCTTTCGTT TACCTATGGC TCTACGTTAA CACAGGTCAT TCCAGCTGGC ACCTACATGC TGGAGGGGTA TGGTACAGTA GTTGTAACGC TCGATGGGGT TACGTATGTG GAGTATAATG TGAAGTAA
|
Protein sequence | MKNKSMKRML SLLVSILMIF GSIWLTPGVA SAADIGSQKL QTSEANSAYQ EKNLNGMPVV DNKNFTSDKT QITAKLSTEQ FNLSDEEVDY VNKLADSLQT PVGAVGFTDL GSTNSQNVIV QFDVLPSRIL NIYNKIHNIK GINSEATAAA SLSKFKTSLK NSGISVKFGY EFHDVFNGVA VTMPANMIEK VAKLPGVYSV SPDYMMYAAD DSNGYVNFNG VGMQESREVL RSAELNDMGF DGKGIKVGVL DTGIDYNHPD LKDNYRGGHD YVGGAVAKVD SSGNVSFIVP VSEDNDPMET TYKDWQAAAA ANGTTLCPEV SAKGSEYYTS HGSHVSGTVA ADGQNTSSPF NTLGIAPKAD LYVYRVLGPY GSGPSSGIIA AINQSVIDGM QVINLSLGAN QNTAYGADVI ALNNACIAGT IVCVSAGNNA EPNNTPPRLG LSLGTPGTAY LPITVAASNY GGGATTFYMD AVANSSANVT ASVPFNVVGR DVTNVFADNQ ITAANLNYVD GLGYQYTLVV GPMTGKLPGP TQLVQLQALA DNSLKGQILV VKRGSLTFTD LPPQAKRLGA GAILIIDKDA EEGFISGITI GGEKINHLPV LSTTHQAGDA LAASYNAAVV SSVPAYIRIG NMSKVQLDKT PASFSSIGPV TETAGIKPDI IAPGVDIMST QPAFITNADH NSIDYSYAYA RMNGTSMSCP HMTGISVLMR QAFPNASVAE IKARLMNTAD PTLITSGVPA TPQASVFEVG AGFVNPYRAI VTDTSTYVTV QDDIPGQKSG EILVNQTLSS LSFGTIKPSS TTTVNSRILP VTVHNTSVNE QTYYITGVYN DHTGYSRSSV DGVTISCTST DVTVPAGQTG TFEVYTTVPV NCPKGYYEGY VHVTSNSGND YVLPFAFATG DAPKPFIIDD AWIIKPVITA NTNTNIRCTT YSNTTPFALA YEGDYPGGAM DILLLDLNDT PIGYFGTYSG MGKGDGTAKL YMNAITYQCY PIDKDGNIGT KQNIPEGAYH IAFADSDYYY PFGGLVVDNQ RPVLTFNPTP NYEYSNGATT VHITGRIWSH AGQLLVDNKI TSDNLAGSPL IGQELNGLII GSTIYQYCDK DGYFDIPLNP STASKTGIQT STAYAQDYYA ITYYSLLGSV VSTKTGNNRT NGTVNFNYTQ SPVLSAVAAT NSTAEVKLSY NPRLIAPVAD DFKVQYSVNG GAFTDLMTTS FNYTAATAIA NFTFAPFPVI SEEQRIVVSV SYKGGTAVEA AGVVVPAAKL VGLTGDNGSL IATLDMVPTD GTAYPFEVSY TINGGDRQLL GNATYQNDGT ATFNFTPFEK SSIAQNIVVY VNYKDTELSY SFGLTVGMGT LVLNYNLYQN KSISANEFTT SGMKFFQTGN FFECTKINVA EAKTEYGYHG LLMAGPHSDQ VGTFTAKIDE QNNLTFTYYL YDGLKTTDTV TFVYSSTLFT SKNQGQLYKD YANTIQIQEK GASTGTFIVN NVTGTEFYVY LRSANSSGTT KVLAPPDPNK VITLQLTNDL TPQTLSFTYG STLTQVIPAG TYMLEGYGTV VVTLDGVTYV EYNVK
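The gene and protein sequences can be spot-checked against each other by translating the first and last blocks of the gene sequence. The sketch below is an illustration under stated assumptions, not the tool that produced this record. It builds the codon table from the compact 64-letter layout (bases ordered T, C, A, G), and it assumes that translation table 11 assigns the same amino acids to codons as the standard code, differing only in the permitted start codons. The function name `translate` and the variable names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Drop the spaces that separate the 10-base blocks in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence.
gene_start = "ATGAAGAATA AGAGTATGAA ACGTATGCTG"
gene_end = "GAGTATAATG TGAAGTAA"

print(translate(gene_start))  # MKNKSMKRML
print(translate(gene_end))    # EYNVK
```

Both fragments match the corresponding ends of the protein sequence. The final 18 bases are in frame because the full gene length is a multiple of three.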
|
| |