Gene Cphy_1747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_1747 
Symbol 
ID5741421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp2151434 
End bp2156131 
Gene Length4698 bp 
Protein Length1565 aa 
Translation table11 
GC content46% 
IMG OID641292847 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001558858 
Protein GI160879890 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0213398 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAATA AGAGTATGAA ACGTATGCTG AGCCTGTTAG TCAGCATACT GATGATCTTC 
GGATCAATCT GGTTGACGCC GGGAGTGGCA AGTGCGGCGG ATATTGGTTC TCAGAAGCTG
CAGACCTCAG AGGCAAATAG TGCTTATCAG GAAAAGAATC TAAACGGTAT GCCCGTTGTG
GATAATAAGA ACTTTACGAG TGACAAAACA CAGATTACAG CTAAACTGAG TACAGAACAA
TTTAATCTCA GTGATGAAGA AGTCGACTAT GTAAACAAGC TGGCAGACTC TTTGCAAACA
CCGGTCGGTG CGGTTGGATT TACGGACCTT GGTTCCACAA ACTCACAAAA CGTTATCGTA
CAGTTTGATG TGCTGCCAAG CAGAATACTG AATATATACA ATAAAATACA TAACATAAAA
GGCATAAACA GCGAAGCAAC AGCGGCAGCA TCACTGTCAA AGTTCAAAAC GTCGCTTAAA
AACTCAGGCA TTAGCGTTAA ATTCGGTTAT GAATTCCACG ACGTATTTAA CGGTGTCGCT
GTTACGATGC CAGCAAACAT GATTGAAAAA GTCGCTAAGC TACCTGGTGT TTACTCTGTA
TCTCCGGACT ATATGATGTA TGCAGCAGAC GACTCGAATG GATATGTTAA TTTTAACGGC
GTCGGTATGC AGGAAAGCCG CGAGGTGCTT CGTTCAGCAG AGCTTAACGA TATGGGATTT
GACGGCAAGG GCATCAAAGT AGGTGTTCTT GACACCGGTA TTGATTACAA CCACCCAGAT
CTGAAAGACA ACTATAGGGG CGGACACGAT TACGTCGGTG GGGCAGTAGC AAAAGTGGAT
TCGTCGGGTA ATGTCAGCTT TATTGTTCCG GTCAGTGAAG ACAACGATCC GATGGAAACA
ACTTATAAAG ACTGGCAAGC TGCAGCTGCG GCTAATGGAA CTACACTCTG CCCGGAAGTC
AGCGCAAAGG GTAGTGAATA TTATACAAGC CACGGTTCGC ATGTCAGCGG CACAGTTGCG
GCGGATGGTC AGAACACATC GTCTCCGTTT AATACATTAG GTATCGCTCC GAAGGCAGAC
CTGTATGTAT ACAGAGTGTT GGGACCTTAC GGTTCCGGCC CGTCTTCGGG TATTATAGCC
GCTATTAACC AGTCTGTTAT TGATGGCATG CAGGTCATTA ACCTGTCGCT TGGTGCTAAT
CAAAATACAG CTTACGGCGC GGACGTAATT GCGCTTAATA ACGCATGTAT CGCGGGTACA
ATCGTATGTG TATCAGCGGG TAACAACGCC GAGCCAAACA ACACTCCTCC ACGTCTTGGC
TTATCGCTTG GTACACCAGG CACGGCTTAC CTTCCAATTA CCGTAGCAGC TTCAAACTAC
GGCGGTGGTG CAACAACATT TTATATGGAT GCAGTTGCAA ATTCATCTGC GAACGTGACA
GCATCAGTTC CGTTTAATGT AGTTGGTCGA GACGTTACGA ATGTATTCGC GGATAACCAG
ATTACAGCGG CAAACCTAAA CTATGTGGAT GGCCTAGGAT ACCAGTATAC ACTTGTTGTC
GGACCGATGA CTGGCAAGCT ACCTGGTCCT ACCCAGCTTG TGCAGCTGCA GGCGCTCGCG
GACAACAGCC TAAAGGGGCA GATTCTTGTT GTTAAACGAG GTTCTCTGAC CTTTACGGAT
CTTCCGCCGC AGGCCAAACG CCTTGGCGCG GGAGCAATCT TAATTATTGA CAAAGATGCA
GAGGAAGGCT TCATCTCTGG CATTACCATC GGTGGAGAAA AGATAAATCA CCTACCGGTT
CTCTCTACAA CACACCAAGC GGGGGACGCG TTAGCTGCTT CCTATAATGC AGCAGTAGTA
TCATCTGTAC CGGCTTATAT CCGGATCGGC AATATGTCAA AGGTGCAGCT TGATAAGACA
CCAGCAAGCT TTAGCTCAAT TGGTCCTGTT ACCGAAACCG CTGGCATTAA GCCTGATATC
ATCGCTCCGG GTGTTGACAT CATGTCGACC CAGCCAGCTT TTATTACGAA TGCTGATCAC
AACAGTATCG ACTATTCCTA CGCGTATGCG CGTATGAACG GAACCTCGAT GTCCTGTCCA
CATATGACTG GTATCAGTGT TTTGATGCGT CAGGCTTTCC CGAATGCCAG CGTCGCTGAA
ATTAAGGCTC GTTTGATGAA CACAGCTGAT CCGACGCTCA TTACTTCTGG TGTTCCGGCT
ACGCCACAGG CCAGCGTATT TGAAGTTGGA GCAGGCTTTG TGAATCCGTA CCGTGCTATT
GTAACCGACA CCTCGACCTA TGTGACGGTG CAGGATGACA TCCCTGGACA GAAGTCTGGC
GAGATCCTCG TGAATCAGAC GCTATCGTCC TTGAGTTTCG GTACTATTAA GCCAAGTTCA
ACAACAACCG TTAATTCCAG AATACTACCT GTGACCGTCC ACAATACTTC TGTGAACGAA
CAGACTTACT ATATCACCGG TGTTTATAAC GACCATACCG GATATTCGCG TTCGAGCGTG
GATGGCGTAA CGATTAGTTG CACATCGACC GATGTTACAG TACCAGCGGG TCAGACCGGA
ACATTCGAAG TTTATACAAC GGTTCCTGTG AACTGCCCGA AAGGTTACTA TGAAGGCTAT
GTACATGTAA CCTCGAATAG CGGTAACGAT TATGTGCTTC CATTTGCATT TGCTACCGGC
GATGCGCCGA AGCCATTTAT AATCGATGAC GCTTGGATTA TTAAGCCTGT CATTACGGCA
AACACGAATA CAAACATCCG TTGCACAACC TACTCGAACA CAACACCGTT CGCTTTGGCT
TATGAGGGTG ATTATCCGGG CGGTGCAATG GACATTTTGC TACTTGACTT AAATGACACA
CCGATTGGCT ACTTTGGTAC CTACTCCGGT ATGGGCAAGG GAGACGGTAC AGCGAAGCTT
TACATGAACG CAATCACATA CCAGTGTTAT CCAATCGATA AGGATGGCAA CATCGGAACG
AAACAGAACA TTCCAGAAGG AGCGTATCAT ATAGCTTTCG CGGATTCGGA TTACTATTAC
CCATTCGGTG GGCTAGTTGT AGACAACCAA AGACCAGTTC TAACGTTTAA CCCGACTCCA
AACTATGAGT ATAGCAATGG TGCAACTACA GTTCACATCA CAGGTAGAAT CTGGAGCCAC
GCTGGCCAGC TGCTTGTTGA CAATAAGATT ACTAGCGATA ATTTGGCTGG CAGTCCGCTG
ATTGGACAGG AACTTAACGG TTTAATCATC GGCAGCACAA TTTATCAATA CTGCGACAAA
GACGGTTACT TCGATATTCC GCTGAATCCT TCAACAGCTA GCAAGACCGG TATCCAGACA
TCTACCGCTT ACGCACAGGA CTACTATGCG ATAACGTACT ACTCACTGTT AGGCTCAGTA
GTCTCCACAA AGACTGGTAA CAACCGCACA AACGGAACGG TTAATTTTAA CTACACGCAG
TCTCCAGTAT TGAGCGCTGT CGCAGCAACA AATAGCACTG CTGAAGTGAA ACTCAGCTAC
AATCCGAGAC TAATTGCTCC GGTTGCGGAC GACTTTAAGG TACAATATTC CGTTAACGGC
GGCGCATTTA CCGATCTGAT GACGACATCG TTTAACTATA CCGCAGCAAC CGCGATAGCA
AACTTCACAT TCGCTCCGTT CCCAGTTATA TCGGAGGAGC AGAGAATTGT TGTCAGCGTT
AGCTACAAGG GTGGCACAGC TGTGGAAGCA GCGGGGGTTG TTGTGCCAGC TGCTAAACTT
GTCGGATTAA CAGGTGATAA TGGCTCGTTA ATTGCTACAC TTGATATGGT TCCGACTGAT
GGAACAGCAT ATCCGTTCGA GGTATCCTAC ACGATTAACG GTGGAGATAG ACAGCTACTT
GGTAACGCAA CCTATCAAAA TGATGGTACG GCAACATTTA ACTTTACGCC GTTCGAGAAA
TCTTCAATCG CACAGAATAT TGTAGTTTAT GTCAATTACA AAGACACAGA GCTGAGTTAT
TCGTTCGGCC TCACAGTGGG TATGGGCACA CTTGTACTTA ATTACAACCT GTACCAGAAT
AAGAGCATAT CAGCGAATGA ATTTACAACT AGCGGCATGA AGTTCTTCCA GACTGGTAAC
TTTTTTGAAT GTACGAAGAT CAACGTAGCT GAAGCGAAGA CCGAGTACGG TTACCATGGC
TTATTGATGG CAGGCCCACA TAGTGATCAG GTTGGTACAT TCACAGCAAA AATCGATGAG
CAAAACAATC TTACGTTTAC TTATTACCTA TATGATGGCT TAAAGACAAC CGATACAGTT
ACTTTTGTTT ATTCGTCAAC TTTATTCACC TCGAAGAATC AGGGGCAGTT GTACAAGGAT
TATGCGAATA CAATCCAGAT ACAAGAGAAA GGTGCTTCGA CCGGTACGTT CATAGTCAAT
AATGTTACCG GTACTGAGTT CTATGTATAC CTGAGAAGTG CGAACTCATC TGGAACCACG
AAGGTTCTCG CACCGCCAGA TCCGAATAAG GTCATTACAC TGCAGCTTAC GAACGATCTG
ACACCGCAGA CTCTTTCGTT TACCTATGGC TCTACGTTAA CACAGGTCAT TCCAGCTGGC
ACCTACATGC TGGAGGGGTA TGGTACAGTA GTTGTAACGC TCGATGGGGT TACGTATGTG
GAGTATAATG TGAAGTAA
 
Protein sequence
MKNKSMKRML SLLVSILMIF GSIWLTPGVA SAADIGSQKL QTSEANSAYQ EKNLNGMPVV 
DNKNFTSDKT QITAKLSTEQ FNLSDEEVDY VNKLADSLQT PVGAVGFTDL GSTNSQNVIV
QFDVLPSRIL NIYNKIHNIK GINSEATAAA SLSKFKTSLK NSGISVKFGY EFHDVFNGVA
VTMPANMIEK VAKLPGVYSV SPDYMMYAAD DSNGYVNFNG VGMQESREVL RSAELNDMGF
DGKGIKVGVL DTGIDYNHPD LKDNYRGGHD YVGGAVAKVD SSGNVSFIVP VSEDNDPMET
TYKDWQAAAA ANGTTLCPEV SAKGSEYYTS HGSHVSGTVA ADGQNTSSPF NTLGIAPKAD
LYVYRVLGPY GSGPSSGIIA AINQSVIDGM QVINLSLGAN QNTAYGADVI ALNNACIAGT
IVCVSAGNNA EPNNTPPRLG LSLGTPGTAY LPITVAASNY GGGATTFYMD AVANSSANVT
ASVPFNVVGR DVTNVFADNQ ITAANLNYVD GLGYQYTLVV GPMTGKLPGP TQLVQLQALA
DNSLKGQILV VKRGSLTFTD LPPQAKRLGA GAILIIDKDA EEGFISGITI GGEKINHLPV
LSTTHQAGDA LAASYNAAVV SSVPAYIRIG NMSKVQLDKT PASFSSIGPV TETAGIKPDI
IAPGVDIMST QPAFITNADH NSIDYSYAYA RMNGTSMSCP HMTGISVLMR QAFPNASVAE
IKARLMNTAD PTLITSGVPA TPQASVFEVG AGFVNPYRAI VTDTSTYVTV QDDIPGQKSG
EILVNQTLSS LSFGTIKPSS TTTVNSRILP VTVHNTSVNE QTYYITGVYN DHTGYSRSSV
DGVTISCTST DVTVPAGQTG TFEVYTTVPV NCPKGYYEGY VHVTSNSGND YVLPFAFATG
DAPKPFIIDD AWIIKPVITA NTNTNIRCTT YSNTTPFALA YEGDYPGGAM DILLLDLNDT
PIGYFGTYSG MGKGDGTAKL YMNAITYQCY PIDKDGNIGT KQNIPEGAYH IAFADSDYYY
PFGGLVVDNQ RPVLTFNPTP NYEYSNGATT VHITGRIWSH AGQLLVDNKI TSDNLAGSPL
IGQELNGLII GSTIYQYCDK DGYFDIPLNP STASKTGIQT STAYAQDYYA ITYYSLLGSV
VSTKTGNNRT NGTVNFNYTQ SPVLSAVAAT NSTAEVKLSY NPRLIAPVAD DFKVQYSVNG
GAFTDLMTTS FNYTAATAIA NFTFAPFPVI SEEQRIVVSV SYKGGTAVEA AGVVVPAAKL
VGLTGDNGSL IATLDMVPTD GTAYPFEVSY TINGGDRQLL GNATYQNDGT ATFNFTPFEK
SSIAQNIVVY VNYKDTELSY SFGLTVGMGT LVLNYNLYQN KSISANEFTT SGMKFFQTGN
FFECTKINVA EAKTEYGYHG LLMAGPHSDQ VGTFTAKIDE QNNLTFTYYL YDGLKTTDTV
TFVYSSTLFT SKNQGQLYKD YANTIQIQEK GASTGTFIVN NVTGTEFYVY LRSANSSGTT
KVLAPPDPNK VITLQLTNDL TPQTLSFTYG STLTQVIPAG TYMLEGYGTV VVTLDGVTYV
EYNVK