Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_3606 |
Symbol | |
ID | 5742630 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | - |
Start bp | 4452740 |
End bp | 4453996 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641294716 |
Product | carboxyl-terminal protease |
Protein accession | YP_001560692 |
Protein GI | 160881724 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000172646 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAATA GATTTGTGAC CGGCGTAGTC TCCGGAGTGA TCTGTACCTT GATTATTTGT ACTCTTTCTT TTGGCATTCT ATATCGTAAT GCTTTAGCAC AAAAAGATAA TGCCTACACG AATGAATCAG CACAGTCAAC AGATAATGGC ACGAAAACAG AAGGAAGTAC TGCAATTGAT GAAGCAACGT TTCAAAAGAA ACTAAAGTAT ATTAAAAACT TAGTTAATAA TTATTATTTA TGGGATGTAA ATGAGGGGGA TTTTCAAACC GGTATGTTAA AGGGGATGAT GAGTGCCCTA AATGACCCAT ATTCTACTTA TTATACGAAA GAGGAATATG ACGCCTTGAT GGAAACTACC AATGGTATTT ACTATGGTAT CGGTGCTACT GTTAGCCAGA ATGTAAATAC TGGTATCATT ACTATAGTAA AGCCATTTGT CAATGGACCT GCAAATAAGG CTGGTGTTCT TCCGGGAGAT ATTCTTTATA AGGTGGAAGA CGAAGAGGTA ACAGGTACTG AACTCACCAA AGTTGTTAGT AAGATGAAAG GTGAAGAAAA TACCATAGTT AAAATCACTG TTATGAGAGA AGGCAAAAGT GAACCAATTG AGATTTCGAT TACAAGAGGT CAGGTAGAGA TTCCAACCAT TGAACATGAG ATGTTAAAAG ATAAAATTGG TTACATTAGC ATTCTAGAAT TTGATAAAAT AACAGTAGAT CAATACATGG CAGCGATTAA TGACTTAGAA AAACAAGGAA TGAAAGGTCT TGTAATTGAC CTTCGTGATA ATCCAGGTGG ATTATATGAT TCTGCAGTTA AGATGCTCGA TCGTATCATA GGGAAAGGGC TATTAGTTTA TACTGAGACT AAGGATGGTA CTCGTTCCGA AGATTATGCG ACTTCAAAAG AAGAATTAAA GGTTCCATTG ACTGTAATCG TAAATGGCAA TAGTGCAAGC GCTTCCGAGA TCTTCGCTGG TGCAATTCAG GATTATAAGA AGGGTACTAT TGTAGGTACG CAGAGTTTTG GAAAAGGTAT TGTACAGTCC CTCTTCCCAT TGTTTGATGG AAGTGCGGTG AAGGTAACGG TATCCAACTA CTTTACTCCA AATGGAAGAA GCATTCATAA AACAGGAATT ACCCCAGATG TGGTAGTAGA GTTAAATGAA GAATTAAAGA AAAAAGTAGT GATTACTCAT GATGAAGATA ATCAGCTTCA GAAAGCTATT GAGGTCTTGA AAAGTCAAAT AAAATAA
|
Protein sequence | MKNRFVTGVV SGVICTLIIC TLSFGILYRN ALAQKDNAYT NESAQSTDNG TKTEGSTAID EATFQKKLKY IKNLVNNYYL WDVNEGDFQT GMLKGMMSAL NDPYSTYYTK EEYDALMETT NGIYYGIGAT VSQNVNTGII TIVKPFVNGP ANKAGVLPGD ILYKVEDEEV TGTELTKVVS KMKGEENTIV KITVMREGKS EPIEISITRG QVEIPTIEHE MLKDKIGYIS ILEFDKITVD QYMAAINDLE KQGMKGLVID LRDNPGGLYD SAVKMLDRII GKGLLVYTET KDGTRSEDYA TSKEELKVPL TVIVNGNSAS ASEIFAGAIQ DYKKGTIVGT QSFGKGIVQS LFPLFDGSAV KVTVSNYFTP NGRSIHKTGI TPDVVVELNE ELKKKVVITH DEDNQLQKAI EVLKSQIK
|
| |