Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4232 |
Symbol | |
ID | 4073158 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 5016842 |
End bp | 5018107 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637986263 |
Product | carboxyl-terminal protease |
Protein accession | YP_593306 |
Protein GI | 94971258 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGAC TCCAAAGGAG TCTTTGCCCG CCAAAGCAGC CCATGTCAAA GATCACGAAG TTCGTTCTGT TAAGCAGCTC TCTCCTTGTC GTTCTGTTTG TCGTATACGG AAGCCTCGGC GTTCGCGCCG ATTCCAAGAA CGATGGCGCG TACCGCCAGC TCGGCGTGTA CAGCGAGGTA CTCTCGCGCA TCCGTACAGA GTACGTCGTC GATCCGGACA TGAACCTCGT GACCGACGGA GCGCTACATG GCCTGCTCGA GTCGCTCGAC GCGAACTCCA GCTACCTGAG CCCGACGGAG TACAAGCAGT ACCAGCAGCG TAAGTCCGAG GGCAAAGCGG GCATCGGTGC TGCGATCTCT AAGCGGTACG GTTACGCCGC GGTGGTCTCA GTCATTCCAG GTGGCCCCGC CGACAAAGCC CAGGTGGAAA GCGGCGACAT TATTGAAGCA ATCGAGGGCA AAACGACTCG CGAGATGTCG CTGGCTGAGA TTGATGGCAT CCTCGCTGGA CAGCCCGGCT CGGTCATTAA TTTGAGCATT GTGCGCCCGC GCAAGGCGCA GCCACAGAAG ACGCCGATCA CGCGAGAGGT AGTCACTACC CCGCCAGTCG CCGAGAAGCT GATGGAAGAC AGCATCGGTT ATATCAAGGT CATTACCTTT ACCAAGGGCC GCACGCAACA GGTAGCCGAA CAGGTGAAGG CCGCGCAGAA ACAGGGCGCG AAGAAACTTA TCCTCGACCT GCGGAATTGC GGAGCCGGCG AGGAACAGGA AGGCGTCGCT ACGGCGAATC TCTTCCTGAA CCACGGCATG ATCGCCTACC TGCAAGGCCA GAAATTCGCC AAGCAGACGT TCACAGCTGA AGCCTCTAAG GCCATTACAA ATCTCCCGCT GGTCGTGCTG GTCAACAAAG GTACGGCTGG TCCAGCCGAG ATCGTCGCGG CGTCGGTATT GGAGAATGCT CGCGGTGATG TTCTTGGCGA CAAGACATTC GGTGATGGCG CAGTCCAGCA GCTCTTCCCA ATGAGCGATG GTTCTGCCCT CATGCTCTCG ATCGCGAAGT ACTACTCGCC GAGCGGCAAA GCTATCCAGG ACACGGCAGT CACGCCCAAC ATCCTGGTTG CCGACAACGA CGACTACGCT TCTCCGGATG ATGGCGACGA TACCACTGAC AATGCCAACC AGCCGGAAAC GCGCCAGAAG GACCAGACCG ACGAGCAGCT TCGCCGCGCG GTCGAGGTTC TGAAGAATAA AGACCAGCAC AGCTAG
|
Protein sequence | MSGLQRSLCP PKQPMSKITK FVLLSSSLLV VLFVVYGSLG VRADSKNDGA YRQLGVYSEV LSRIRTEYVV DPDMNLVTDG ALHGLLESLD ANSSYLSPTE YKQYQQRKSE GKAGIGAAIS KRYGYAAVVS VIPGGPADKA QVESGDIIEA IEGKTTREMS LAEIDGILAG QPGSVINLSI VRPRKAQPQK TPITREVVTT PPVAEKLMED SIGYIKVITF TKGRTQQVAE QVKAAQKQGA KKLILDLRNC GAGEEQEGVA TANLFLNHGM IAYLQGQKFA KQTFTAEASK AITNLPLVVL VNKGTAGPAE IVAASVLENA RGDVLGDKTF GDGAVQQLFP MSDGSALMLS IAKYYSPSGK AIQDTAVTPN ILVADNDDYA SPDDGDDTTD NANQPETRQK DQTDEQLRRA VEVLKNKDQH S
|
| |