Gene Information |
Locus tag | EcSMS35_0173 |
Symbol | degP |
ID | 6146103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 191079 |
End bp | 192503 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641615074 |
Product | serine endoprotease |
Protein accession | YP_001742290 |
Protein GI | 170680398 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
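The coordinate and length fields above are mutually consistent if the start and end positions are read as 1-based and inclusive, and if the gene length includes the terminal stop codon. Both readings are assumptions inferred from the numbers in this record, not something the record states. A minimal sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(191079, 192503)
print(length, protein_length(length))
```

With the values from this record, the two results should agree with the Gene Length and Protein Length fields.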
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAAAAAA CCACATTAGC ACTGAGTGCA CTGGCTCTGA GTTTAGGTTT GGCGTTATCT CCGCTCTCTG CAACGGCGGC TGAGACTTCT TCAGCAACGA CAGCCCAGCA GATGCCAAGC CTTGCACCGA TGCTCGAAAA GGTGATGCCT TCAGTGGTCA GCATTAACGT AGAAGGTAGC ACAACCGTTA ATACGCCGCG TATGCCGCGT AATTTCCAGC AGTTCTTCGG TGATGATTCT CCGTTCTGCC AGGAAGGTTC TCCGTTCCAG AGTTCTCCGT TCTGCCAGGG TGGCCAGGGC GGTAATGGTG GCGGCCAGCA ACAGAAATTC ATGGCGCTGG GTTCCGGCGT CATCATTGAT GCCGATAAAG GCTATGTCGT CACCAACAAC CACGTTGTTG ATAACGCGAC GGTGATTAAA GTCCAACTGA GCGATGGCCG TAAGTTCGAC GCGAAGATGG TTGGCAAAGA TCCGCGCTCT GATATCGCGC TGATCCAGAT CCAGAACCCG AAAAACCTGA CCGCAATTAA GATGGCGGAT TCTGATGCAC TGCGCGTGGG TGATTACACC GTAGCGATTG GTAACCCGTT TGGTCTGGGC GAGACGGTAA CTTCCGGGAT TGTCTCTGCG CTGGGGCGTA GCGGCCTGAA TGCCGAAAAC TACGAAAACT TCATCCAGAC CGATGCAGCG ATTAACCGTG GTAACTCCGG TGGTGCGCTG GTTAACCTGA ACGGTGAACT GATCGGTATC AACACCGCGA TCCTCGCACC GGACGGCGGC AACATCGGTA TCGGTTTTGC TATCCCGAGC AACATGGTGA AAAACCTGAC CTCGCAGATG GTGGAATACG GCCAGGTGAA ACGCGGTGAG TTGGGTATTA TGGGCACTGA GCTGAACTCC GATCTGGCGA AAGCGATGAA AGTTGACGCC CAGCGCGGTG CTTTCGTAAG CCAGGTTCTG CCGAATTCTT CCGCCGCGAA AGCGGGCATT AAAGCGGGTG ATGTGATCAC CTCACTGAAC GGTAAGCCAA TCAGCAGCTT TGCCGCACTG CGTGCTCAGG TGGGCACTAT GCCGGTAGGT AGCAAACTGA CCCTGGGCTT ACTGCGCGAC GGGAAGCAGG TTAACGTGAA CCTGGAACTT CAGCAGAGCA GCCAGAATCA GGTTGATTCC AGCACCATCT TCAACGGCAT TGAAGGCGCT GAGATGAGTA ACAAAGGCAA AGATCAGGGC GTGGTGGTGA ACAACGTGAA AACGGGCACT CCGGCTGCGC AGATCGGCCT GAAGAAAGGT GATGTGATTA TTGGCGCGAA CCAGCAGGCA GTGAAAAACA TCGCTGAACT GCGTAAAGTT CTCGACAGCA AACCGTCTGT GCTGGCACTG AACATTCAGC GCGGCGACAG CACCATCTAC CTGTTAATGC AGTAA
|
Protein sequence | MKKTTLALSA LALSLGLALS PLSATAAETS SATTAQQMPS LAPMLEKVMP SVVSINVEGS TTVNTPRMPR NFQQFFGDDS PFCQEGSPFQ SSPFCQGGQG GNGGGQQQKF MALGSGVIID ADKGYVVTNN HVVDNATVIK VQLSDGRKFD AKMVGKDPRS DIALIQIQNP KNLTAIKMAD SDALRVGDYT VAIGNPFGLG ETVTSGIVSA LGRSGLNAEN YENFIQTDAA INRGNSGGAL VNLNGELIGI NTAILAPDGG NIGIGFAIPS NMVKNLTSQM VEYGQVKRGE LGIMGTELNS DLAKAMKVDA QRGAFVSQVL PNSSAAKAGI KAGDVITSLN GKPISSFAAL RAQVGTMPVG SKLTLGLLRD GKQVNVNLEL QQSSQNQVDS STIFNGIEGA EMSNKGKDQG VVVNNVKTGT PAAQIGLKKG DVIIGANQQA VKNIAELRKV LDSKPSVLAL NIQRGDSTIY LLMQ
|
| |
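The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is a self-contained illustration, not the tool used to produce this record. It builds the codon table by hand; translation table 11 assigns the same amino acids to codons as the standard code and differs only in the set of permitted start codons, which does not matter here because the gene begins with ATG. The function name `translate` and the table constant are hypothetical. Spaces are stripped first because the record prints the sequence in blocks of ten.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; same codon
# assignments as translation table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the block spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
print(translate("ATGAAAAAAA CCACATTAGC ACTGAGTGCA"))  # MKKTTLALSA
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence, after removing its block spacing, is one way to check the record for internal consistency.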