Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0171 |
Symbol | degP |
ID | 6967528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 185064 |
End bp | 186488 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643384247 |
Product | serine endoprotease |
Protein accession | YP_002268770 |
Protein GI | 209395925 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAA CCACATTAGC ACTGAGTGCA CTGGCTCTGA GTTTAGGTTT GGCGTTATCT CCGCTCTCTG CAACGGCGGC TGAGACTTCT TCAGCAACGA CAGCCCAGCA GATGCCAAGC CTTGCACCGA TGCTCGAAAA GGTGATGCCT TCAGTGGTCA GCATTAACGT AGAAGGTAGC ACAACCGTTA ATACGCCGCG TATGCCGCGT AATTTCCAGC AGTTCTTCGG TGATGATTCT CCGTTCTGCC AGGAAGGTTC TCCGTTCCAG AGCTCTCCGT TCTGCCAGGG TGGCCAGGGC GGTAATGGTG GCGGCCAGCA ACAGAAATTC ATGGCGCTGG GTTCCGGCGT CATCATTGAT GCCGATAAAG GCTATGTCGT CACCAACAAC CACGTTGTTG ATAACGCGAC GGTCATTAAA GTTCAACTGA GCGATGGCCG TAAGTTCGAC GCGAAGATGG TTGGCAAAGA TCCGCGCTCT GATATCGCGC TGATCCAAAT CCAGAATCCG AAAAACCTGA CCGCAATTAA GATGGCGGAT TCTGATGCAC TGCGCGTGGG TGATTACACC GTAGCGATTG GTAACCCGTT TGGTCTGGGC GAGACGGTAA CTTCCGGGAT TGTCTCTGCG CTGGGGCGTA GCGGCCTGAA TGCCGAAAAC TACGAAAACT TCATCCAGAC CGATGCAGCG ATCAACCGTG GTAACTCCGG TGGTGCGCTG GTTAACCTGA ACGGCGAACT GATCGGTATC AACACCGCGA TCCTAGCACC GGACGGCGGT AACATCGGTA TCGGTTTTGC TATCCCGAGT AACATGGTGA AAAACCTGAC CTCGCAGATG GTGGAATACG GCCAGGTGAA ACGCGGTGAG CTGGGTATTA TGGGGACTGA GCTGAACTCC GAACTGGCGA AAGCGATGAA AGTTGACGCC CAGCGCGGTG CTTTCGTAAG CCAGGTTCTG CCTAATTCCT CCGCTGCAAA AGCGGGCATT AAAGCGGGTG ATGTGATCAC CTCACTGAAC GGTAAGCTGA TCAGCAGCTT TGCCGCACTG CGTGCTCAAG TGGGTACTAT GCCGGTGGGC AGCAAACTGA CCCTGGGCTT GCTGCGCGAC GGTAAGCAGG TCAACGTGAA TCTGGAACTG CAGCAGAGCA GCCAGAATCA GGTTGATTCC AGCTCCATCT TCAACGGCAT TGAAGGCGCT GAGATGAGCA ACAAAGGCAA AGATCAGGGC GTGGTAGTGA ACAACGTGAA AACGGGCACT CCGGCTGCGC AGATCGGCCT GAAGAAAGGT GATGTGATTA TTGGCGCGAA CCAGCAGGCT GTGAAAAACA TCGCTGAACT GCGTAAAGTT CTCGACAGCA AACCGTCTGT GCTGGCACTG AACATTCAGC GCGGCGACAG CACCATCTAC CTGTTAATGC AGTAA
|
Protein sequence | MKKTTLALSA LALSLGLALS PLSATAAETS SATTAQQMPS LAPMLEKVMP SVVSINVEGS TTVNTPRMPR NFQQFFGDDS PFCQEGSPFQ SSPFCQGGQG GNGGGQQQKF MALGSGVIID ADKGYVVTNN HVVDNATVIK VQLSDGRKFD AKMVGKDPRS DIALIQIQNP KNLTAIKMAD SDALRVGDYT VAIGNPFGLG ETVTSGIVSA LGRSGLNAEN YENFIQTDAA INRGNSGGAL VNLNGELIGI NTAILAPDGG NIGIGFAIPS NMVKNLTSQM VEYGQVKRGE LGIMGTELNS ELAKAMKVDA QRGAFVSQVL PNSSAAKAGI KAGDVITSLN GKLISSFAAL RAQVGTMPVG SKLTLGLLRD GKQVNVNLEL QQSSQNQVDS SSIFNGIEGA EMSNKGKDQG VVVNNVKTGT PAAQIGLKKG DVIIGANQQA VKNIAELRKV LDSKPSVLAL NIQRGDSTIY LLMQ
|
| |