Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_0166 |
Symbol | degP |
ID | 5587049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 182227 |
End bp | 183651 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640923895 |
Product | serine endoprotease |
Protein accession | YP_001461332 |
Protein GI | 157155195 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA CCACATTAGC ACTGAGTGCA CTGGCTCTGA GTTTAGGTTT GGCGTTATCT CCGCTCTCTG CAACGGCGGC TGAGACTTCT TCAGCAACGA CAGCCCAGCA GATGCCAAGC CTTGCACCGA TGCTCGAAAA GGTGATGCCT TCAGTGGTCA GCATTAACGT AGAAGGTAGC ACAACCGTTA ATACGCCGCG TATGCCGCGT AATTTCCAGC AGTTCTTCGG TGATGATTCT CCGTTCTGCC AGGAAGGTTC ACCGTTCCAG AGCTCTCCGT TCTGCCAGGG GGGCCAGGGC GGTAATGGCG GCGGCCAGCA ACAGAAATTC ATGGCGCTGG GTTCCGGCGT CATCATTGAT GCCGATAAAG GCTATGTCGT CACCAACAAC CACGTTGTTG ATAACGCGAC GGTGATTAAA GTCCAGCTGA GCGATGGTCG TAAGTTCGAC GCGAAGATGG TTGGCAAAGA TCCGCGCTCT GATATCGCGC TGATTCAGAT CCAGAACCCG AAAAACCTGA CCGCAATTAA GATGGCGGAT TCTGATGCAC TGCGCGTGGG TGATTACACC GTAGCGATTG GTAACCCGTT TGGCCTGGGA GAGACGGTAA CTTCCGGGAT TGTCTCTGCG CTGGGGCGTA GCGGCCTGAA TGCCGAAAAC TACGAAAACT TCATCCAGAC CGATGCAGCG ATCAACCGTG GTAACTCCGG TGGTGCGCTG GTTAACCTGA ACGGCGAACT GATCGGTATC AACACCGCGA TCCTCGCACC GGACGGCGGC AACATCGGTA TCGGTTTTGC TATCCCGAGC AACATGGTGA AAAACCTGAC CTCGCAGATG GTGGAATACG GCCAGGTGAA ACGCGGTGAG CTGGGTATTA TGGGGACTGA GCTGAATTCC GAACTGGCGA AAGCGATGAA AGTTGACGCC CAGCGCGGTG CTTTCGTAAG CCAGGTTCTG CCTAATTCCT CCGCTGCAAA AGCGGGCATT AAAGCGGGTG ATGTGATCAC CTCACTGAAC GGTAAGCCAA TCAGCAGCTT TGCCGCACTG CGTGCTCAGG TGGGTACTAT GCCGGTGGGC AGCAAACTGA CCCTGGGCTT GCTGCGCGAC GGTAAGCAGG TCAACGTGAA TCTGGAACTG CAGCAGAGCA GCCAGAATCA GGTTGATTCC AGCTCCATCT TCAACGGCAT TGAAGGTGCT GAGATGAGCA ACAAAGGCAA AGATCAGGGC GTGGTAGTGA ACAACGTGAA AACGGGCACT CCGGCTGCGC AGATCGGCCT GAAGAAAGGT GATGTGATTA TTGGCGCGAA CCAGCAGGCA GTGAAAAACA TCGCTGAACT GCGTAAAGTT CTCGACAGCA AACCGTCTGT GCTGGCACTG AACATTCAGC GCGGCGACAG CTCCATCTAC CTGTTAATGC AGTAA
|
Protein sequence | MKKTTLALSA LALSLGLALS PLSATAAETS SATTAQQMPS LAPMLEKVMP SVVSINVEGS TTVNTPRMPR NFQQFFGDDS PFCQEGSPFQ SSPFCQGGQG GNGGGQQQKF MALGSGVIID ADKGYVVTNN HVVDNATVIK VQLSDGRKFD AKMVGKDPRS DIALIQIQNP KNLTAIKMAD SDALRVGDYT VAIGNPFGLG ETVTSGIVSA LGRSGLNAEN YENFIQTDAA INRGNSGGAL VNLNGELIGI NTAILAPDGG NIGIGFAIPS NMVKNLTSQM VEYGQVKRGE LGIMGTELNS ELAKAMKVDA QRGAFVSQVL PNSSAAKAGI KAGDVITSLN GKPISSFAAL RAQVGTMPVG SKLTLGLLRD GKQVNVNLEL QQSSQNQVDS SSIFNGIEGA EMSNKGKDQG VVVNNVKTGT PAAQIGLKKG DVIIGANQQA VKNIAELRKV LDSKPSVLAL NIQRGDSSIY LLMQ
|
| |