Gene Information |
Locus tag | ECH74115_4552 |
Symbol | degS |
ID | 6970871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4219203 |
End bp | 4220270 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643388263 |
Product | serine endoprotease |
Protein accession | YP_002272698 |
Protein GI | 209400305 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02038] periplasmic serine peptidase DegS |
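The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS includes its stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS that includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length_bp(4219203, 4220270)
    print(length, protein_length_aa(length))  # prints "1068 355"
```

Under these assumptions the computed values match the Gene Length (1068 bp) and Protein Length (355 aa) rows.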
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0248117 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 75 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGTGA AGCTCTTACG TTCCGTTGCG ATTGGATTAA TTGTCGGCGC TATTCTGCTG GTTGCCATGC CTTCGCTGCG CAGCCTTAAC CCGCTTTCCA CTCCGCAATT TGACAGTACC GATGAGACGC CTGCCAGTTA TAATCTGGCG GTTCGCCGCG CCGCGCCAGC GGTGGTTAAC GTTTACAACC GTGGTTTGAA CACCAACTCT CACAACCAGC TTGAGATCCG CACCCTGGGA TCCGGTGTAA TCATGGATCA ACGCGGTTAT ATCATCACCA ATAAACACGT CATCAACGAC GCCGATCAGA TCATCGTCGC CTTACAGGAT GGACGTGTAT TTGAAGCATT GCTGGTGGGA TCTGACTCTC TAACCGATCT GGCGGTACTT AAAATTAATG CCACTGGCGG TTTACCTACC ATTCCAATTA ATGCACGTCG CGTACCGCAC ATTGGCGACG TGGTACTGGC GATCGGTAAC CCGTACAACC TCGGGCAGAC CATTACCCAG GGGATTATTA GTGCTACGGG TCGAATCGGT CTGAACCCGA CCGGGCGGCA AAACTTCCTA CAAACCGATG CTTCCATTAA CCACGGTAAC TCTGGCGGCG CGCTGGTGAA CTCGCTGGGC GAACTGATGG GCATTAACAC GCTGTCGTTT GATAAGAGTA ACGATGGCGA AACGCCGGAA GGTATCGGCT TTGCGATTCC TTTCCAGTTA GCAACCAAAA TTATGGATAA GCTGATCCGC GATGGTCGCG TGATCCGCGG CTACATTGGT ATCGGCGGAC GCGAGATCGC ACCACTGCAC GCGCAGGGCG GTGGTATAGA TCAACTGCAA GGGATCGTGG TTAATGAAGT GTCACCTGAC GGCCCGGCGG CGAATGCGGG TATTCAGGTC AACGATCTGA TTATTTCGGT GGATAACAAA CCGGCCATCT CTGCTCTGGA GACGATGGAT CAGGTGGCAG AAATTCGCCC TGGTTCGGTG ATCCCGGTTG TAGTGATGCG TGATGATAAG CAGTTAACGC TGCAGGTCAC CATTCAGGAA TATCCGGCAA CCAATTAA
|
Protein sequence | MFVKLLRSVA IGLIVGAILL VAMPSLRSLN PLSTPQFDST DETPASYNLA VRRAAPAVVN VYNRGLNTNS HNQLEIRTLG SGVIMDQRGY IITNKHVIND ADQIIVALQD GRVFEALLVG SDSLTDLAVL KINATGGLPT IPINARRVPH IGDVVLAIGN PYNLGQTITQ GIISATGRIG LNPTGRQNFL QTDASINHGN SGGALVNSLG ELMGINTLSF DKSNDGETPE GIGFAIPFQL ATKIMDKLIR DGRVIRGYIG IGGREIAPLH AQGGGIDQLQ GIVVNEVSPD GPAANAGIQV NDLIISVDNK PAISALETMD QVAEIRPGSV IPVVVMRDDK QLTLQVTIQE YPATN
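The sequences above are shown in space-separated blocks of ten characters. A minimal sketch of how they might be cleaned up and checked follows: it strips the spacing, computes GC content, and translates the gene with the codon assignments of translation table 11. Table 11 uses the same codon-to-amino-acid mapping as the standard code and differs only in its permitted start codons, which this sketch does not model. All function names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); same residues as table 11.
_BASES = "TCAG"
_AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 21 bases of the gene sequence in this record.
    print(translate("ATGTTTGTGA AGCTCTTACG T"))  # prints "MFVKLLR"
```

Applied to the full Gene sequence field, `translate` should reproduce the Protein sequence field once its spacing is removed, and `gc_fraction` can be compared with the GC content row.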
|
| |