Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5859 |
Symbol | |
ID | 6967589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 5511477 |
End bp | 5513231 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643389478 |
Product | putative type I restriction-modification system, S subunit |
Protein accession | YP_002273870 |
Protein GI | 209396465 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGTGG AAAAGCTGAT CGTTGATCAT ATGGAAACCT GGACCTCGGC GTTGCAAACC CGTTCCACCG CCGGGCGCGG CAGTTCCGGA AAAATTGATT TATATGGCAT TAAGAAATTA CGTGAGCTGA TTCTGGAACT GGCTGTGCGC GGTAAACTGG TGCCGCAGGA CCCGAACGAT GAACCGGCGT CGGAGCTGCT GAAGCGTATT GCGGCAGAAA AAGCAGAGCT GGTGAAGCAA GGTAAAATTA AAAAGCAAAA ACCACTGCCG GAAATTAGTG AGGAAGAGAA GCCGTTTGAA TTGCCGGAGG GATGGGAGTG GGTGCGTATT AGTGAAATTG GACATGACTG GGGACAAAAA ACTCCAGATA AAGATTTTAC TTACATTGAT GTTGGTTCAA TAAATAAAGA ATATGGAATT ATTGAAGAGC TATCTATTCT TTCAGCGAAA GATGCCCCAT CGCGAGCGAG AAAAATTGTT CAGCAAGGGA CTATCATATA CTCTACTGTT CGCCCGTATT TGTTAAATAT AGCTATTATT GAAAACGAAA TACTACCTGA ACCGATCGCT AGTACTGCCT TCGCTATTAT TCATCCATAT ACGGCGATGG ATGCTAATTT CATTTATTAT TATCTACGTT CTCCGGTATT TGTTTGTTAT GTTGAAAATT GTCAGACAGG GGTTGCCTAT CCAGCAATCA ATGACAAACA ATTTTTTTCT GGAATAACTC CAGTTCCTCC ATCCTTGGAA CAAGTTCGCA TCGCAAACAA AATCAAAGAA TTAATGTCCC TCTGCGACCA ACTGGAACAG CAATCCCTGA CCAGTCTGGA CGCACATCAA CAACTGGTTG AAACCCTGTT GGGAACGCTT ACAGACAGCC AAACCGCCGA GGAACTGGCT GAAAACTGGG CGCGTATTAG CGAGTATTTC GACACTCTAT TTACCACCGA AGCCAGCGTG GATGCGTTAA AACAGACCAT TCTGCAACTG GCCGTAATGG GCAAACTTGT GCCGCAGGAT CCGAATGATG AACCAGCCTC TGAACTGCTC AAACGAATTG CGCAGGAAAA AGCTCAACTG GTGAAAGAAG GAAAAATAAA AAAACAAAAA CCGTTGCCGC CAATTAGCGA TGAGGAAAAA CCGTTTGAAC TTCCGGAAGG GTGGGAGTGG TGTTTATTTG AAGATATTAT TGATATTCAA AGTGGTATCA CTAAAGGAAG AAATTTATCA AATAGAACTT TGGTAAAAGT TCCTTATTTA CGTGTTGCAA ACGTCCAACG CGGATATCTT GATCTTACGG AAATTAAACA GATTGAAATC CCTATTGAGG AAAAAGAAAA ATATCAAGTA GTCAAGGGAG ATTTATTGAT AACAGAAGGC GGCGACTGGG ATACAGTCGG GAGAACTACA GTATGGTGTC ATGACTGGTA TATAGCAAAT CAAAACCATG TATTCAAAGG ACGAAATATA GGGCAAGATG TTGATCCATA TTGGTTAGAA ACATATATGA ATAGCCCATT CTCAAGACAA TATTTTGCTA ACGCAAGTAA GCAAACCACT AATTTAGCTT CTATTAATAA AACCCAGCTC AGAGGTTGTC CTGTTGCTAT TCCTCCTAGC TCAGAAGCAA AAAAAATAAT GAGTAAACTA CATATTTTTT ATAAACTATG TGAAGAATTA AAGAATCATA TCCAATCCGC CCAGCAAACC CAACTACACC TTGCAGATGC ACTCACTGAC GCGGCGGTAA ACTAA
|
Protein sequence | MSVEKLIVDH METWTSALQT RSTAGRGSSG KIDLYGIKKL RELILELAVR GKLVPQDPND EPASELLKRI AAEKAELVKQ GKIKKQKPLP EISEEEKPFE LPEGWEWVRI SEIGHDWGQK TPDKDFTYID VGSINKEYGI IEELSILSAK DAPSRARKIV QQGTIIYSTV RPYLLNIAII ENEILPEPIA STAFAIIHPY TAMDANFIYY YLRSPVFVCY VENCQTGVAY PAINDKQFFS GITPVPPSLE QVRIANKIKE LMSLCDQLEQ QSLTSLDAHQ QLVETLLGTL TDSQTAEELA ENWARISEYF DTLFTTEASV DALKQTILQL AVMGKLVPQD PNDEPASELL KRIAQEKAQL VKEGKIKKQK PLPPISDEEK PFELPEGWEW CLFEDIIDIQ SGITKGRNLS NRTLVKVPYL RVANVQRGYL DLTEIKQIEI PIEEKEKYQV VKGDLLITEG GDWDTVGRTT VWCHDWYIAN QNHVFKGRNI GQDVDPYWLE TYMNSPFSRQ YFANASKQTT NLASINKTQL RGCPVAIPPS SEAKKIMSKL HIFYKLCEEL KNHIQSAQQT QLHLADALTD AAVN
|
| |