Gene ECH74115_5859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5859 
Symbol 
ID6967589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5511477 
End bp5513231 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content42% 
IMG OID643389478 
Productputative type I restriction-modification system, S subunit 
Protein accessionYP_002273870 
Protein GI209396465 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGTGG AAAAGCTGAT CGTTGATCAT ATGGAAACCT GGACCTCGGC GTTGCAAACC 
CGTTCCACCG CCGGGCGCGG CAGTTCCGGA AAAATTGATT TATATGGCAT TAAGAAATTA
CGTGAGCTGA TTCTGGAACT GGCTGTGCGC GGTAAACTGG TGCCGCAGGA CCCGAACGAT
GAACCGGCGT CGGAGCTGCT GAAGCGTATT GCGGCAGAAA AAGCAGAGCT GGTGAAGCAA
GGTAAAATTA AAAAGCAAAA ACCACTGCCG GAAATTAGTG AGGAAGAGAA GCCGTTTGAA
TTGCCGGAGG GATGGGAGTG GGTGCGTATT AGTGAAATTG GACATGACTG GGGACAAAAA
ACTCCAGATA AAGATTTTAC TTACATTGAT GTTGGTTCAA TAAATAAAGA ATATGGAATT
ATTGAAGAGC TATCTATTCT TTCAGCGAAA GATGCCCCAT CGCGAGCGAG AAAAATTGTT
CAGCAAGGGA CTATCATATA CTCTACTGTT CGCCCGTATT TGTTAAATAT AGCTATTATT
GAAAACGAAA TACTACCTGA ACCGATCGCT AGTACTGCCT TCGCTATTAT TCATCCATAT
ACGGCGATGG ATGCTAATTT CATTTATTAT TATCTACGTT CTCCGGTATT TGTTTGTTAT
GTTGAAAATT GTCAGACAGG GGTTGCCTAT CCAGCAATCA ATGACAAACA ATTTTTTTCT
GGAATAACTC CAGTTCCTCC ATCCTTGGAA CAAGTTCGCA TCGCAAACAA AATCAAAGAA
TTAATGTCCC TCTGCGACCA ACTGGAACAG CAATCCCTGA CCAGTCTGGA CGCACATCAA
CAACTGGTTG AAACCCTGTT GGGAACGCTT ACAGACAGCC AAACCGCCGA GGAACTGGCT
GAAAACTGGG CGCGTATTAG CGAGTATTTC GACACTCTAT TTACCACCGA AGCCAGCGTG
GATGCGTTAA AACAGACCAT TCTGCAACTG GCCGTAATGG GCAAACTTGT GCCGCAGGAT
CCGAATGATG AACCAGCCTC TGAACTGCTC AAACGAATTG CGCAGGAAAA AGCTCAACTG
GTGAAAGAAG GAAAAATAAA AAAACAAAAA CCGTTGCCGC CAATTAGCGA TGAGGAAAAA
CCGTTTGAAC TTCCGGAAGG GTGGGAGTGG TGTTTATTTG AAGATATTAT TGATATTCAA
AGTGGTATCA CTAAAGGAAG AAATTTATCA AATAGAACTT TGGTAAAAGT TCCTTATTTA
CGTGTTGCAA ACGTCCAACG CGGATATCTT GATCTTACGG AAATTAAACA GATTGAAATC
CCTATTGAGG AAAAAGAAAA ATATCAAGTA GTCAAGGGAG ATTTATTGAT AACAGAAGGC
GGCGACTGGG ATACAGTCGG GAGAACTACA GTATGGTGTC ATGACTGGTA TATAGCAAAT
CAAAACCATG TATTCAAAGG ACGAAATATA GGGCAAGATG TTGATCCATA TTGGTTAGAA
ACATATATGA ATAGCCCATT CTCAAGACAA TATTTTGCTA ACGCAAGTAA GCAAACCACT
AATTTAGCTT CTATTAATAA AACCCAGCTC AGAGGTTGTC CTGTTGCTAT TCCTCCTAGC
TCAGAAGCAA AAAAAATAAT GAGTAAACTA CATATTTTTT ATAAACTATG TGAAGAATTA
AAGAATCATA TCCAATCCGC CCAGCAAACC CAACTACACC TTGCAGATGC ACTCACTGAC
GCGGCGGTAA ACTAA
 
Protein sequence
MSVEKLIVDH METWTSALQT RSTAGRGSSG KIDLYGIKKL RELILELAVR GKLVPQDPND 
EPASELLKRI AAEKAELVKQ GKIKKQKPLP EISEEEKPFE LPEGWEWVRI SEIGHDWGQK
TPDKDFTYID VGSINKEYGI IEELSILSAK DAPSRARKIV QQGTIIYSTV RPYLLNIAII
ENEILPEPIA STAFAIIHPY TAMDANFIYY YLRSPVFVCY VENCQTGVAY PAINDKQFFS
GITPVPPSLE QVRIANKIKE LMSLCDQLEQ QSLTSLDAHQ QLVETLLGTL TDSQTAEELA
ENWARISEYF DTLFTTEASV DALKQTILQL AVMGKLVPQD PNDEPASELL KRIAQEKAQL
VKEGKIKKQK PLPPISDEEK PFELPEGWEW CLFEDIIDIQ SGITKGRNLS NRTLVKVPYL
RVANVQRGYL DLTEIKQIEI PIEEKEKYQV VKGDLLITEG GDWDTVGRTT VWCHDWYIAN
QNHVFKGRNI GQDVDPYWLE TYMNSPFSRQ YFANASKQTT NLASINKTQL RGCPVAIPPS
SEAKKIMSKL HIFYKLCEEL KNHIQSAQQT QLHLADALTD AAVN