Gene ECH74115_5860 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5860 
Symbol 
ID6967376 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5513231 
End bp5514712 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content50% 
IMG OID643389479 
Producttype I restriction-modification system, M subunit 
Protein accessionYP_002273871 
Protein GI209398363 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.822857 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATACA ACATGTCTAT CAGCTCAGTA ATCAAATCAT TACAAGATAT TATGCGCAAA 
GATGCCGGTG TGGATGGCGA TGCGCAGCGT CTCGGTCAGC TCTCCTGGCT GCTGTTTTTG
AAAATCTTCG ATGCCCAGGA AGAGGCGCTG GAACTGGAGC AGGATAACTA TCAATATCCG
ATCCCACAGC GTTATTTATG GCGCAGTTGG GCCGCCAACG CGCAGGGCAT TACCGGTGAT
TCCCTGCTGG AATTCGTTAA TGATGATCTG TTCCCGGCGT TGAAAAACCT CACTGCGCCT
ATCGATAAAA ACCCACGCGG CTACGTGGTG AAGCAGGCGT TCAGCGATGC CTATAACTAT
ATGAAAAACG GTACGCTACT GCGTCAGGTG ATCAATAAGC TGAACGAAAT TGACTTTACC
AGCGCCAGCG AACGCCATCT GTTTGGTGAT ATTTACGAAC AGATCCTTAA AGATCTGCAA
TCTGCGGGCA ATGCGGGCGA ATTCTATACT CCACGCGCCG TCACTCGCTT TATGGTGGAT
CGCGTTGATC CGAAACTCGG CGAATCCATT ATGGACCCGG CCTGCGGTAC GGGCGGTTTT
CTTGCCTGCG CATTTGATCA TGTAAAGAAC AAATACGTGA AGAGCGTCGC CGATCATCAG
ACGCTGCAAC AACAGATCCA CGGTGTTGAG AAAAAACAGC TTCCGCACCT GCTGGCGACC
ACCAATATGC TGCTACACGG CATTGAAGTG CCAGTGCAAA TTCGTCACGA CAACACTCTG
AACAAACCGC TTTCCTCCCG GGATGAGCAA CTGGATGTCA TTGTTACCAA CCCGCCGTTT
GGTGGCACGG AAGAAGACGG TATTGAGAAG AACTTTCCGG CAGAGATGCA AACCCGCGAA
ACGGCGGATT TGTTCCTGCA ACTGATTGTG GAAGTACTGG CGAAAAATGG TCGCGCGGCG
GTGGTATTGC CGGATGGCAC ACTATTTGGC GAAGGCGTTA AAACCAAAAT CAAAAAGCTA
CTTACCGAAG AGTGCAACCT GCATACCATC GTGCGCTTAC CGAATGGTGT GTTTAACCCC
TATACCGGCA TTAAAACCAA CCTGTTGTTC TTTACCAAAG GTCAGCCAAC CAAAGAGATT
TGGTTCTATG AGCATCCGTA TCCGGCGGGC GTAAAAAACT ACAGCAAAAC TAAGCCGATG
AAGTTTGAAG AGTTTCAGGC GGAGATCGAC TGGTGGGGTA ACGAGGCCGA TGGTTTTGCC
AGCCGCGTAG AGAATGAGCA GGCGTGGAAA GTCAGCATTG ATGACGTGAT TGCGCGTAAC
TTCAATCTGG ATATTAAAAA CCCACATCAG GCGGAAACCG TCAGCCATGA TCCGGCCGAG
CTGTTAGCGC AGTATGCAAA ACAGCAGGCG GAGATCCAGA CGCTGCGTAA TCAACTGCGC
GATATTCTTG GCGCTGCGCT GTCTGGCAAG GAGGTTAACT AA
 
Protein sequence
MEYNMSISSV IKSLQDIMRK DAGVDGDAQR LGQLSWLLFL KIFDAQEEAL ELEQDNYQYP 
IPQRYLWRSW AANAQGITGD SLLEFVNDDL FPALKNLTAP IDKNPRGYVV KQAFSDAYNY
MKNGTLLRQV INKLNEIDFT SASERHLFGD IYEQILKDLQ SAGNAGEFYT PRAVTRFMVD
RVDPKLGESI MDPACGTGGF LACAFDHVKN KYVKSVADHQ TLQQQIHGVE KKQLPHLLAT
TNMLLHGIEV PVQIRHDNTL NKPLSSRDEQ LDVIVTNPPF GGTEEDGIEK NFPAEMQTRE
TADLFLQLIV EVLAKNGRAA VVLPDGTLFG EGVKTKIKKL LTEECNLHTI VRLPNGVFNP
YTGIKTNLLF FTKGQPTKEI WFYEHPYPAG VKNYSKTKPM KFEEFQAEID WWGNEADGFA
SRVENEQAWK VSIDDVIARN FNLDIKNPHQ AETVSHDPAE LLAQYAKQQA EIQTLRNQLR
DILGAALSGK EVN