Gene EcSMS35_4760 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4760 
Symbol 
ID6144043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4857545 
End bp4859524 
Gene Length1980 bp 
Protein Length659 aa 
Translation table11 
GC content48% 
IMG OID641619573 
Producttype I restriction-modification system DNA methylase 
Protein accessionYP_001746680 
Protein GI170683208 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.362482 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTTAC AAGACAAAGA ACAAAGCGCC AAACTCTCCA GTGCTATTTG GCGTATGGCA 
GATGATTTAT GGGGTGATTT CAAACACACC GATTTTGCGC GGATTATCCT GCCATTTTTG
CTACTGCGTC GTATTGAATG CGTGTTAGAA CCCACCCGTG AGGAAGTTCG TAAGTTCTAT
CTGGCCGAAA AACAATCCGG CATAGACCTT GGTTTAGTGT TGCCAGAAAT TGCGGGATTC
GCCTTTTACA ACACCAGTGA ATATAGCCTA GAAACCTTAG GCGCGTCAGA CACCGGCGAC
AACCTAGAAC ACTACATCAG CCAGTTCAGC AAAAACGTTC GTACCATCTT TGATGAGTTT
AAATTTGGTC AAACCATAGA AGATCTGGAA AAAGCCAAGT TGCTGTATCG CATGGTAAAC
CATTTTGCCA ACCTCGACTT GCATCCTGAT GTAGTGTCTG ACCGCGTATT ATCAGATGCT
TATGAAGAGC TGATTCTAAA GTTCGCCAGT TCGGTTAATG AAAAGGCCGG GGAATTTATG
ACCCCGCGTG ATGCCGTGCG TCTTGCCACC AAACTGGTGC TGGCGGCGGA TGAAGATATC
TTTAGTGAAA AGGGTGTGAT CCGCACCATC TATGACCCAA CCTGTGGCAC GGGCGGTTTC
CTTTCCGATG CCATTAGCCA AATTGAAGAA ATGGGCAGCA GCGCTAAAGT CGTGCCTTTT
GGTCAAGAGC TCGACCCCGC GACCCATGCC ATGGCGCTGA CCAATATGAT GATCCGCGGT
TTTGATGCCA ATAACATCAA GCAAGGCAAC ACCTTATCAG ACGATCAGTT ACGTGCCGAC
AAGTTTCACT ACGGCCTTGC CAACCCGCCA TTTGGCATTA AGTGGGAAAA AGCCAAGAAA
GAAGTAGAAC GTGAGCACAA ACAGCTCAAA TACGCCGGAC GCTTTGGGCC AGGCCTGCCG
AGTATTTCTG ATGGTTCAAT GCTGTTTTTA CTGCACTTGG TATCAAAAAT GGAAACGCCG
GAAAATGGTG GTGGCCGAGT GGGTATAGTC CTTTCTGGCT CGCCACTGTT TAATGGCGAT
GCAGGCTCAG GCCCATCAGA AATCCGCCGT TGGCTGCTAG AACAAGACTT AGTCGAAGCG
ATTGTTGCCC TGCCAACCGA TATGTTCTTC AACACCGGCA TTGGCACCTA CATCTGGATT
TTGACTAACC ATAAAGAGCC AAGACGCAAG AATCAGGTGC AACTGATTAA CTTGGCAGAT
ATTTGGACTC CCATGCGTAA ATCGCAAGGC GATAAGCGCA AGTACCTGAG TGATGAACAG
ATTGACGATA TTGTCCGTGC GTACGATGGC TTTGAAGCCA GTGACAACTG TAAGATCTTC
CAGACTACTG ATTTTGCCTA CCGTAAAGTC ACCATTCAGC GTCCACTGCG TGCCAAACTA
GATATTACCG CTGCGGGCAT TGCTGCATTT GTGCAGCAAG ATACGTTTAA AAAACTCAAA
CCAGAGCAAC AAGCGGCATG GGTACAATAC CTCACCGATA ACCTTGGCCT TCAGCCTTAT
GAATGGGCAC GTTTGGCGGT TAAGAAGAAT AACAATAAGG GTGACTTTGG TAAGTGCTCA
AAAGCACTGG CTACGGCCTT AACTGCGCAC TTTGTAAAAA TTGATCCGCA ATTTGAGCCT
GCACTGGATG AAAAAGGCCA AGTGATCGCC GACCCTAAAC TCAAAGACAC CGAAAGCATT
CCTTTTGACC GCGACGTTGA AGATTACTTC GCACAAGAGG TGCTGCCACA TGTACCGGAT
GCCTTTATTG ATCACTCAGT GCGTGATGAA AAAGACGGCG AAGTGGGCAT TGTCGGTTAT
GAAATTAACT TTAACCGCTA TTTCTACCAA TACGTGCCAC CACGCGAGTT GAGTGTGATT
GATCGTGAGC TAAAAGCATG TGAAGCGCGC ATTCAGGCTC TGCTGAATGA GGTGGCGTAA
 
Protein sequence
MNLQDKEQSA KLSSAIWRMA DDLWGDFKHT DFARIILPFL LLRRIECVLE PTREEVRKFY 
LAEKQSGIDL GLVLPEIAGF AFYNTSEYSL ETLGASDTGD NLEHYISQFS KNVRTIFDEF
KFGQTIEDLE KAKLLYRMVN HFANLDLHPD VVSDRVLSDA YEELILKFAS SVNEKAGEFM
TPRDAVRLAT KLVLAADEDI FSEKGVIRTI YDPTCGTGGF LSDAISQIEE MGSSAKVVPF
GQELDPATHA MALTNMMIRG FDANNIKQGN TLSDDQLRAD KFHYGLANPP FGIKWEKAKK
EVEREHKQLK YAGRFGPGLP SISDGSMLFL LHLVSKMETP ENGGGRVGIV LSGSPLFNGD
AGSGPSEIRR WLLEQDLVEA IVALPTDMFF NTGIGTYIWI LTNHKEPRRK NQVQLINLAD
IWTPMRKSQG DKRKYLSDEQ IDDIVRAYDG FEASDNCKIF QTTDFAYRKV TIQRPLRAKL
DITAAGIAAF VQQDTFKKLK PEQQAAWVQY LTDNLGLQPY EWARLAVKKN NNKGDFGKCS
KALATALTAH FVKIDPQFEP ALDEKGQVIA DPKLKDTESI PFDRDVEDYF AQEVLPHVPD
AFIDHSVRDE KDGEVGIVGY EINFNRYFYQ YVPPRELSVI DRELKACEAR IQALLNEVA