Gene Information |
Locus tag | EcSMS35_4760 |
Symbol | |
ID | 6144043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4857545 |
End bp | 4859524 |
Gene Length | 1980 bp |
Protein Length | 659 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641619573 |
Product | type I restriction-modification system DNA methylase |
Protein accession | YP_001746680 |
Protein GI | 170683208 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | |
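The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function name `check_cds_lengths` is hypothetical and not part of any database API.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the coordinate and length fields of a CDS record.

    Assumes 1-based inclusive coordinates and exactly one stop codon
    at the end of the gene (both are assumptions, not stated in the record).
    """
    span = end_bp - start_bp + 1          # inclusive span on the replicon
    codons = gene_length_bp // 3          # total codons, including the stop
    return {
        "span_matches_gene_length": span == gene_length_bp,
        "length_is_multiple_of_three": gene_length_bp % 3 == 0,
        "protein_matches_codons_minus_stop": codons - 1 == protein_length_aa,
    }


# Values copied from the record for EcSMS35_4760.
result = check_cds_lengths(4857545, 4859524, 1980, 659)
print(result)
```

For this record all three checks hold: 4859524 - 4857545 + 1 = 1980 bp, and 1980 / 3 = 660 codons, which is 659 residues plus one stop codon.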
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.362482 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTTAC AAGACAAAGA ACAAAGCGCC AAACTCTCCA GTGCTATTTG GCGTATGGCA GATGATTTAT GGGGTGATTT CAAACACACC GATTTTGCGC GGATTATCCT GCCATTTTTG CTACTGCGTC GTATTGAATG CGTGTTAGAA CCCACCCGTG AGGAAGTTCG TAAGTTCTAT CTGGCCGAAA AACAATCCGG CATAGACCTT GGTTTAGTGT TGCCAGAAAT TGCGGGATTC GCCTTTTACA ACACCAGTGA ATATAGCCTA GAAACCTTAG GCGCGTCAGA CACCGGCGAC AACCTAGAAC ACTACATCAG CCAGTTCAGC AAAAACGTTC GTACCATCTT TGATGAGTTT AAATTTGGTC AAACCATAGA AGATCTGGAA AAAGCCAAGT TGCTGTATCG CATGGTAAAC CATTTTGCCA ACCTCGACTT GCATCCTGAT GTAGTGTCTG ACCGCGTATT ATCAGATGCT TATGAAGAGC TGATTCTAAA GTTCGCCAGT TCGGTTAATG AAAAGGCCGG GGAATTTATG ACCCCGCGTG ATGCCGTGCG TCTTGCCACC AAACTGGTGC TGGCGGCGGA TGAAGATATC TTTAGTGAAA AGGGTGTGAT CCGCACCATC TATGACCCAA CCTGTGGCAC GGGCGGTTTC CTTTCCGATG CCATTAGCCA AATTGAAGAA ATGGGCAGCA GCGCTAAAGT CGTGCCTTTT GGTCAAGAGC TCGACCCCGC GACCCATGCC ATGGCGCTGA CCAATATGAT GATCCGCGGT TTTGATGCCA ATAACATCAA GCAAGGCAAC ACCTTATCAG ACGATCAGTT ACGTGCCGAC AAGTTTCACT ACGGCCTTGC CAACCCGCCA TTTGGCATTA AGTGGGAAAA AGCCAAGAAA GAAGTAGAAC GTGAGCACAA ACAGCTCAAA TACGCCGGAC GCTTTGGGCC AGGCCTGCCG AGTATTTCTG ATGGTTCAAT GCTGTTTTTA CTGCACTTGG TATCAAAAAT GGAAACGCCG GAAAATGGTG GTGGCCGAGT GGGTATAGTC CTTTCTGGCT CGCCACTGTT TAATGGCGAT GCAGGCTCAG GCCCATCAGA AATCCGCCGT TGGCTGCTAG AACAAGACTT AGTCGAAGCG ATTGTTGCCC TGCCAACCGA TATGTTCTTC AACACCGGCA TTGGCACCTA CATCTGGATT TTGACTAACC ATAAAGAGCC AAGACGCAAG AATCAGGTGC AACTGATTAA CTTGGCAGAT ATTTGGACTC CCATGCGTAA ATCGCAAGGC GATAAGCGCA AGTACCTGAG TGATGAACAG ATTGACGATA TTGTCCGTGC GTACGATGGC TTTGAAGCCA GTGACAACTG TAAGATCTTC CAGACTACTG ATTTTGCCTA CCGTAAAGTC ACCATTCAGC GTCCACTGCG TGCCAAACTA GATATTACCG CTGCGGGCAT TGCTGCATTT GTGCAGCAAG ATACGTTTAA AAAACTCAAA CCAGAGCAAC AAGCGGCATG GGTACAATAC CTCACCGATA ACCTTGGCCT TCAGCCTTAT GAATGGGCAC GTTTGGCGGT TAAGAAGAAT AACAATAAGG GTGACTTTGG TAAGTGCTCA AAAGCACTGG CTACGGCCTT AACTGCGCAC TTTGTAAAAA TTGATCCGCA ATTTGAGCCT GCACTGGATG AAAAAGGCCA AGTGATCGCC GACCCTAAAC TCAAAGACAC CGAAAGCATT CCTTTTGACC GCGACGTTGA AGATTACTTC GCACAAGAGG TGCTGCCACA TGTACCGGAT 
GCCTTTATTG ATCACTCAGT GCGTGATGAA AAAGACGGCG AAGTGGGCAT TGTCGGTTAT GAAATTAACT TTAACCGCTA TTTCTACCAA TACGTGCCAC CACGCGAGTT GAGTGTGATT GATCGTGAGC TAAAAGCATG TGAAGCGCGC ATTCAGGCTC TGCTGAATGA GGTGGCGTAA
|
Protein sequence | MNLQDKEQSA KLSSAIWRMA DDLWGDFKHT DFARIILPFL LLRRIECVLE PTREEVRKFY LAEKQSGIDL GLVLPEIAGF AFYNTSEYSL ETLGASDTGD NLEHYISQFS KNVRTIFDEF KFGQTIEDLE KAKLLYRMVN HFANLDLHPD VVSDRVLSDA YEELILKFAS SVNEKAGEFM TPRDAVRLAT KLVLAADEDI FSEKGVIRTI YDPTCGTGGF LSDAISQIEE MGSSAKVVPF GQELDPATHA MALTNMMIRG FDANNIKQGN TLSDDQLRAD KFHYGLANPP FGIKWEKAKK EVEREHKQLK YAGRFGPGLP SISDGSMLFL LHLVSKMETP ENGGGRVGIV LSGSPLFNGD AGSGPSEIRR WLLEQDLVEA IVALPTDMFF NTGIGTYIWI LTNHKEPRRK NQVQLINLAD IWTPMRKSQG DKRKYLSDEQ IDDIVRAYDG FEASDNCKIF QTTDFAYRKV TIQRPLRAKL DITAAGIAAF VQQDTFKKLK PEQQAAWVQY LTDNLGLQPY EWARLAVKKN NNKGDFGKCS KALATALTAH FVKIDPQFEP ALDEKGQVIA DPKLKDTESI PFDRDVEDYF AQEVLPHVPD AFIDHSVRDE KDGEVGIVGY EINFNRYFYQ YVPPRELSVI DRELKACEAR IQALLNEVA
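The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch only: it uses the amino acid assignments shared by the standard code and translation table 11, ignores the alternative start codons that table 11 permits (this gene starts with ATG, so they do not matter here), and the names `CODON_TABLE` and `translate` are hypothetical helpers, not taken from the record or any library.

```python
# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is removed first, because the record prints the sequence
    in space-separated blocks of ten bases.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":                      # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGAACTTAC AAGACAAAGA ACAAAGCGCC"
print(translate(first_blocks))  # MNLQDKEQSA
```

The output matches the first block of the protein sequence listed in the record. Passing the full gene sequence to the same function would be the natural way to check the remaining residues.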