Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2253 |
Symbol | hsdM |
ID | 6147463 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2271642 |
End bp | 2273213 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641617128 |
Product | type I restriction-modification system, M subunit |
Protein accession | YP_001744301 |
Protein GI | 170682082 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | [TIGR00497] type I restriction system adenine methylase (hsdM) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.475184 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAGTA TTCAACAACG AGCAGAACTG CACCGCCAGA TATGGCAAAT TGCCAATGAT GTCAGAGGTT CTGTTGACGG ATGGGATTTT AAGCAATACG TTCTGGGCGC GCTTTTCTAC CGTTTTATCA GCGAAAATTT TTCCAGCTAT ATTGAAGCCG GTGATGACAG TATCTGTTAT GCGAAACTGG ATGACAGCGT AATTACTGAT GACATTAAAG ACGATGCCAT CAAAACTAAA GGCTACTTCA TCTACCCAAG TCAGCTTTTC TGCAACGTAG CTGCGAAAGC AAATACCAAT GACAGACTGA ATGCAGATTT AAACAGTATC TTCGTTGCTA TCGAAAGTTC TGCTTACGGT TACCCTTCAG AAGCTGACAT CAAAGGTTTG TTTGCTGATT TCGATACCAC CAGTAACCGC CTGGGTAACA CCGTTAAGGA TAAAAATGCC CGCCTGGCTG CGGTTCTGAA AGGGGTTGAA GGGTTAAAAC TTGGTGACTT CAACGAACAT CAGATTGACC TGTTCGGCGA TGCCTATGAG TTCCTGATTT CTAACTATGC GGCGAATGCT GGTAAGTCAG GCGGCGAGTT TTTTACACCG CAGCATGTCT CTAAGCTGAT TGCACAACTG GCTATGCACG GGCAGACCCA CGTTAACAAA ATCTACGACC CAGCCGCAGG TTCCGGTTCG CTGTTGTTAC AGGCGAAAAA GCAGTTTGAT GACCATATCA TCGAAGAAGG CTTTTTTGGT CAGGAGATCA ACCATACGAC CTATAACCTG GCGCGTATGA ACATGTTTTT GCACAACATC AACTACGACA AGTTTGATAT CAAGCTGGGC AATACGCTGA CTGAGCCGCA CTTCAGAGAT GAAAAACCGT TTGATGCCAT CGTTTCTAAC CCGCCGTATT CAGTGAAATG GATTGGCAGC GATGACCCGA CGCTGATTAA CGATGAGCGT TTTGCCCCGG CTGGCGTCCT GGCCCCCAAA TCAAAAGCTG ACTTCGCGTT TGTATTACAT GCGCTGAACT ATCTTTCGGC CAAAGGTCGC GCCGCGATTG TCTGCTTCCC GGGTATTTTT TACCGTGGCG GCGCAGAGCA GAAAATCCGT CAGTATCTGG TCGATAATAA CTATGTCGAA ACCGTGATTT CACTGGCACC GAACCTGTTC TTTGGCACCA CCATTGCCGT AAACATTCTG GTGCTGTCTA AACATAAAAC GGATACCAAA GTTCAGTTTA TTGACGCCAG CGAACTGTTC AAAAAAGAGA CTAACAACAA TATCCTGACC GATGCCCATA TCGAACAGAT TATGCAGGTA TTTGCCAGCA AGGAAGATGT TGCTCATCTG GCGAAATCTG TTGCGTTTGA GACTGTTGTC GCGAATGACT ATAACCTGTC GGTGAGCAGC TATGTGGAAG CGAAAGATAA CCGCGAAATT ATCAATATTG CTGAACTGAA TGCAGAGCTG AAAACCACGG TCAGCAAAAT CGACCAGTTG CGTAAAGATA TTGATGCAAT TGTGGCTGAA ATTGAAGGCT GCGAGGTGCA GGTAATGACT CCAACTTATT GA
|
Protein sequence | MTSIQQRAEL HRQIWQIAND VRGSVDGWDF KQYVLGALFY RFISENFSSY IEAGDDSICY AKLDDSVITD DIKDDAIKTK GYFIYPSQLF CNVAAKANTN DRLNADLNSI FVAIESSAYG YPSEADIKGL FADFDTTSNR LGNTVKDKNA RLAAVLKGVE GLKLGDFNEH QIDLFGDAYE FLISNYAANA GKSGGEFFTP QHVSKLIAQL AMHGQTHVNK IYDPAAGSGS LLLQAKKQFD DHIIEEGFFG QEINHTTYNL ARMNMFLHNI NYDKFDIKLG NTLTEPHFRD EKPFDAIVSN PPYSVKWIGS DDPTLINDER FAPAGVLAPK SKADFAFVLH ALNYLSAKGR AAIVCFPGIF YRGGAEQKIR QYLVDNNYVE TVISLAPNLF FGTTIAVNIL VLSKHKTDTK VQFIDASELF KKETNNNILT DAHIEQIMQV FASKEDVAHL AKSVAFETVV ANDYNLSVSS YVEAKDNREI INIAELNAEL KTTVSKIDQL RKDIDAIVAE IEGCEVQVMT PTY
|
| |