Gene EcSMS35_2253 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2253 
SymbolhsdM 
ID6147463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2271642 
End bp2273213 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content46% 
IMG OID641617128 
Producttype I restriction-modification system, M subunit 
Protein accessionYP_001744301 
Protein GI170682082 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID[TIGR00497] type I restriction system adenine methylase (hsdM) 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.475184 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAGTA TTCAACAACG AGCAGAACTG CACCGCCAGA TATGGCAAAT TGCCAATGAT 
GTCAGAGGTT CTGTTGACGG ATGGGATTTT AAGCAATACG TTCTGGGCGC GCTTTTCTAC
CGTTTTATCA GCGAAAATTT TTCCAGCTAT ATTGAAGCCG GTGATGACAG TATCTGTTAT
GCGAAACTGG ATGACAGCGT AATTACTGAT GACATTAAAG ACGATGCCAT CAAAACTAAA
GGCTACTTCA TCTACCCAAG TCAGCTTTTC TGCAACGTAG CTGCGAAAGC AAATACCAAT
GACAGACTGA ATGCAGATTT AAACAGTATC TTCGTTGCTA TCGAAAGTTC TGCTTACGGT
TACCCTTCAG AAGCTGACAT CAAAGGTTTG TTTGCTGATT TCGATACCAC CAGTAACCGC
CTGGGTAACA CCGTTAAGGA TAAAAATGCC CGCCTGGCTG CGGTTCTGAA AGGGGTTGAA
GGGTTAAAAC TTGGTGACTT CAACGAACAT CAGATTGACC TGTTCGGCGA TGCCTATGAG
TTCCTGATTT CTAACTATGC GGCGAATGCT GGTAAGTCAG GCGGCGAGTT TTTTACACCG
CAGCATGTCT CTAAGCTGAT TGCACAACTG GCTATGCACG GGCAGACCCA CGTTAACAAA
ATCTACGACC CAGCCGCAGG TTCCGGTTCG CTGTTGTTAC AGGCGAAAAA GCAGTTTGAT
GACCATATCA TCGAAGAAGG CTTTTTTGGT CAGGAGATCA ACCATACGAC CTATAACCTG
GCGCGTATGA ACATGTTTTT GCACAACATC AACTACGACA AGTTTGATAT CAAGCTGGGC
AATACGCTGA CTGAGCCGCA CTTCAGAGAT GAAAAACCGT TTGATGCCAT CGTTTCTAAC
CCGCCGTATT CAGTGAAATG GATTGGCAGC GATGACCCGA CGCTGATTAA CGATGAGCGT
TTTGCCCCGG CTGGCGTCCT GGCCCCCAAA TCAAAAGCTG ACTTCGCGTT TGTATTACAT
GCGCTGAACT ATCTTTCGGC CAAAGGTCGC GCCGCGATTG TCTGCTTCCC GGGTATTTTT
TACCGTGGCG GCGCAGAGCA GAAAATCCGT CAGTATCTGG TCGATAATAA CTATGTCGAA
ACCGTGATTT CACTGGCACC GAACCTGTTC TTTGGCACCA CCATTGCCGT AAACATTCTG
GTGCTGTCTA AACATAAAAC GGATACCAAA GTTCAGTTTA TTGACGCCAG CGAACTGTTC
AAAAAAGAGA CTAACAACAA TATCCTGACC GATGCCCATA TCGAACAGAT TATGCAGGTA
TTTGCCAGCA AGGAAGATGT TGCTCATCTG GCGAAATCTG TTGCGTTTGA GACTGTTGTC
GCGAATGACT ATAACCTGTC GGTGAGCAGC TATGTGGAAG CGAAAGATAA CCGCGAAATT
ATCAATATTG CTGAACTGAA TGCAGAGCTG AAAACCACGG TCAGCAAAAT CGACCAGTTG
CGTAAAGATA TTGATGCAAT TGTGGCTGAA ATTGAAGGCT GCGAGGTGCA GGTAATGACT
CCAACTTATT GA
 
Protein sequence
MTSIQQRAEL HRQIWQIAND VRGSVDGWDF KQYVLGALFY RFISENFSSY IEAGDDSICY 
AKLDDSVITD DIKDDAIKTK GYFIYPSQLF CNVAAKANTN DRLNADLNSI FVAIESSAYG
YPSEADIKGL FADFDTTSNR LGNTVKDKNA RLAAVLKGVE GLKLGDFNEH QIDLFGDAYE
FLISNYAANA GKSGGEFFTP QHVSKLIAQL AMHGQTHVNK IYDPAAGSGS LLLQAKKQFD
DHIIEEGFFG QEINHTTYNL ARMNMFLHNI NYDKFDIKLG NTLTEPHFRD EKPFDAIVSN
PPYSVKWIGS DDPTLINDER FAPAGVLAPK SKADFAFVLH ALNYLSAKGR AAIVCFPGIF
YRGGAEQKIR QYLVDNNYVE TVISLAPNLF FGTTIAVNIL VLSKHKTDTK VQFIDASELF
KKETNNNILT DAHIEQIMQV FASKEDVAHL AKSVAFETVV ANDYNLSVSS YVEAKDNREI
INIAELNAEL KTTVSKIDQL RKDIDAIVAE IEGCEVQVMT PTY