Gene ECH74115_2738 details


Gene Information

Locus tag: ECH74115_2738
Symbol: dcm
ID: 6970819
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2563056
End bp: 2564474
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 52%
IMG OID: 643386597
Product: DNA cytosine methylase
Protein accession: YP_002271076
Protein GI: 209400193
COG category: [L] Replication, recombination and repair
COG ID: [COG0270] Site-specific DNA methylase
TIGRFAM ID: [TIGR00675] DNA-methyltransferase (dcm)
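
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(2563056, 2564474)
print(length_bp)                  # 1419
print(protein_length(length_bp))  # 472
```

Both values agree with the Gene Length and Protein Length fields of this record.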


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.285306
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.358012
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGAAA ATATATCAGT AACCGATTCA TACAGCACCG GGAATGCCGC ACAGGCAATG 
CTGGAGAAAC TGCTGCAAAT TTATGATGTT AAAACGCTGG TGGCTCAGCT TAATGGTGTG
GGTGAGAATC ACTGGAGCGC GGCAATTTTA AAACGTGCGC TGGCGAATGA CTCGGCATGG
CACCGTTTAA GTGAGAAAGA GTTCGCCCAT CTGCAAACGT TGTTACCCAA ACCACCGGCA
CATCATCCGC ATTATGCGTT TCGCTTTATC GATCTATTTG CCGGAATTGG CGGCATCCGT
CGCGGTTTTG AATCGATTGG CGGACAGTGC GTGTTTACCA GCGAATGGAA CAAACATGCG
GTACGCACTT ATAAAGCCAA TCATTATTGC GACCCGGCGA CGCATCATTT TAATGAAGAT
ATCCGCGACA TCACCCTCAG CCATAAAGAA GGCGTGAGTG ATGAGGCTGC GGCGGAACAT
ATTCGTCAAC ACATTCCTGA ACACGATGTT TTACTGGCCG GTTTCCCTTG TCAGCCATTT
TCGCTGGCTG GCGTATCGAA AAAGAACTCG CTCGGGCGGG CGCACGGTTT TGCCTGCGAT
ACTCAGGGCA CGCTGTTCTT TGATGTGGTG CGCATTATCG ACGCGCGTCG TCCGGCGATG
TTTGTGCTCG AAAACGTCAA AAACCTGAAA AGCCACGACC AGGGTAAAAC GTTCCGCATC
ATCATGCAGA CGCTGGACGA ACTGGGCTAT GACGTGGCTG ATGCAGAAGA TAACGGGCCG
GACGATCCGA AAATCATCGA TGGCAAACAT TTTCTGCCGC AGCACCGTGA ACGCATCGTG
CTGGTGGGTT TTCGTCGCGA TCTTAATCTG AAAGCCGATT TTACTCTGCG TGATATCAGC
GAATGTTTCC CTGCACAGCG AGTGACGCTG GCGCAGCTGT TGGACCCGAT GGTCGAGGCG
AAATATATCC TGACGCCGGT GCTGTGGAAG TACCTCTATC GATATGCGAA AAAACATCAG
GCGCGCGGAA ACGGCTTCGG TTATGGAATG GTTTATCCGA ACAATCCGCA AAGCGTCACC
CGTACGCTGT CTGCGCGTTA TTACAAAGAT GGCGCGGAAA TTTTAATCGA TCGCGGCTGG
GATATGGCCA CGGGTGAGAA AGACTTTGAC GATCCGCTGA ATCAGCAACA TCGTCCACGT
CGGTTAACGC CTCGGGAATG CGCGCGCTTA ATGGGTTTTG AAGCGCCGGG AGAAGCGAAA
TTCCGCATTC CGGTTTCGGA CACTCAGGCC TATCGCCAGT TCGGTAACTC GGTGGTCGTG
CCGGTCTTTG CCGCGGTGGC AAAACTGCTT GAGCCAAAAA TCAAACAGGC GGTGGCGTTG
CGTCAGCAAG AGGCACAACA TGGCCGACGT TCACGATAA
 
Protein sequence
MQENISVTDS YSTGNAAQAM LEKLLQIYDV KTLVAQLNGV GENHWSAAIL KRALANDSAW 
HRLSEKEFAH LQTLLPKPPA HHPHYAFRFI DLFAGIGGIR RGFESIGGQC VFTSEWNKHA
VRTYKANHYC DPATHHFNED IRDITLSHKE GVSDEAAAEH IRQHIPEHDV LLAGFPCQPF
SLAGVSKKNS LGRAHGFACD TQGTLFFDVV RIIDARRPAM FVLENVKNLK SHDQGKTFRI
IMQTLDELGY DVADAEDNGP DDPKIIDGKH FLPQHRERIV LVGFRRDLNL KADFTLRDIS
ECFPAQRVTL AQLLDPMVEA KYILTPVLWK YLYRYAKKHQ ARGNGFGYGM VYPNNPQSVT
RTLSARYYKD GAEILIDRGW DMATGEKDFD DPLNQQHRPR RLTPRECARL MGFEAPGEAK
FRIPVSDTQA YRQFGNSVVV PVFAAVAKLL EPKIKQAVAL RQQEAQHGRR SR
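
The protein sequence above is the translation of the gene sequence, read codon by codon up to the stop codon. As a rough illustration, the sketch below translates a DNA string using the standard codon assignments (translation table 11 uses the same amino acid assignments as the standard table and differs only in its permitted start codons) and computes a GC fraction. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the code assumes the sequence is given on the coding strand with only A, C, G, and T.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence shown in this record.
first_line = "ATGCAGGAAA ATATATCAGT AACCGATTCA TACAGCACCG GGAATGCCGC ACAGGCAATG"
last_line = "CGTCAGCAAG AGGCACAACA TGGCCGACGT TCACGATAA"

print(translate(first_line))  # MQENISVTDSYSTGNAAQAM
print(translate(last_line))   # RQQEAQHGRRSR
```

The two outputs match the first 20 and last 12 residues of the protein sequence above. Applying `gc_fraction` to the full gene sequence would give the value summarized in the GC content field.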