Gene Information |
Locus tag | ECH74115_2738 |
Symbol | dcm |
ID | 6970819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2563056 |
End bp | 2564474 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643386597 |
Product | DNA cytosine methylase |
Protein accession | YP_002271076 |
Protein GI | 209400193 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0270] Site-specific DNA methylase |
TIGRFAM ID | [TIGR00675] DNA-methyltransferase (dcm) |
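
The Start bp, End bp, Gene Length, and Protein Length fields above are linked by simple arithmetic. The following is a minimal Python sketch of that relationship, assuming the coordinates are 1-based and inclusive (an assumption that matches the listed values) and that the coding sequence ends in a single stop codon. The function and variable names are illustrative and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the Start bp and End bp fields of this record.
length_bp = gene_length(2563056, 2564474)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1419 472
```

Both results agree with the Gene Length (1419 bp) and Protein Length (472 aa) fields.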
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.285306 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.358012 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGAAA ATATATCAGT AACCGATTCA TACAGCACCG GGAATGCCGC ACAGGCAATG CTGGAGAAAC TGCTGCAAAT TTATGATGTT AAAACGCTGG TGGCTCAGCT TAATGGTGTG GGTGAGAATC ACTGGAGCGC GGCAATTTTA AAACGTGCGC TGGCGAATGA CTCGGCATGG CACCGTTTAA GTGAGAAAGA GTTCGCCCAT CTGCAAACGT TGTTACCCAA ACCACCGGCA CATCATCCGC ATTATGCGTT TCGCTTTATC GATCTATTTG CCGGAATTGG CGGCATCCGT CGCGGTTTTG AATCGATTGG CGGACAGTGC GTGTTTACCA GCGAATGGAA CAAACATGCG GTACGCACTT ATAAAGCCAA TCATTATTGC GACCCGGCGA CGCATCATTT TAATGAAGAT ATCCGCGACA TCACCCTCAG CCATAAAGAA GGCGTGAGTG ATGAGGCTGC GGCGGAACAT ATTCGTCAAC ACATTCCTGA ACACGATGTT TTACTGGCCG GTTTCCCTTG TCAGCCATTT TCGCTGGCTG GCGTATCGAA AAAGAACTCG CTCGGGCGGG CGCACGGTTT TGCCTGCGAT ACTCAGGGCA CGCTGTTCTT TGATGTGGTG CGCATTATCG ACGCGCGTCG TCCGGCGATG TTTGTGCTCG AAAACGTCAA AAACCTGAAA AGCCACGACC AGGGTAAAAC GTTCCGCATC ATCATGCAGA CGCTGGACGA ACTGGGCTAT GACGTGGCTG ATGCAGAAGA TAACGGGCCG GACGATCCGA AAATCATCGA TGGCAAACAT TTTCTGCCGC AGCACCGTGA ACGCATCGTG CTGGTGGGTT TTCGTCGCGA TCTTAATCTG AAAGCCGATT TTACTCTGCG TGATATCAGC GAATGTTTCC CTGCACAGCG AGTGACGCTG GCGCAGCTGT TGGACCCGAT GGTCGAGGCG AAATATATCC TGACGCCGGT GCTGTGGAAG TACCTCTATC GATATGCGAA AAAACATCAG GCGCGCGGAA ACGGCTTCGG TTATGGAATG GTTTATCCGA ACAATCCGCA AAGCGTCACC CGTACGCTGT CTGCGCGTTA TTACAAAGAT GGCGCGGAAA TTTTAATCGA TCGCGGCTGG GATATGGCCA CGGGTGAGAA AGACTTTGAC GATCCGCTGA ATCAGCAACA TCGTCCACGT CGGTTAACGC CTCGGGAATG CGCGCGCTTA ATGGGTTTTG AAGCGCCGGG AGAAGCGAAA TTCCGCATTC CGGTTTCGGA CACTCAGGCC TATCGCCAGT TCGGTAACTC GGTGGTCGTG CCGGTCTTTG CCGCGGTGGC AAAACTGCTT GAGCCAAAAA TCAAACAGGC GGTGGCGTTG CGTCAGCAAG AGGCACAACA TGGCCGACGT TCACGATAA
|
Protein sequence | MQENISVTDS YSTGNAAQAM LEKLLQIYDV KTLVAQLNGV GENHWSAAIL KRALANDSAW HRLSEKEFAH LQTLLPKPPA HHPHYAFRFI DLFAGIGGIR RGFESIGGQC VFTSEWNKHA VRTYKANHYC DPATHHFNED IRDITLSHKE GVSDEAAAEH IRQHIPEHDV LLAGFPCQPF SLAGVSKKNS LGRAHGFACD TQGTLFFDVV RIIDARRPAM FVLENVKNLK SHDQGKTFRI IMQTLDELGY DVADAEDNGP DDPKIIDGKH FLPQHRERIV LVGFRRDLNL KADFTLRDIS ECFPAQRVTL AQLLDPMVEA KYILTPVLWK YLYRYAKKHQ ARGNGFGYGM VYPNNPQSVT RTLSARYYKD GAEILIDRGW DMATGEKDFD DPLNQQHRPR RLTPRECARL MGFEAPGEAK FRIPVSDTQA YRQFGNSVVV PVFAAVAKLL EPKIKQAVAL RQQEAQHGRR SR
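
The gene and protein sequences above can be cross-checked by translating the gene sequence as listed (it begins with ATG and ends with TAA, so it is already in coding orientation even though the Strand field is "-"). The following is a minimal Python sketch, assuming the amino-acid assignments of translation table 11, which are the same as those of the standard genetic code. Only the first 30 and last 9 bases of the record are used to keep the example short, and the helper names are illustrative.

```python
# Codon table built in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(grouped_dna: str) -> str:
    """Translate a DNA string, ignoring the spaces between 10-base groups."""
    dna = "".join(grouped_dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Fragments copied from the Gene sequence field of this record.
first_30_bases = "ATGCAGGAAA ATATATCAGT AACCGATTCA"
last_9_bases = "TCACGATAA"

n_terminus = translate(first_30_bases)
c_terminus = translate(last_9_bases)
print(n_terminus)  # MQENISVTDS
print(c_terminus)  # SR*
```

The first ten residues match the start of the listed protein sequence, and the final codons give the terminal "SR" followed by the stop codon, which the protein sequence omits.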