Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1767 |
Symbol | |
ID | 6971334 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1694423 |
End bp | 1695481 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643385717 |
Product | DNA methylase |
Protein accession | YP_002270209 |
Protein GI | 209399852 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0863] DNA modification methylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000189808 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.00347478 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTTAATA CTGTAAAAAT ATCCAGTTGT GAGTTAATCA ACGCCGACTG CCTGGAATTT ATGCGGTCGT TACCCGAAAA TTCTGTTGAC CTGATAGTCA CGGACCCGCC GTACTTCAAA GTGAAACCCG AGGGCTGGGA TAACCAGTGG GCGGGTGATG AAGATTACCT GAAGTGGCTG GACCAGTGTC TTGCGCAGTT CTGGCGGGTG CTGAAACCTG CCGGAAGTCT TTACCTGTTC TGTGGCCATC GTCTGGCATC TGACACCGAA ATCATGATGC GTGAGCGGTT TAACGTGCTG AACCATATCA TCTGGGCAAA GCCGTCCGGA CGCTGGAACG GGTGCAACAA GGAAAGCCTG CGGGCGTATT TCCCCGCCAC AGAGCGCATT CTGTTCGCAG AGCATTATCA GGGGCCGTAT CGTCCGAAAG ATGCCGGGTA TGAGGCGAAG GGTAGGACAC TGAAACAGCA TGTGATGGCC CCGCTGATTG CTTACTTTCG TGATGCGCGC GCTGTCCTGG GGATAACGGC AAAACAGATT GCAGATGCCA CAGGAAAGAA AAACATGGTG TCGCACTGGT TCAGTGCCGG TCAGTGGCAG CTGCCGAACG AAAGCGATTA TCTGAAATTA CAGGCACTGT TTGCCCGGGT GGCAGAAGAG AAGCATCAGC GGGGTGAACT GGAAAAGCCC CACCACCAGC TGGTGGATAC GTATGCCTCT CTGAACCGAC AGTATGCGGA GCTGCAGAGT GAATATAAGC ATCTGCGGCG GTATTTCGGT GTGACGGTGC AGGTGCCGTA CACCGATGTG TGGACGTATA AACCGGTGCA GTACTATCCA GGGAAACATC CGTGCGAAAA ACCGGCAGAA ATGTTGCAGC AGATAATCAG CGCAAGCAGT CGTCCGGGAG ACCTGGTTGC AGATTTCTTC ATGGGGTCGG GGTCGACAGT GAAAGCAGCG ATGGCGCTGG GACGTCGTGC AACTGGCGTT GAACTGGAGA CTGAACGTTT TGAGCAGACG GTGCGGGAAG TACAGGATTT AATCATTCGT AACGGATGA
|
Protein sequence | MLNTVKISSC ELINADCLEF MRSLPENSVD LIVTDPPYFK VKPEGWDNQW AGDEDYLKWL DQCLAQFWRV LKPAGSLYLF CGHRLASDTE IMMRERFNVL NHIIWAKPSG RWNGCNKESL RAYFPATERI LFAEHYQGPY RPKDAGYEAK GRTLKQHVMA PLIAYFRDAR AVLGITAKQI ADATGKKNMV SHWFSAGQWQ LPNESDYLKL QALFARVAEE KHQRGELEKP HHQLVDTYAS LNRQYAELQS EYKHLRRYFG VTVQVPYTDV WTYKPVQYYP GKHPCEKPAE MLQQIISASS RPGDLVADFF MGSGSTVKAA MALGRRATGV ELETERFEQT VREVQDLIIR NG
|
| |