Gene Information |
Locus tag | ECH74115_3151 |
Symbol | |
ID | 6972395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2918759 |
End bp | 2919817 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643386974 |
Product | DNA methylase |
Protein accession | YP_002271441 |
Protein GI | 209396181 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0863] DNA modification methylase |
TIGRFAM ID | |
|
|
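The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that adds no residue; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2918759
end_bp = 2919817
gene_length_bp = 1059
protein_length_aa = 352

# Assumption: coordinates are 1-based and inclusive,
# so the span covered by the gene is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons. Assumption: the last codon
# is the stop codon, so it does not contribute an amino acid.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 1059 0 352
```

Under these assumptions the span matches the listed gene length, and the derived residue count matches the listed protein length.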
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000168176 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.000000000119452 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTTAATA CTGTAAAAAT ATCCAGTTGT GAGTTAATCA ACGCCGACTG CCTGGAATTT ATCCGGTCGT TACCCGAAAA TTCTGTTGAC CTGATAGTCA CGGACCCGCC GTACTTTAAA GTGAAGCCTG AGGGCTGGGA TAACCAGTGG ACGGGTGATG AGGATTACCT GAAATGGCTG GACCAGTGTC TGGCGCAGTT CTGGCGGGTG CTGAAACCTG CCGGAAGTCT TTACCTGTTC TGTGGCCATC GCCTGGCATC TGACATTGAA ATCATGATGC GTGAACGCTT CAGTGTGCTG AACCATATTA TCTGGGCGAA GCCGTCCGGA CGCTGGAACG GGTGCAACAA GGAAAGCCTG AGGGCGTATT TCCCCGCCAC AGAGCGCATT CTGTTCGCGG AACATTATCA GGGGCCGTAT CGTCCGAAAG ATGCCGGGTA TGAGGCGAAG GGCAGGGCAC TGAAACAGCA TGTGATGGCT CCGCTGATTT CTTACTTTCG TGATGCGCGT GCTGCCCTGG GGATAACGGC AAAACAGATA GTGGATGCCA CAGGAAAGAA AAACATGGTG TCGCACTGGT TCAGTGCCAG TCAGTGGCAG CTACCGAACG AAAGCGATTA TCTGAAATTA CAGGCGCTGT TTGCCCGGGT GGCAGAAGAG AAGCATCAGC GCGGTGAACT GGAAAAGCCC CACCACCAGC TGCTGGAGAC GTATACTTCA CTGAACCGGC AGTATGCGGA ACTGCAGAGT GAATATAAAC ATCTGCGGCG GTATTTTGGC GTGACGGCGC AGGTGCCGTA CACGGATGTG TGGACGCATA AACCGGTGCA GTACTATCCC GGGAAACATC CGTGCGAAAA ACCGGCAGAA ATGCTGCAGC AGATAATCAG TGCGAGCAGT CGTCCGGGTG ACCTGGTTGC AGATTTCTTC ATGGGGTCGG GTTCGACAGT CAAAGCCGCG ATGGCGCTGG GGCGTCGTGC AACTGGCGTT GAGCTGGAGA CTGAACGTTT TGAGCAGACG GTCAGGGAAG TACAGGATTT AGTCAGTCAG AACGGATGA
|
Protein sequence | MFNTVKISSC ELINADCLEF IRSLPENSVD LIVTDPPYFK VKPEGWDNQW TGDEDYLKWL DQCLAQFWRV LKPAGSLYLF CGHRLASDIE IMMRERFSVL NHIIWAKPSG RWNGCNKESL RAYFPATERI LFAEHYQGPY RPKDAGYEAK GRALKQHVMA PLISYFRDAR AALGITAKQI VDATGKKNMV SHWFSASQWQ LPNESDYLKL QALFARVAEE KHQRGELEKP HHQLLETYTS LNRQYAELQS EYKHLRRYFG VTAQVPYTDV WTHKPVQYYP GKHPCEKPAE MLQQIISASS RPGDLVADFF MGSGSTVKAA MALGRRATGV ELETERFEQT VREVQDLVSQ NG
|
| |
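The sequence fields can be related to the translation table and GC content listed above. The sketch below is illustrative only: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and it assumes that for codons after the start, translation table 11 uses the same amino acid assignments as the standard code. Alternative start codons are not handled. The gene sequence in this record already begins with ATG, so it appears to be given in coding orientation even though the gene lies on the minus strand; no reverse complement is applied here.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered TCAG).
# Assumption: elongation codons in table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
prefix = "ATGTTTAATA CTGTAAAAAT ATCCAGTTGT"
print(translate(prefix))  # MFNTVKISSC
```

Pasting the full gene sequence into `translate` and `gc_percent` would allow a comparison with the listed protein sequence and GC content.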