Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5686 |
Symbol | mutL |
ID | 6967356 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 5326337 |
End bp | 5328184 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643389319 |
Product | DNA mismatch repair protein |
Protein accession | YP_002273712 |
Protein GI | 209398397 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000161678 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAATTC AGGTCTTACC GCCACAACTG GCGAACCAGA TTGCCGCAGG TGAGGTGGTC GAGCGACCTG CGTCGGTAGT CAAAGAACTG GTGGAAAACA GCCTCGATGC AGGTGCGACG CGTATCGATA TTGATATCGA ACGCGGTGGG GCGAAACTTA TCCGCATTCG TGATAACGGC TGTGGTATCA AAAAAGACGA GCTGGCGCTG GCGCTGGCGC GTCATGCCAC CAGTAAAATC GCCTCTCTGG ACGATCTCGA AGCCATTATC AGCCTGGGCT TTCGCGGTGA GGCGCTGGCG AGTATCAGTT CGGTTTCCCG CCTGACGCTC ACTTCACGCA CCGCAGAACA GCAGGAAGCC TGGCAGGCCT ATGCCGAAGG GCGCGATATG GACGTGACGG TTAAACCGGC GGCGCATCCG GTGGGAACGA CGCTGGAGGT ACTGGACCTG TTCTACAACA CCCCGGCGCG GCGCAAATTC CTGCGCACCG AGAAAACCGA ATTTAACCAT ATCGATGAAA TCATCCGCCG CATCGCACTG GCGCGTTTCG ACGTCACGAT CAACCTGTCG CATAACGGCA AAATCGTACG CCAGTACCGC GCGGTGCCGG AAGGCGGGCA AAAAGAACGG CGCTTAGGCG CGATTTGTGG CACCGCTTTT CTCGAACAAG CGCTGGCGAT TGAATGGCAA CACGGCGATC TCACGCTGCG CGGCTGGGTG GCCGATCCAA ATCACACCAC GCCCGCACTG GCGGAAATTC AGTATTGCTA CGTGAATGGT CGCATGATGC GCGATCGCCT GATCAATCAC GCGATCCGCC AGGCCTGTGA AGACAAACTG GGGGCCGATC AGCAACCTGC ATTTGTGTTG TATCTGGAGA TCGACCCGCA TCAGGTGGAC GTCAACGTGC ACCCCGCCAA ACACGAAGTG CGTTTTCATC AGTCGCGTCT GGTGCATGAC TTTATCTATC AGGGCGTGCT GAGCGTGCTA CAACAGCAAC TGGAAACGCC GCTACCGCTG GACGATGAAC CCCAACCTGC ACCGCGTCCC ATTCCGGAAA ACCGCGTGGC GGCGGGGCGC AATCACTTTG CTGAACCGGC AGTTCGTGAG CCAGTAGCTC CGCGCTACAC TCCTGCGCCA GCATCAGGTA GTCGTCCGGC TGCCCCCTGG CCGAATGCGC AGCCAGGTTA CCAGAAACAG CAAGGTGAAG TGTATCGCCA GCTTTTGCAA ACGCCCGCGC CGATGCAAAA ACCAAAAGCG CCGGAACCGC AGGAACCTGC ACTTGCGGCG AACAGTCAGA GTTTTGGTCG GGTACTGACT ATCGTCCATT CCGACTGTGC GTTGCTGGAG CGCGACGGCA ACATTTCACT TTTAGCCTTG CCAGTGGCAG AACGTTGGCT GCGTCAGGTA CAACTGACGC CGGGTGAAGC GCCCGTTTGC GCCCAGCCGT TGCTGATTCC GTTGCGGCTA AAAGTTTCTG GCGAAGAAAA ATCGGCATTA GAAAAAGCGC AGTCTGCCCT GGCGGAATTG GGTATTGATT TCCAGTCAGA TGCACAGCAT GTGACCATCA GGGCCGTGCC TTTACCCTTA CGCCAACAAA ATTTACAAAT CTTGATTCCT GAACTGATAG GCTACCTAGC GAAGCAGTCC GTATTCGAAC CTGGCAATAT TGCGCAGTGG ATTGCACGAA ATCTGATGAG CGAACATGCG CAGTGGTCAA TGGCACAGGC CATAACCCTG CTGGCGGACG TGGAACGGTT ATGTCCGCAA CTTGTGAAAA CGCCGCCGGG TGGTCTGTTA CAATCTGTTG ATTTACATCC GGCGATAAAA GCCCTGAAAG ATGAGTGA
|
Protein sequence | MPIQVLPPQL ANQIAAGEVV ERPASVVKEL VENSLDAGAT RIDIDIERGG AKLIRIRDNG CGIKKDELAL ALARHATSKI ASLDDLEAII SLGFRGEALA SISSVSRLTL TSRTAEQQEA WQAYAEGRDM DVTVKPAAHP VGTTLEVLDL FYNTPARRKF LRTEKTEFNH IDEIIRRIAL ARFDVTINLS HNGKIVRQYR AVPEGGQKER RLGAICGTAF LEQALAIEWQ HGDLTLRGWV ADPNHTTPAL AEIQYCYVNG RMMRDRLINH AIRQACEDKL GADQQPAFVL YLEIDPHQVD VNVHPAKHEV RFHQSRLVHD FIYQGVLSVL QQQLETPLPL DDEPQPAPRP IPENRVAAGR NHFAEPAVRE PVAPRYTPAP ASGSRPAAPW PNAQPGYQKQ QGEVYRQLLQ TPAPMQKPKA PEPQEPALAA NSQSFGRVLT IVHSDCALLE RDGNISLLAL PVAERWLRQV QLTPGEAPVC AQPLLIPLRL KVSGEEKSAL EKAQSALAEL GIDFQSDAQH VTIRAVPLPL RQQNLQILIP ELIGYLAKQS VFEPGNIAQW IARNLMSEHA QWSMAQAITL LADVERLCPQ LVKTPPGGLL QSVDLHPAIK ALKDE
|
| |