Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4412 |
Symbol | mutL |
ID | 5595092 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 4422148 |
End bp | 4423995 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640923510 |
Product | DNA mismatch repair protein |
Protein accession | YP_001460951 |
Protein GI | 157163633 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.0194282 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAATTC AGGTCTTACC GCCACAACTG GCGAACCAGA TTGCCGCAGG TGAGGTGGTC GAGCGACCTG CGTCGGTAGT CAAAGAACTG GTGGAAAACA GCCTCGATGC AGGTGCGACA CGTATCGATA TTGATATCGA ACGCGGTGGG GCGAAACTTA TCCGCATTCG TGATAACGGC TGCGGTATCA AAAAAGACGA GCTGGCGCTG GCGCTGGCGC GTCATGCCAC CAGTAAAATC GCCTCTCTGG ACGATCTCGA AGCCATTATC AGCCTGGGCT TTCGCGGTGA GGCGCTGGCG AGTATCAGTT CGGTTTCCCG CCTGACGCTC ACTTCACGCA CCGCAGAACA GCAGGAAGCC TGGCAGGCCT ATGCCGAAGG GCGCGATATG AACGTGACGG TAAAACCGGC GGCGCATCCT GTGGGGACGA CGCTGGAGGT GCTGGATCTG TTCTACAACA CCCCGGCGCG GCGCAAATTC CTGCGCACCG AGAAAACCGA ATTTAACCAC ATTGATGAGA TCATCCGCCG CATTGCGCTG GCGCGTTTCG ACGTCACGAT CAACCTGTCG CATAACGGTA AAATTGTGCG TCAGTACCGC GCAGTGCCGG AAGGCGGGCA AAAAGAACGG CGCTTAGGCG CGATTTGCGG CACCGCTTTT CTTGAACAAG CGCTGGCGAT TGAATGGCAA CACGGCGATC TCACGCTACG CGGCTGGGTG GCCGATCCAA ATCACACCAC GCCCGCACTG GCAGAAATTC AGTATTGCTA CGTGAACGGT CGCATGATGC GCGATCGCCT GATCAATCAC GCGATCCGCC AGGCCTGCGA AGACAAACTG GGGGCCGATC AGCAACCGGC ATTTGTGCTG TATCTGGAGA TCGATCCGCA TCAGGTGGAC GTCAACGTGC ACCCCGCCAA ACACGAAGTG CGTTTCCATC AGTCGCGTCT GGTGCATGAT TTTATCTATC AGGGCGTGCT GAGCGTGCTA CAACAGCAAC TGGAAACGCC GCTACCGCTG GACGATGAAC CCCAACCTGC ACCGCGTTCC ATTCCGGAAA ACCGCGTGGC GGCGGGGCGC AATCACTTTG CAGAACCGGC AGCTCGTGAG CCGGTAGCTC CGCGCTACAC TCCTGCGCCA GCATCAGGCA GTCGTCCGGC TGCCCCCTGG CCGAATGCGC AGCCAGGCTA CCAGAAACAG CAAGGTGAAG TGTATCGCCA GCTTTTGCAA ACGCCCGCGC CGATGCAAAA ACCAAAAGCG CCGGAACCGC AGGAACCTGC ACTTGCGGCG AACAGTCAGA GTTTTGGTCG GGTACTGACT ATCGTCCATT CCGACTGTGC GTTGCTGGAG CGCGATGGCA ACATTTCACT TTTATCCTTG CCAGTGGCAG AACGTTGGCT ACGTCAGGCA CAATTGACGC CGGGTGAAGC GCCCGTTTGC GCCCAGCCGC TGCTGATTCC GTTGCGGCTA AAAGTTTCTG CCGAAGAAAA ATCGGCATTA GAAAAAGCGC AGTCTGCCCT GGCGGAATTG GGTATTGATT TCCAGTCAGA TGCACAGCAT GTGACCATCA GGGCCGTGCC TTTACCCTTA CGCCAACAAA ATTTACAAAT CTTGATTCCT GAACTGATAG GCTACCTGGC GAAGCAGTCC GTATTCGAAC CTGGCAATAT TGCGCAGTGG ATTGCACGAA ATCTGATGAG CGAACATGCG CAGTGGTCAA TGGCACAGGC CATAACCCTG CTGGCGGACG TGGAACGGTT ATGTCCGCAA CTTGTGAAAA CGCCGCCGGG TGGTCTGTTA CAATCTGTTG ATTTACATCC GGCGATAAAA GCCCTGAAAG ATGAGTGA
|
Protein sequence | MPIQVLPPQL ANQIAAGEVV ERPASVVKEL VENSLDAGAT RIDIDIERGG AKLIRIRDNG CGIKKDELAL ALARHATSKI ASLDDLEAII SLGFRGEALA SISSVSRLTL TSRTAEQQEA WQAYAEGRDM NVTVKPAAHP VGTTLEVLDL FYNTPARRKF LRTEKTEFNH IDEIIRRIAL ARFDVTINLS HNGKIVRQYR AVPEGGQKER RLGAICGTAF LEQALAIEWQ HGDLTLRGWV ADPNHTTPAL AEIQYCYVNG RMMRDRLINH AIRQACEDKL GADQQPAFVL YLEIDPHQVD VNVHPAKHEV RFHQSRLVHD FIYQGVLSVL QQQLETPLPL DDEPQPAPRS IPENRVAAGR NHFAEPAARE PVAPRYTPAP ASGSRPAAPW PNAQPGYQKQ QGEVYRQLLQ TPAPMQKPKA PEPQEPALAA NSQSFGRVLT IVHSDCALLE RDGNISLLSL PVAERWLRQA QLTPGEAPVC AQPLLIPLRL KVSAEEKSAL EKAQSALAEL GIDFQSDAQH VTIRAVPLPL RQQNLQILIP ELIGYLAKQS VFEPGNIAQW IARNLMSEHA QWSMAQAITL LADVERLCPQ LVKTPPGGLL QSVDLHPAIK ALKDE
|
| |