Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3843 |
Symbol | mutL |
ID | 6066878 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 4197817 |
End bp | 4199664 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641603255 |
Product | DNA mismatch repair protein |
Protein accession | YP_001726774 |
Protein GI | 170021820 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000795964 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000387336 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCAATTC AGGTCTTACC GCCACAACTG GCGAACCAGA TTGCCGCAGG TGAGGTGGTC GAGCGACCTG CGTCGGTAGT CAAAGAACTG GTGGAAAACA GCCTCGATGC AGGTGCGACG CGTATCGATA TTGATATCGA ACGCGGTGGG GCGAAAATTA TCCGCATTCG TGATAACGGC TGCGGTATCA AAAAAGATGA GCTGGCGCTG GCGCTGGCTC GTCATGCCAC CAGTAAAATC GCCTCTCTGG ACGATCTCGA AGCCATTATC AGCCTGGGCT TTCGCGGTGA GGCGCTGGCG AGTATCAGTT CGGTTTCCCG CCTGACGCTC ACTTCACGCA CCGCAGAACA GCAGGAAGCC TGGCAGGCCT ATGCCGAAGG GCGCGATATG GACGTGACGG TTAAACCGGC GGCGCATCCG GTGGGAACGA CGCTGGAGGT GCTGGATCTG TTCTACAACA CCCCGGCGCG GCGCAAATTC CTGCGCACCG AGAAAACCGA ATTTAACCAT ATCGATGAAA TCATCCGCCG CATTGCGCTG GCGCGTTTCG ACGTTACGAT CAACCTGTCG CATAACGGTA AAATTGTGCG TCAGTACCGT GCAGTGCCGG AAGGCGGGCA AAAAGAACGG CGCTTAGGCG CGATTTGCGG CACCGCTTTT CTCGAACAAG CGCTGGCGAT TGAATGGCAA CACGGCGATC TCACCCTGCG CGGCTGGGTG GCCGATCCGA ATCACACCAC GCCCGCCCTG GCGGAAATTC AGTATTGCTA CGTGAATGGT CGCATGATGC GCGATCGCCT GATCAATCAC GCGATCCGCC AGGCTTGCGA AGACAAACTG GGGGCCGATC AGCAACCGGC ATTTGTGTTG TATCTGGAGA TCGACCCACA TCAGGTGGAC GTTAACGTGC ACCCCGCCAA ACACGAAGTG CGTTTTCATC AGTCGCGTCT GGTGCATGAC TTTATCTATC AGGGCGTGCT GAGCGTGCTA CAACAGCAGC TGGAAACGCC GCTACCGCTG GACGATGAAC CCCAACCTGC ACCGCGTGCT ATTCCGGAAA ACCGCGTGGC GGCGGGGCGC AATCATTTTG CTGAACCGGC AGCTCGTGAG CCGGTAGCTC CGCGTTACAC TCCTGCGCCC GCCTCAGGCA GTCGTCCGGC TGCCCCCTGG CCGAATGCGC AGCCAGGTTA CCAGAAACAG CAAGGTGAAG TGTATCGCCA GCTTTTGCAA ACGCCCGCGC CGATGCAAAA ACCAAAAGCG CCAGAACCGC AGGAACCTGC ACTTGCGGCG AACAGTCAGA GTTTTGGTCG GGTACTGACT ATCGTCCATT CCGACTGTGC GTTGCTGGAG CGCGACGGCA ACATTTCACT TTTAGCCTTG TCAGTGGCAG AACGTTGGCT GCGTCAGGCA CAATTGACGC CGGGTGAAGC GCCCGTTTGC GCCCAGCCGT TGCTGATTCC GTTGAGGCTA AAAGTTTCTG GCGAAGAAAA ATCGGCATTA GAAAAAGCGC AGTCTGCCCT GGCGGAATTG GGTATTGATT TCCAGTCAGA TGCACAGCAT GTGACCATCA GGGCCGTGCC TTTACCCTTA CGCCAACAAA ATTTACAAAT CTTGATTCCT GAACTGATAG GCTACCTGGC GAAGCAGTCC GTATTCGAAC CTGGCAATAT TGCGCAGTGG ATTGCACGAA ATCTGATGAG CGAACATGCG CAGTGGTCAA TGGCACAGGC CATAACCCTG CTGGCGGACG TGGAACGGTT ATGTCCGCAA CTTGTGAAAA CGCCGCCGGG TGGTCTGTTA CAATCTGTTG ATTTACATCC GGCGATAAAA GCCCTGAAAG ATGAGTGA
|
Protein sequence | MPIQVLPPQL ANQIAAGEVV ERPASVVKEL VENSLDAGAT RIDIDIERGG AKIIRIRDNG CGIKKDELAL ALARHATSKI ASLDDLEAII SLGFRGEALA SISSVSRLTL TSRTAEQQEA WQAYAEGRDM DVTVKPAAHP VGTTLEVLDL FYNTPARRKF LRTEKTEFNH IDEIIRRIAL ARFDVTINLS HNGKIVRQYR AVPEGGQKER RLGAICGTAF LEQALAIEWQ HGDLTLRGWV ADPNHTTPAL AEIQYCYVNG RMMRDRLINH AIRQACEDKL GADQQPAFVL YLEIDPHQVD VNVHPAKHEV RFHQSRLVHD FIYQGVLSVL QQQLETPLPL DDEPQPAPRA IPENRVAAGR NHFAEPAARE PVAPRYTPAP ASGSRPAAPW PNAQPGYQKQ QGEVYRQLLQ TPAPMQKPKA PEPQEPALAA NSQSFGRVLT IVHSDCALLE RDGNISLLAL SVAERWLRQA QLTPGEAPVC AQPLLIPLRL KVSGEEKSAL EKAQSALAEL GIDFQSDAQH VTIRAVPLPL RQQNLQILIP ELIGYLAKQS VFEPGNIAQW IARNLMSEHA QWSMAQAITL LADVERLCPQ LVKTPPGGLL QSVDLHPAIK ALKDE
|
| |