Gene ECH74115_5686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5686 
SymbolmutL 
ID6967356 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5326337 
End bp5328184 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content56% 
IMG OID643389319 
ProductDNA mismatch repair protein 
Protein accessionYP_002273712 
Protein GI209398397 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000161678 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAATTC AGGTCTTACC GCCACAACTG GCGAACCAGA TTGCCGCAGG TGAGGTGGTC 
GAGCGACCTG CGTCGGTAGT CAAAGAACTG GTGGAAAACA GCCTCGATGC AGGTGCGACG
CGTATCGATA TTGATATCGA ACGCGGTGGG GCGAAACTTA TCCGCATTCG TGATAACGGC
TGTGGTATCA AAAAAGACGA GCTGGCGCTG GCGCTGGCGC GTCATGCCAC CAGTAAAATC
GCCTCTCTGG ACGATCTCGA AGCCATTATC AGCCTGGGCT TTCGCGGTGA GGCGCTGGCG
AGTATCAGTT CGGTTTCCCG CCTGACGCTC ACTTCACGCA CCGCAGAACA GCAGGAAGCC
TGGCAGGCCT ATGCCGAAGG GCGCGATATG GACGTGACGG TTAAACCGGC GGCGCATCCG
GTGGGAACGA CGCTGGAGGT ACTGGACCTG TTCTACAACA CCCCGGCGCG GCGCAAATTC
CTGCGCACCG AGAAAACCGA ATTTAACCAT ATCGATGAAA TCATCCGCCG CATCGCACTG
GCGCGTTTCG ACGTCACGAT CAACCTGTCG CATAACGGCA AAATCGTACG CCAGTACCGC
GCGGTGCCGG AAGGCGGGCA AAAAGAACGG CGCTTAGGCG CGATTTGTGG CACCGCTTTT
CTCGAACAAG CGCTGGCGAT TGAATGGCAA CACGGCGATC TCACGCTGCG CGGCTGGGTG
GCCGATCCAA ATCACACCAC GCCCGCACTG GCGGAAATTC AGTATTGCTA CGTGAATGGT
CGCATGATGC GCGATCGCCT GATCAATCAC GCGATCCGCC AGGCCTGTGA AGACAAACTG
GGGGCCGATC AGCAACCTGC ATTTGTGTTG TATCTGGAGA TCGACCCGCA TCAGGTGGAC
GTCAACGTGC ACCCCGCCAA ACACGAAGTG CGTTTTCATC AGTCGCGTCT GGTGCATGAC
TTTATCTATC AGGGCGTGCT GAGCGTGCTA CAACAGCAAC TGGAAACGCC GCTACCGCTG
GACGATGAAC CCCAACCTGC ACCGCGTCCC ATTCCGGAAA ACCGCGTGGC GGCGGGGCGC
AATCACTTTG CTGAACCGGC AGTTCGTGAG CCAGTAGCTC CGCGCTACAC TCCTGCGCCA
GCATCAGGTA GTCGTCCGGC TGCCCCCTGG CCGAATGCGC AGCCAGGTTA CCAGAAACAG
CAAGGTGAAG TGTATCGCCA GCTTTTGCAA ACGCCCGCGC CGATGCAAAA ACCAAAAGCG
CCGGAACCGC AGGAACCTGC ACTTGCGGCG AACAGTCAGA GTTTTGGTCG GGTACTGACT
ATCGTCCATT CCGACTGTGC GTTGCTGGAG CGCGACGGCA ACATTTCACT TTTAGCCTTG
CCAGTGGCAG AACGTTGGCT GCGTCAGGTA CAACTGACGC CGGGTGAAGC GCCCGTTTGC
GCCCAGCCGT TGCTGATTCC GTTGCGGCTA AAAGTTTCTG GCGAAGAAAA ATCGGCATTA
GAAAAAGCGC AGTCTGCCCT GGCGGAATTG GGTATTGATT TCCAGTCAGA TGCACAGCAT
GTGACCATCA GGGCCGTGCC TTTACCCTTA CGCCAACAAA ATTTACAAAT CTTGATTCCT
GAACTGATAG GCTACCTAGC GAAGCAGTCC GTATTCGAAC CTGGCAATAT TGCGCAGTGG
ATTGCACGAA ATCTGATGAG CGAACATGCG CAGTGGTCAA TGGCACAGGC CATAACCCTG
CTGGCGGACG TGGAACGGTT ATGTCCGCAA CTTGTGAAAA CGCCGCCGGG TGGTCTGTTA
CAATCTGTTG ATTTACATCC GGCGATAAAA GCCCTGAAAG ATGAGTGA
 
Protein sequence
MPIQVLPPQL ANQIAAGEVV ERPASVVKEL VENSLDAGAT RIDIDIERGG AKLIRIRDNG 
CGIKKDELAL ALARHATSKI ASLDDLEAII SLGFRGEALA SISSVSRLTL TSRTAEQQEA
WQAYAEGRDM DVTVKPAAHP VGTTLEVLDL FYNTPARRKF LRTEKTEFNH IDEIIRRIAL
ARFDVTINLS HNGKIVRQYR AVPEGGQKER RLGAICGTAF LEQALAIEWQ HGDLTLRGWV
ADPNHTTPAL AEIQYCYVNG RMMRDRLINH AIRQACEDKL GADQQPAFVL YLEIDPHQVD
VNVHPAKHEV RFHQSRLVHD FIYQGVLSVL QQQLETPLPL DDEPQPAPRP IPENRVAAGR
NHFAEPAVRE PVAPRYTPAP ASGSRPAAPW PNAQPGYQKQ QGEVYRQLLQ TPAPMQKPKA
PEPQEPALAA NSQSFGRVLT IVHSDCALLE RDGNISLLAL PVAERWLRQV QLTPGEAPVC
AQPLLIPLRL KVSGEEKSAL EKAQSALAEL GIDFQSDAQH VTIRAVPLPL RQQNLQILIP
ELIGYLAKQS VFEPGNIAQW IARNLMSEHA QWSMAQAITL LADVERLCPQ LVKTPPGGLL
QSVDLHPAIK ALKDE