Gene EcE24377A_4728 details

Gene Information

Locus tag: EcE24377A_4728
Symbol: mutL
ID: 5588560
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 4729537
End bp: 4731384
Gene Length: 1848 bp
Protein Length: 615 aa
Translation table: 11
GC content: 56%
IMG OID: 640928340
Product: DNA mismatch repair protein
Protein accession: YP_001465668
Protein GI: 157158119
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
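
The Start bp, End bp, Gene Length and Protein Length values listed above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information table above.
start_bp = 4729537
end_bp = 4731384
gene_length_bp = 1848
protein_length_aa = 615


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)              # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Under these assumptions the record is internally consistent: 4731384 - 4729537 + 1 = 1848 bp, and 1848 / 3 - 1 = 615 aa.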


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0929403
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAATTC AGGTCTTACC GCCACAACTG GCGAACCAGA TTGCCGCAGG TGAGGTGGTC 
GAGCGACCTG CGTCGGTAGT CAAAGAACTG GTGGAAAACA GCCTCGATGC AGGTGCGACA
CGTATCGATA TTGATATCGA ACGCGGTGGG GCGAAACTTA TCCGCATTCG TGATAACGGC
TGCGGTATCA AAAAAGACGA GCTGGCGCTG GCGCTGGCGC GTCATGCCAC CAGTAAAATC
GCCTCTCTGG ACGATCTCGA AGCCATTATC AGCCTGGGCT TTCGCGGTGA GGCGCTGGCG
AGTATCAGTT CGGTTTCCCG CCTGACGCTC ACTTCACGCA CCGCAGAACA GCAGGAAGCC
TGGCAGGCCT ATGCCGAAGG GCGCGATATG AACGTGACGG TAAAACCGGC GGCGCATCCT
GTGGGGACGA CGCTGGAGGT GCTGGATCTG TTCTACAACA CCCCGGCGCG GCGCAAATTC
CTGCGCACCG AGAAAACCGA ATTTAACCAC ATTGATGAGA TCATCCGCCG CATTGCGCTG
GCGCGTTTCG ACGTCACGAT CAACCTGTCG CATAACGGTA AAATTGTGCG TCAGTACCGC
GCAGTGCCGG AAGGCGGGCA AAAAGAACGG CGCTTAGGCG CGATTTGCGG CACCGCTTTT
CTTGAACAAG CGCTGGCGAT TGAATGGCAA CACGGCGATC TCACGCTACG CGGCTGGGTG
GCCGATCCAA ATCACACCAC GCCCGCACTG GCAGAAATTC AGTATTGCTA CGTGAACGGT
CGCATGATGC GCGATCGCCT GATCAATCAC GCGATCCGCC AGGCCTGCGA AGACAAACTG
GGGGCCGATC AGCAACCGGC ATTTGTGCTG TATCTGGAGA TCGATCCGCA TCAGGTGGAC
GTCAACGTGC ACCCCGCCAA ACACGAAGTG CGTTTCCATC AGTCGCGTCT GGTGCATGAT
TTTATCTATC AGGGCGTGCT GAGCGTGCTA CAACAGCAAC TGGAAACGCC GCTACCGCTG
GACGATGAAC CCCAACCTGC ACCGCGTTCC ATTCCGGAAA ACCGCGTGGC GGCGGGGCGC
AATCACTTTG CAGAACCGGC AGCTCGTGAG CCGGTAGCTC CGCGCTACAC TCCTGCGCCA
GCATCAGGCA GTCGTCCGGC TGCCCCCTGG CCGAATGCGC AGCCAGGCTA CCAGAAACAG
CAAGGTGAAG TGTATCGCCA GCTTTTGCAA ACGCCCGCGC CGATGCAAAA ACCAAAAGCG
CCGGAACCGC AGGAACCTGC ACTTGCGGCG AACAGTCAGA GTTTTGGTCG GGTACTGACT
ATCGTCCATT CCGACTGTGC GTTGCTGGAG CGCGATGGCA ACATTTCACT TTTATCCTTG
CCAGTGGCAG AACGTTGGCT ACGTCAGGCA CAATTGACGC CGGGTGAAGC GCCCGTTTGC
GCCCAGCCGC TGCTGATTCC GTTGCGGCTA AAAGTTTCTG CCGAAGAAAA ATCGGCATTA
GAAAAAGCGC AGTCTGCCCT GGCGGAATTG GGTATTGATT TCCAGTCAGA TGCACAGCAT
GTGACCATCA GGGCCGTGCC TTTACCCTTA CGCCAACAAA ATTTACAAAT CTTGATTCCT
GAACTGATAG GCTACCTGGC GAAGCAGTCC GTATTCGAAC CTGGCAATAT TGCGCAGTGG
ATTGCACGAA ATCTGATGAG CGAACATGCG CAGTGGTCAA TGGCACAGGC CATAACCCTG
CTGGCGGACG TGGAACGGTT ATGTCCGCAA CTTGTGAAAA CGCCGCCGGG TGGTCTGTTA
CAATCTGTTG ATTTACATCC GGCGATAAAA GCCCTGAAAG ATGAGTGA
 
Protein sequence
MPIQVLPPQL ANQIAAGEVV ERPASVVKEL VENSLDAGAT RIDIDIERGG AKLIRIRDNG 
CGIKKDELAL ALARHATSKI ASLDDLEAII SLGFRGEALA SISSVSRLTL TSRTAEQQEA
WQAYAEGRDM NVTVKPAAHP VGTTLEVLDL FYNTPARRKF LRTEKTEFNH IDEIIRRIAL
ARFDVTINLS HNGKIVRQYR AVPEGGQKER RLGAICGTAF LEQALAIEWQ HGDLTLRGWV
ADPNHTTPAL AEIQYCYVNG RMMRDRLINH AIRQACEDKL GADQQPAFVL YLEIDPHQVD
VNVHPAKHEV RFHQSRLVHD FIYQGVLSVL QQQLETPLPL DDEPQPAPRS IPENRVAAGR
NHFAEPAARE PVAPRYTPAP ASGSRPAAPW PNAQPGYQKQ QGEVYRQLLQ TPAPMQKPKA
PEPQEPALAA NSQSFGRVLT IVHSDCALLE RDGNISLLSL PVAERWLRQA QLTPGEAPVC
AQPLLIPLRL KVSAEEKSAL EKAQSALAEL GIDFQSDAQH VTIRAVPLPL RQQNLQILIP
ELIGYLAKQS VFEPGNIAQW IARNLMSEHA QWSMAQAITL LADVERLCPQ LVKTPPGGLL
QSVDLHPAIK ALKDE
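
The gene and protein sequences above are printed in blocks of ten characters, so whitespace has to be removed before they are processed. The following is a minimal sketch of how the protein sequence and the GC content could be recomputed from the gene sequence. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in their permitted start codons, which this sketch does not model). The helper names `clean`, `gc_fraction` and `translate` are illustrative, not part of any library.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (amino acids as one-letter
# codes, '*' marks a stop codon). Assumed to apply to translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the page layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above give the first 10 residues.
print(translate("ATGCCAATTC AGGTCTTACC GCCACAACTG"))  # MPIQVLPPQL
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the result to be compared with the protein sequence and the GC content reported in this record.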