Gene EcSMS35_4641 details


Gene Information

Locus tag: EcSMS35_4641
Symbol: mutL
ID: 6144684
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4742194
End bp: 4744041
Gene Length: 1848 bp
Protein Length: 615 aa
Translation table: 11
GC content: 56%
IMG OID: 641619457
Product: DNA mismatch repair protein
Protein accession: YP_001746565
Protein GI: 170681264
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
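
The length fields agree with the coordinates if Start bp and End bp are read as 1-based inclusive positions and the protein length excludes the stop codon. A minimal sketch of that arithmetic, assuming those conventions (the function names are illustrative and not part of the source database):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(4742194, 4744041)
    print(length)                  # 1848
    print(protein_length(length))  # 615
```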


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.231193
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.0766044
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAATTC AGGTCTTACC GCCACAACTG GCGAACCAGA TTGCCGCAGG TGAGGTGGTC 
GAGCGACCTG CGTCGGTAGT CAAAGAACTG GTGGAAAACA GCCTCGATGC AGGTGCGACG
CGTATCGATA TTGATATCGA ACGTGGCGGG GCGAAACTTA TCCGCATTCG TGATAACGGC
TGCGGTATCA AAAAAGATGA GCTGGCGCTG GCGCTGGCTC GTCATGCCAC CAGTAAAATC
GCCTCTCTGG ACGATCTCGA AGCCATTATC AGCCTGGGCT TTCGCGGTGA GGCGCTGGCG
AGTATCAGTT CGGTTTCCCG CCTGACGCTC ACTTCACGCA CCGCAGAACA GCAGGAAGCC
TGGCAGGCCT ATGCCGAAGG GCGCGATATG GACGTGACGG TTAAACCGGC GGCGCATCCG
GTGGGAACGA CGCTGGAGGT GCTGGATCTG TTCTACAACA CCCCGGCGCG GCGCAAATTC
CTGCGCACCG AGAAAACCGA ATTTAACCAT ATCGATGAAA TTATTCGCCG CATTGCGCTG
GCGCGTTTCG ACGTCACGAT CAACCTGTCG CATAACGGTA AAATTGTGCG TCAGTACCGT
GCAGTGCCGG AAGGCGGGCA AAAAGAACGG CGCTTAGGCG CGATTTGCGG CACCGCTTTT
CTCGAACAAG CGCTGGCGAT TGAATGGCAA CACGGCGATC TCACCCTGCG CGGCTGGGTG
GCCGATCCGA ATCACACCAC GCCCGCCCTG GCAGAAATTC AGTATTGCTA CGTGAACGGT
CGCATGATGC GCGATCGCCT GATCAATCAC GCGATCCGCC AGGCCTGTGA AGACAAACTG
GGGGCCGATC AGCAACCGGC ATTTGTGTTG TATCTGGAGA TCGACCCACA TCAGGTGGAC
GTCAACGTGC ACCCTGCCAA ACACGAAGTG CGTTTCCATC AGTCGCGTCT GGTGCATGAT
TTTATCTATC AGGGCGTGCT GAGCGTGCTA CAACAGCAAC TGGAAACGCC GCTACCGCTG
GACGATGAAC CCCAACCTGC ACCGCGTGCT ATTCCGGAAA ACCGCGTGGC GGCGGGGCGC
AATCATTTTG CTGAACCGGC AGTTCGTGAG CCAGTAGCTC CGCGTTACAC TCCTGCGCCC
GCCTCAGGCA GTCGTCCGGC TGCCCCCTGG CCGAATGCGC AGCCAGGTTA CCAGAAACAG
CAAGGTGAAG TGTATCGCCA GCTTTTGCAA ACGCCCGCGC CGATGCAAAA ACCAAAAGCG
CCAGAACCGC AGGAACCTGC ACTTGCGGCG AACAGTCAGA GTTTTGGTCG GGTACTGACT
ATCGTCCATT CCGACTGTGC CTTGCTGGAG CGCGACGGAA ACATTTCACT TTTAGCCTTG
CCAGTGGCAG AACGTTGGCT GCGTCAGGCG CAACTGACGC CGGGTGAAGC GCCAGTTTGC
GCCCAACCGC TGCTTATTCC GTTGCGGCTA AAAGTTTCTG GCGAAGAAAA ATCGGCATTA
GAAAAAGCGC AGTCTGCCCT GGCGGAATTG GGTATTGATT TCCAGTCCGA TGCGCAGCAT
GTGACCATCA GGGCCGTGCC TTTACCCTTA CGCCAACAAA ATTTACAAAT CTTGATTCCT
GAACTGATAG GCTACCTGGC GAAGCAGTCC GTATTCGAAC CTGGCAATAT TGCGCAGTGG
ATTGCACGAA ATCTGATGAG CGAACATGCG CAGTGGTCAA TGGCACAGGC CATAACCCTG
CTGGCGGACG TGGAACGGTT ATGTCCGCAA CTTGTGAAAA CGCCGCCGGG TGGTCTGTTA
CAATCTGTTG ATTTACATCC GGCGATAAAA GCCCTGAAAG ATGAGTGA
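
The GC content field can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content means the percentage of G and C bases among all bases and that the spaces and line breaks in the listing are only formatting; `gc_percent` is an illustrative name, and how the source page rounds the value is not stated.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G/C bases in a nucleotide string, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First block row of the gene sequence; paste the full listing to check the whole gene.
    first_row = "ATGCCAATTC AGGTCTTACC GCCACAACTG GCGAACCAGA TTGCCGCAGG TGAGGTGGTC"
    print(f"{gc_percent(first_row):.1f}%")
```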
 
Protein sequence
MPIQVLPPQL ANQIAAGEVV ERPASVVKEL VENSLDAGAT RIDIDIERGG AKLIRIRDNG 
CGIKKDELAL ALARHATSKI ASLDDLEAII SLGFRGEALA SISSVSRLTL TSRTAEQQEA
WQAYAEGRDM DVTVKPAAHP VGTTLEVLDL FYNTPARRKF LRTEKTEFNH IDEIIRRIAL
ARFDVTINLS HNGKIVRQYR AVPEGGQKER RLGAICGTAF LEQALAIEWQ HGDLTLRGWV
ADPNHTTPAL AEIQYCYVNG RMMRDRLINH AIRQACEDKL GADQQPAFVL YLEIDPHQVD
VNVHPAKHEV RFHQSRLVHD FIYQGVLSVL QQQLETPLPL DDEPQPAPRA IPENRVAAGR
NHFAEPAVRE PVAPRYTPAP ASGSRPAAPW PNAQPGYQKQ QGEVYRQLLQ TPAPMQKPKA
PEPQEPALAA NSQSFGRVLT IVHSDCALLE RDGNISLLAL PVAERWLRQA QLTPGEAPVC
AQPLLIPLRL KVSGEEKSAL EKAQSALAEL GIDFQSDAQH VTIRAVPLPL RQQNLQILIP
ELIGYLAKQS VFEPGNIAQW IARNLMSEHA QWSMAQAITL LADVERLCPQ LVKTPPGGLL
QSVDLHPAIK ALKDE
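
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below assumes the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs mainly in the alternative start codons it permits). The `translate` helper is illustrative, not the database's own tooling.

```python
from itertools import product

# Standard genetic code in NCBI order: first, second, third base each cycle through T, C, A, G.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence up to (not including) the first stop codon."""
    dna = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    first_row = "ATGCCAATTC AGGTCTTACC GCCACAACTG GCGAACCAGA TTGCCGCAGG TGAGGTGGTC"
    print(translate(first_row))  # MPIQVLPPQLANQIAAGEVV
```

The printed residues match the first two blocks of the protein listing (MPIQVLPPQL ANQIAAGEVV).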