Gene EcolC_3843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3843 
SymbolmutL 
ID6066878 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4197817 
End bp4199664 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content56% 
IMG OID641603255 
ProductDNA mismatch repair protein 
Protein accessionYP_001726774 
Protein GI170021820 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000795964 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000387336 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCAATTC AGGTCTTACC GCCACAACTG GCGAACCAGA TTGCCGCAGG TGAGGTGGTC 
GAGCGACCTG CGTCGGTAGT CAAAGAACTG GTGGAAAACA GCCTCGATGC AGGTGCGACG
CGTATCGATA TTGATATCGA ACGCGGTGGG GCGAAAATTA TCCGCATTCG TGATAACGGC
TGCGGTATCA AAAAAGATGA GCTGGCGCTG GCGCTGGCTC GTCATGCCAC CAGTAAAATC
GCCTCTCTGG ACGATCTCGA AGCCATTATC AGCCTGGGCT TTCGCGGTGA GGCGCTGGCG
AGTATCAGTT CGGTTTCCCG CCTGACGCTC ACTTCACGCA CCGCAGAACA GCAGGAAGCC
TGGCAGGCCT ATGCCGAAGG GCGCGATATG GACGTGACGG TTAAACCGGC GGCGCATCCG
GTGGGAACGA CGCTGGAGGT GCTGGATCTG TTCTACAACA CCCCGGCGCG GCGCAAATTC
CTGCGCACCG AGAAAACCGA ATTTAACCAT ATCGATGAAA TCATCCGCCG CATTGCGCTG
GCGCGTTTCG ACGTTACGAT CAACCTGTCG CATAACGGTA AAATTGTGCG TCAGTACCGT
GCAGTGCCGG AAGGCGGGCA AAAAGAACGG CGCTTAGGCG CGATTTGCGG CACCGCTTTT
CTCGAACAAG CGCTGGCGAT TGAATGGCAA CACGGCGATC TCACCCTGCG CGGCTGGGTG
GCCGATCCGA ATCACACCAC GCCCGCCCTG GCGGAAATTC AGTATTGCTA CGTGAATGGT
CGCATGATGC GCGATCGCCT GATCAATCAC GCGATCCGCC AGGCTTGCGA AGACAAACTG
GGGGCCGATC AGCAACCGGC ATTTGTGTTG TATCTGGAGA TCGACCCACA TCAGGTGGAC
GTTAACGTGC ACCCCGCCAA ACACGAAGTG CGTTTTCATC AGTCGCGTCT GGTGCATGAC
TTTATCTATC AGGGCGTGCT GAGCGTGCTA CAACAGCAGC TGGAAACGCC GCTACCGCTG
GACGATGAAC CCCAACCTGC ACCGCGTGCT ATTCCGGAAA ACCGCGTGGC GGCGGGGCGC
AATCATTTTG CTGAACCGGC AGCTCGTGAG CCGGTAGCTC CGCGTTACAC TCCTGCGCCC
GCCTCAGGCA GTCGTCCGGC TGCCCCCTGG CCGAATGCGC AGCCAGGTTA CCAGAAACAG
CAAGGTGAAG TGTATCGCCA GCTTTTGCAA ACGCCCGCGC CGATGCAAAA ACCAAAAGCG
CCAGAACCGC AGGAACCTGC ACTTGCGGCG AACAGTCAGA GTTTTGGTCG GGTACTGACT
ATCGTCCATT CCGACTGTGC GTTGCTGGAG CGCGACGGCA ACATTTCACT TTTAGCCTTG
TCAGTGGCAG AACGTTGGCT GCGTCAGGCA CAATTGACGC CGGGTGAAGC GCCCGTTTGC
GCCCAGCCGT TGCTGATTCC GTTGAGGCTA AAAGTTTCTG GCGAAGAAAA ATCGGCATTA
GAAAAAGCGC AGTCTGCCCT GGCGGAATTG GGTATTGATT TCCAGTCAGA TGCACAGCAT
GTGACCATCA GGGCCGTGCC TTTACCCTTA CGCCAACAAA ATTTACAAAT CTTGATTCCT
GAACTGATAG GCTACCTGGC GAAGCAGTCC GTATTCGAAC CTGGCAATAT TGCGCAGTGG
ATTGCACGAA ATCTGATGAG CGAACATGCG CAGTGGTCAA TGGCACAGGC CATAACCCTG
CTGGCGGACG TGGAACGGTT ATGTCCGCAA CTTGTGAAAA CGCCGCCGGG TGGTCTGTTA
CAATCTGTTG ATTTACATCC GGCGATAAAA GCCCTGAAAG ATGAGTGA
 
Protein sequence
MPIQVLPPQL ANQIAAGEVV ERPASVVKEL VENSLDAGAT RIDIDIERGG AKIIRIRDNG 
CGIKKDELAL ALARHATSKI ASLDDLEAII SLGFRGEALA SISSVSRLTL TSRTAEQQEA
WQAYAEGRDM DVTVKPAAHP VGTTLEVLDL FYNTPARRKF LRTEKTEFNH IDEIIRRIAL
ARFDVTINLS HNGKIVRQYR AVPEGGQKER RLGAICGTAF LEQALAIEWQ HGDLTLRGWV
ADPNHTTPAL AEIQYCYVNG RMMRDRLINH AIRQACEDKL GADQQPAFVL YLEIDPHQVD
VNVHPAKHEV RFHQSRLVHD FIYQGVLSVL QQQLETPLPL DDEPQPAPRA IPENRVAAGR
NHFAEPAARE PVAPRYTPAP ASGSRPAAPW PNAQPGYQKQ QGEVYRQLLQ TPAPMQKPKA
PEPQEPALAA NSQSFGRVLT IVHSDCALLE RDGNISLLAL SVAERWLRQA QLTPGEAPVC
AQPLLIPLRL KVSGEEKSAL EKAQSALAEL GIDFQSDAQH VTIRAVPLPL RQQNLQILIP
ELIGYLAKQS VFEPGNIAQW IARNLMSEHA QWSMAQAITL LADVERLCPQ LVKTPPGGLL
QSVDLHPAIK ALKDE