Gene EcDH1_3823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3823 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4115322 
End bp4117169 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content56% 
IMG OID 
ProductDNA mismatch repair protein MutL 
Protein accessionACX41425 
Protein GI260451003 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.000414583 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAATTC AGGTCTTACC GCCACAACTG GCGAACCAGA TTGCCGCAGG TGAGGTGGTC 
GAGCGACCTG CGTCGGTAGT CAAAGAACTA GTGGAAAACA GCCTCGATGC AGGTGCGACG
CGTATCGATA TTGATATCGA ACGCGGTGGG GCGAAACTTA TCCGCATTCG TGATAACGGC
TGCGGTATCA AAAAAGATGA GCTGGCGCTG GCGCTGGCTC GTCATGCCAC CAGTAAAATC
GCCTCTCTGG ACGATCTCGA AGCCATTATC AGCCTGGGCT TTCGCGGTGA GGCGCTGGCG
AGTATCAGTT CGGTTTCCCG CCTGACGCTC ACTTCACGCA CCGCAGAACA GCAGGAAGCC
TGGCAGGCCT ATGCCGAAGG GCGCGATATG AACGTGACGG TAAAACCGGC GGCGCATCCT
GTGGGGACGA CGCTGGAGGT GCTGGATCTG TTCTACAACA CCCCGGCGCG GCGCAAATTC
CTGCGCACCG AGAAAACCGA ATTTAACCAC ATTGATGAGA TCATCCGCCG CATTGCGCTG
GCGCGTTTCG ACGTCACGAT CAACCTGTCG CATAACGGTA AAATTGTGCG TCAGTACCGC
GCAGTGCCGG AAGGCGGGCA AAAAGAACGG CGCTTAGGCG CGATTTGCGG CACCGCTTTT
CTTGAACAAG CGCTGGCGAT TGAATGGCAA CACGGCGATC TCACGCTACG CGGCTGGGTG
GCCGATCCAA ATCACACCAC GCCCGCACTG GCAGAAATTC AGTATTGCTA CGTGAACGGT
CGCATGATGC GCGATCGCCT GATCAATCAC GCGATCCGCC AGGCCTGCGA AGACAAACTG
GGGGCCGATC AGCAACCGGC ATTTGTGTTG TATCTGGAGA TCGACCCACA TCAGGTGGAC
GTCAACGTGC ACCCCGCCAA ACACGAAGTG CGTTTCCATC AGTCGCGTCT GGTGCATGAT
TTTATCTATC AGGGCGTGCT GAGCGTGCTA CAACAGCAAC TGGAAACGCC GCTACCGCTG
GACGATGAAC CCCAACCTGC ACCGCGTTCC ATTCCGGAAA ACCGCGTGGC GGCGGGGCGC
AATCACTTTG CAGAACCGGC AGCTCGTGAG CCGGTAGCTC CGCGCTACAC TCCTGCGCCA
GCATCAGGCA GTCGTCCGGC TGCCCCCTGG CCGAATGCGC AGCCAGGCTA CCAGAAACAG
CAAGGTGAAG TGTATCGCCA GCTTTTGCAA ACGCCCGCGC CGATGCAAAA ATTAAAAGCG
CCGGAACCGC AGGAACCTGC ACTTGCGGCG AACAGTCAGA GTTTTGGTCG GGTACTGACT
ATCGTCCATT CCGACTGTGC GTTGCTGGAG CGCGACGGCA ACATTTCACT TTTATCCTTG
CCAGTGGCAG AACGTTGGCT GCGTCAGGCA CAATTGACGC CGGGTGAAGC GCCCGTTTGC
GCCCAGCCGC TGCTGATTCC GTTGCGGCTA AAAGTTTCTG CCGAAGAAAA ATCGGCATTA
GAAAAAGCGC AGTCTGCCCT GGCGGAATTG GGTATTGATT TCCAGTCAGA TGCACAGCAT
GTGACCATCA GGGCAGTGCC TTTACCCTTA CGCCAACAAA ATTTACAAAT CTTGATTCCT
GAACTGATAG GCTACCTGGC GAAGCAGTCC GTATTCGAAC CTGGCAATAT TGCGCAGTGG
ATTGCACGAA ATCTGATGAG CGAACATGCG CAGTGGTCAA TGGCACAGGC CATAACCCTG
CTGGCGGACG TGGAACGGTT ATGTCCGCAA CTTGTGAAAA CGCCGCCGGG TGGTCTGTTA
CAATCTGTTG ATTTACATCC GGCGATAAAA GCCCTGAAAG ATGAGTGA
 
Protein sequence
MPIQVLPPQL ANQIAAGEVV ERPASVVKEL VENSLDAGAT RIDIDIERGG AKLIRIRDNG 
CGIKKDELAL ALARHATSKI ASLDDLEAII SLGFRGEALA SISSVSRLTL TSRTAEQQEA
WQAYAEGRDM NVTVKPAAHP VGTTLEVLDL FYNTPARRKF LRTEKTEFNH IDEIIRRIAL
ARFDVTINLS HNGKIVRQYR AVPEGGQKER RLGAICGTAF LEQALAIEWQ HGDLTLRGWV
ADPNHTTPAL AEIQYCYVNG RMMRDRLINH AIRQACEDKL GADQQPAFVL YLEIDPHQVD
VNVHPAKHEV RFHQSRLVHD FIYQGVLSVL QQQLETPLPL DDEPQPAPRS IPENRVAAGR
NHFAEPAARE PVAPRYTPAP ASGSRPAAPW PNAQPGYQKQ QGEVYRQLLQ TPAPMQKLKA
PEPQEPALAA NSQSFGRVLT IVHSDCALLE RDGNISLLSL PVAERWLRQA QLTPGEAPVC
AQPLLIPLRL KVSAEEKSAL EKAQSALAEL GIDFQSDAQH VTIRAVPLPL RQQNLQILIP
ELIGYLAKQS VFEPGNIAQW IARNLMSEHA QWSMAQAITL LADVERLCPQ LVKTPPGGLL
QSVDLHPAIK ALKDE