Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A4756 |
Symbol | mutL |
ID | 6872371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 4614209 |
End bp | 4616065 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642787649 |
Product | DNA mismatch repair protein |
Protein accession | YP_002218243 |
Protein GI | 198245330 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGATTC AGGTTCTGCC GCCGCAGCTT GCGAACCAAA TCGCCGCTGG CGAAGTGGTG GAACGCCCTG CGTCGGTTGT TAAAGAGCTG GTAGAGAATA GTCTGGATGC AGGCGCCACC CGCGTTGATA TCGACATTGA GCGTGGCGGC GCGAAGCTTA TTCGTATTCG CGACAATGGC TGCGGCATTA AAAAAGAGGA GCTGGCGCTG GCGCTGGCCC GTCATGCCAC CAGTAAAATC GCCTCGCTTG ACGATCTGGA AGCGATTATC AGTCTGGGAT TTCGCGGCGA AGCGCTGGCG AGTATCAGTT CGGTCTCGCG TTTGACGCTA ACGTCGCGCA CGGCGGAGCA GGCGGAAGCC TGGCAGGCGT ATGCGGAAGG GCGTGACATG GACGTGACGG TAAAACCCGC CGCGCACCCG GTCGGCACCA CCCTGGAAGT TCTGGATCTC TTTTACAATA CGCCCGCCCG GCGCAAATTC ATGCGTACCG AAAAAACGGA ATTTAATCAT ATCGATGAGA TCATCCGTCG TATTGCATTG GCCCGTTTTG ACGTCACGCT TAACCTGTCG CACAACGGCA AATTGGTACG GCAGTATCGC GCTGTCGCAA AGGACGGGCA AAAAGAGCGC CGGTTAGGCG CCATCTGCGG CACGCCGTTT CTCGAACAGG CACTGGCGAT CGAGTGGCAG CATGGCGATC TGACCCTGCG CGGCTGGGTC GCCGATCCGA ATCACACCAC CACGGCGTTA ACGGAGATCC AGTACTGCTA TGTGAATGGC CGCATGATGC GCGACCGCTT GATCAACCAT GCCATTCGCC AGGCCTGTGA AGATAAGCTG GGCGCGGACC AACAGCCTGC GTTTGTGTTG TATCTGGAGA TTGACCCGCA TCAGGTGGAT GTCAATGTTC ATCCCGCCAA GCACGAAGTG CGTTTTCATC AATCCCGGCT GGTGCACGAC TTCATCTATC AAGGGGTGCT GAGCGTCCTG CAACAGCAGA CGGAAACGAC GCTGCCGCTG GAGGAGATTG CGCCAGCGCC GCGGCATGTC CCGGAAAACC GTATCGCCGC CGGGCGCAAC CATTTTGCTG TACCCGCCGA GCCAACTGCG GCGCGCGAGC CCGCGACACC GCGTTATTCC GGCGGCGCAT CGGGCGGCAA CGGCGGGCGT CAGTCCGCGG GTGGTTGGCC GCACGCTCAG CCAGGTTATC AGAAGCAGCA GGGCGAGGTT TATCGCGCGC TTTTACAGAC GCCGGCGACG AGCCCCGCGC CGGAGCCGGT TGCGCCTGCG CTTGACGGAC ATAGCCAGAG TTTTGGTCGC GTACTGACGA TAGTCGGCGG TGACTGTGCG TTGCTGGAAC ACGCGGGGAC TATCCAGCTC TTGTCGCTGC CGGTTGCGGA GCGTTGGCTG CGTCAGGCGC AGCTTACACC GGGTCAAAGT CCGGTTTGCG CGCAGCCGTT GCTGATTCCG CTGCGTTTAA AAGTGAGCGC CGATGAAAAA GCCGCGCTGC AAAAAGCCCA ATCTTTGTTG GGAGAATTGG GTATTGAATT TCAGTCAGAT GCGCAGCATG TGACCATTCG GGCAGTGCCT TTACCCTTAC GACAACAAAA TTTACAAATC TTGATTCCTG AACTGATAGG CTACCTGGCG CAACAGACCA CATTTGCAAC GGTCAATATT GCACAATGGA TAGCGCGTAA TGTGCAGAGC GAACATCCGC AGTGGTCGAT GGCGCAGGCC ATATCGCTGC TGGCGGATGT TGAGCGGCTA TGTCCGCAGC TGGTAAAAGC GCCGCCGGGT GGCCTGTTAC AACCTGTTGA TTTACATTCG GCGATGAACG CCCTGAAGCA TGAATGA
|
Protein sequence | MPIQVLPPQL ANQIAAGEVV ERPASVVKEL VENSLDAGAT RVDIDIERGG AKLIRIRDNG CGIKKEELAL ALARHATSKI ASLDDLEAII SLGFRGEALA SISSVSRLTL TSRTAEQAEA WQAYAEGRDM DVTVKPAAHP VGTTLEVLDL FYNTPARRKF MRTEKTEFNH IDEIIRRIAL ARFDVTLNLS HNGKLVRQYR AVAKDGQKER RLGAICGTPF LEQALAIEWQ HGDLTLRGWV ADPNHTTTAL TEIQYCYVNG RMMRDRLINH AIRQACEDKL GADQQPAFVL YLEIDPHQVD VNVHPAKHEV RFHQSRLVHD FIYQGVLSVL QQQTETTLPL EEIAPAPRHV PENRIAAGRN HFAVPAEPTA AREPATPRYS GGASGGNGGR QSAGGWPHAQ PGYQKQQGEV YRALLQTPAT SPAPEPVAPA LDGHSQSFGR VLTIVGGDCA LLEHAGTIQL LSLPVAERWL RQAQLTPGQS PVCAQPLLIP LRLKVSADEK AALQKAQSLL GELGIEFQSD AQHVTIRAVP LPLRQQNLQI LIPELIGYLA QQTTFATVNI AQWIARNVQS EHPQWSMAQA ISLLADVERL CPQLVKAPPG GLLQPVDLHS AMNALKHE
|
| |