Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeSA_A4627 |
Symbol | mutL |
ID | 6519058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | + |
Start bp | 4500976 |
End bp | 4502832 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642749567 |
Product | DNA mismatch repair protein |
Protein accession | YP_002117300 |
Protein GI | 194734797 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGATTC AGGTTCTGCC GCCGCAGCTT GCGAACCAAA TCGCCGCTGG CGAAGTGGTG GAACGCCCTG CGTCGGTTGT TAAAGAGCTG GTAGAGAATA GTCTGGATGC AGGCGCCACC CGCGTTGATA TCGACATTGA GCGTGGCGGC GCGAAGCTTA TTCGTATTCG CGACAATGGC TGCGGCATTA AAAAAGAGGA GCTGGCGCTG GCGCTGGCTC GTCATGCCAC CAGTAAGATC GCCTCGCTTG ACGATCTGGA AGCGATTATC AGTCTGGGAT TTCGCGGCGA AGCGCTGGCG AGTATCAGTT CGGTCTCGCG TTTGACGCTA ACGTCGCGCA CGGCGGAGCA GGCGGAAGCC TGGCAGGCGT ATGCGGAAGG GCGTGACATG GACGTGACGG TAAAACCCGC CGCGCACCCG GTCGGCACCA CCCTGGAAGT TCTGGATCTC TTTTATAATA CGCCCGCCCG GCGCAAATTC ATGCGTACCG AAAAAACGGA ATTTAATCAC ATCGATGAGA TCATCCGTCG TATTGCATTG GCCCGTTTTG ACGTCACGCT TAACCTGTCG CACAACGGCA AATTGGTACG GCAGTATCGC GCTGTCGCAA AGGACGGGCA AAAAGAGCGC CGGTTAGGCG CCATCTGCGG GACGCCGTTT CTCGAACAGG CACTGGCGAT CGAGTGGCAG CATGGCGATC TGACCCTGCG CGGCTGGGTC GCCGATCCGA ATCACACCAC CACGGCGTTA ACGGAGATCC AGTACTGCTA TGTGAATGGC CGCATGATGC GCGACCGCTT GATCAACCAT GCCATTCGCC AGGCCTGTGA AGATAAACTG GGCGCGGACC AACAGCCTGC GTTTGTGTTG TATCTGGAGA TTGACCCGCA TCAGGTGGAT GTCAATGTTC ATCCCGCCAA GCACGAAGTG CGTTTTCATC AATCCCGGCT GGTGCACGAC TTCATCTATC AAGGGGTGCT GAGCGTTCTG CAACAGCAGA CGGAAACGAC GCTGCCGCTG GAGGATATTG CGCCAGCGCC GCGGCATGTC CCGGAAAACC GTATCGCCGC CGGGCGCAAC CATTTTGCTG TACCCGCCGA GCCAACTGCG GCGCGCGAGC CCGCGACACC GCGTTATTCC GGCGGCGCAT CGGGCGGTAA CGGCGGGCGT CAGTCCGCAG GTGGTTGGCC GCACGCGCAG CCAGGTTATC AGAAGCAGCA GGGCGAGGTT TATCGCACGC TTTTACAGAC GCCGGCGACG AGCCCCGCGC CGGAGCCGGT TGCGCCTGCG CTTGACGGAC ATAGCCAGAG TTTCGGTCGC GTACTGACAA TAGTCGGCGG TGACTGCGCG TTGCTGGAAC ACGCGGGGAC TATCCAGCTC TTGTCGCTGC CGGTTGCGGA GCGTTGGCTG CGTCAGGCGC AGCTTACACC GGGTCAAAGT CCGGTTTGCG CGCAGCCGTT GCTGATTCCG CTGCGTTTAA AAGTGAGCGC CGATGAAAAA GTCGCGCTGC AAAAAGCCCA ATCTTTGTTG GGAGAATTGG GTATTGAATT TCAGTCAGAT GCGCAGCATG TGACCATTCG GGCGGTGCCT TTACCCTTAC GACAACAAAA TTTACAAATC TTGATTCCTG AACTGATAGG CTACCTGGCG CAACAGACCA CATTTGCAAC GGTCAATATT GCACAATGGA TAGCGCGTAA TGTGCAGAGC GAACATCCGC AGTGGTCGAT GGCGCAGGCC ATATCGCTGC TGGCGGATGT TGAGCGGCTA TGTCCGCAGC TGGTAAAAGC GCCGCCGGGT GGCCTGTTAC AACCTGTTGA TTTACATTCG GCGATGAACG CCCTGAAGCA TGAATGA
|
Protein sequence | MPIQVLPPQL ANQIAAGEVV ERPASVVKEL VENSLDAGAT RVDIDIERGG AKLIRIRDNG CGIKKEELAL ALARHATSKI ASLDDLEAII SLGFRGEALA SISSVSRLTL TSRTAEQAEA WQAYAEGRDM DVTVKPAAHP VGTTLEVLDL FYNTPARRKF MRTEKTEFNH IDEIIRRIAL ARFDVTLNLS HNGKLVRQYR AVAKDGQKER RLGAICGTPF LEQALAIEWQ HGDLTLRGWV ADPNHTTTAL TEIQYCYVNG RMMRDRLINH AIRQACEDKL GADQQPAFVL YLEIDPHQVD VNVHPAKHEV RFHQSRLVHD FIYQGVLSVL QQQTETTLPL EDIAPAPRHV PENRIAAGRN HFAVPAEPTA AREPATPRYS GGASGGNGGR QSAGGWPHAQ PGYQKQQGEV YRTLLQTPAT SPAPEPVAPA LDGHSQSFGR VLTIVGGDCA LLEHAGTIQL LSLPVAERWL RQAQLTPGQS PVCAQPLLIP LRLKVSADEK VALQKAQSLL GELGIEFQSD AQHVTIRAVP LPLRQQNLQI LIPELIGYLA QQTTFATVNI AQWIARNVQS EHPQWSMAQA ISLLADVERL CPQLVKAPPG GLLQPVDLHS AMNALKHE
|
| |