Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B4637 |
Symbol | mutL |
ID | 6795923 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 4534359 |
End bp | 4536215 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642778711 |
Product | DNA mismatch repair protein |
Protein accession | YP_002149273 |
Protein GI | 197247369 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000512951 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGATTC AGGTTCTGCC GCCGCAGCTT GCGAACCAAA TCGCCGCTGG CGAAGTGGTG GAACGCCCTG CGTCGGTTGT TAAAGAGCTG GTAGAGAATA GTCTGGATGC AGGCGCCACC CGCGTTGATA TCGACATCGA GCGTGGCGGC GCGAAGCTTA TTCGTATTCG CGACAATGGC TGCGGCATTA AAAAAGAGGA GCTGGCGCTG GCGCTGGCCC GTCATGCCAC CAGTAAAATC GCCTCGCTTG ACGATCTGGA AGCGATTATC AGTCTGGGAT TTCGCGGCGA AGCGCTGGCC AGTATCAGTT CGGTCTCGCG TTTGACGCTA ACGTCGCGCA CGGCGGAGCA GGCGGAAGCC TGGCAGGCGT ATGCGGAAGG GCGCGACATG GACGTGACGG TAAAACCCGC CGCGCACCCG GTCGGCACCA CCCTGGAAGT TCTGGATCTC TTTTATAATA CGCCCGCCCG GCGTAAATTC ATGCGTACCG AAAAAACGGA ATTTAATCAC ATCGATGAGA TCATCCGTCG TATTGCATTG GCCCGTTTTG ACGTCACGCT TAACCTGTCG CACAACGGCA AATTGGTACG GCAGTATCGC GCTGTCGCAA AGGACGGGCA AAAAGAGCGC CGGTTAGGCG CCATCTGCGG CACGCCGTTT CTCGAACAGG CACTGGCGAT CGAGTGGCAG CATGGCGATC TGACCCTGCG CGGCTGGGTC GCCGATCCGA ATCACACCAC CACGGCGTTA ACGGAGATCC AGTACTGCTA TGTGAATGGC CGCATGATGC GCGACCGCTT GATCAATCAT GCCATTCGCC AGGCCTGTGA AGATAAACTG GGCGCGGACC AACAGCCTGC GTTTGTGTTG TATCTGGAGA TTGACCCGCA TCAGGTGGAT GTCAATGTTC ATCCCGCCAA GCACGAAGTG CGTTTTCATC AATCCCGGCT GGTGCACGAC TTCATCTATC AAGGGGTGCT GAGCGTTCTG CAACAGCAGA CGGAAACGAC GCTGCCGCTT GAGGACATTG CGCCAGCGCC GCGGCATGTC CCGGAAAACC GTATCGCCGC CGGGCGCAAC CATTTTGCTG TACCCGCCGA GCCAACTGCG GCGCGCGAGC CCGCGACACC GCGTTATTCC GGCGGCACAT CGGGCGGTAA CGGCGGGCGT CAGTCCGCAG GTGGTTGGCC GCACGCGCAG CCAGGTTATC AGAAGCCGCA GGGCGAGGTT TATCGCACGC TTTTACAGAC GCCGGCGACG AGCCCCGCGC CGGAGCCGGT TGCGCCTGCG CTTGACGGAC ATAGCCAGAG TTTTGGTCGC GTACTGACGA TAGTCGGCGG TGACTGCGCG TTGCTGGAAC ACGCGGGGAC TATCCAGCTC TTGTCGCTGC CGGTTGCGGA GCGTTGGCTG CGTCAGGCGC AGCTTACACC GGGTCAAAGC CCGGTTTGCG CGCAGCCGTT GCTGATTCCG CTGCGTTTAA AAGTGAGCGC CGATGAAAAA GCCGCGCTGC AACAAGCCCA ATCTTTGTTG GGAGAATTGG GTATTGAATT TCAGTCAGAT GCGCAGCATG TGACCATTCG GGCGGTGCCT TTACCCTTAC GACAACAAAA TTTACAAATC TTGATTCCTG AACTGATAGG CTACCTGGCG CAGCAGACCA CATTTGCAAC GGTCAATATT GCACAATGGA TAGCGCGTAA TGTGCAGAGC GAACATCCGC AGTGGTCGAT GGCGCAGGCC ATATCGCTGC TGGCGGATGT TGAGCGGCTA TGTCCGCAGC TGGTAAAAGC GCCGCCGGGT GGCCTGTTAC AACCTGTTGA TTTACATTCG GCGATGAACG CCCTGAAGCA TGAATGA
|
Protein sequence | MPIQVLPPQL ANQIAAGEVV ERPASVVKEL VENSLDAGAT RVDIDIERGG AKLIRIRDNG CGIKKEELAL ALARHATSKI ASLDDLEAII SLGFRGEALA SISSVSRLTL TSRTAEQAEA WQAYAEGRDM DVTVKPAAHP VGTTLEVLDL FYNTPARRKF MRTEKTEFNH IDEIIRRIAL ARFDVTLNLS HNGKLVRQYR AVAKDGQKER RLGAICGTPF LEQALAIEWQ HGDLTLRGWV ADPNHTTTAL TEIQYCYVNG RMMRDRLINH AIRQACEDKL GADQQPAFVL YLEIDPHQVD VNVHPAKHEV RFHQSRLVHD FIYQGVLSVL QQQTETTLPL EDIAPAPRHV PENRIAAGRN HFAVPAEPTA AREPATPRYS GGTSGGNGGR QSAGGWPHAQ PGYQKPQGEV YRTLLQTPAT SPAPEPVAPA LDGHSQSFGR VLTIVGGDCA LLEHAGTIQL LSLPVAERWL RQAQLTPGQS PVCAQPLLIP LRLKVSADEK AALQQAQSLL GELGIEFQSD AQHVTIRAVP LPLRQQNLQI LIPELIGYLA QQTTFATVNI AQWIARNVQS EHPQWSMAQA ISLLADVERL CPQLVKAPPG GLLQPVDLHS AMNALKHE
|
| |