Gene Information |
Locus tag | SNSL254_A4720 |
Symbol | mutL |
ID | 6484080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 4600870 |
End bp | 4602726 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642739936 |
Product | DNA mismatch repair protein |
Protein accession | YP_002043614 |
Protein GI | 194445394 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
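The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the helper names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
# Values copied from the record above.
START_BP = 4600870
END_BP = 4602726


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a complete, unspliced CDS: codons minus one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1857 618
```

Both results match the Gene Length (1857 bp) and Protein Length (618 aa) fields reported in the record.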
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 84 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGATTC AGGTTCTGCC GCCGCAGCTT GCGAACCAAA TCGCCGCTGG CGAAGTGGTG GAACGCCCTG CGTCGGTTGT TAAAGAGCTG GTAGAGAATA GTCTGGATGC AGGCGCCACC CGCGTTGATA TCGACATCGA GCGTGGCGGC GCGAAGCTTA TTCGTATTCG CGACAATGGC TGCGGCATTA AAAAAGAGGA GCTGGCGCTG GCGCTGGCCC GTCATGCCAC CAGTAAAATC GCCTCGCTTG ACGATCTGGA AGCGATTATC AGTCTGGGAT TTCGCGGCGA AGCGCTGGCC AGTATCAGTT CGGTCTCGCG TTTGACGCTA ACGTCGCGCA CGGCGGAGCA GGCGGAAGCC TGGCAGGCGT ATGCGGAAGG GCGCGACATG GACGTGACGG TAAAACCCGC CGCGCACCCG GTCGGCACCA CCCTGGAAGT TCTGGATCTC TTTTATAATA CGCCCGCCCG GCGCAAATTC ATGCGTACCG AAAAAACGGA ATTTAATCAC ATCGATGAGA TCATCCGTCG TATTGCATTG GCCCGTTTTG ACGTCACTTT TAACCTGTCG CACAACGGCA AATTGGTACG GCAGTATCGC GCTGTCGCAA AGGACGGGCA AAAAGAGCGC CGGTTAGGCG CCATCTGCGG CACGCCGTTT CTCGAACAGG CACTGGCGAT CGAGTGGCAG CATGGCGATC TGACCCTGCG CGGCTGGGTC GCCGATCCGA ATCACACCAC CACGGCGTTA ACGGAGATCC AGTACTGTTA TGTGAATGGC CGCATGATGC GCGACCGCTT GATCAACCAT GCCATTCGCC AGGCCTGTGA AGATAAACTG GGCGCGGACC AACAGCCTGC GTTTGTGTTG TATCTGGAGA TTGACCCGCA TCAGGTGGAT GTCAATGTTC ATCCCGCCAA GCACGAAGTG CGTTTTCATC AATCGCGGCT GGTGCACGAC TTCATCTATC AAGGGGTGCT GAGCGTTCTA CAACAGCAGA CGGAAACGAC GCTGCCGCTG GAGGAGATTG CGCCAGCGCC GCGGCATGTC CCGGAAAACC GTATCGCCGC CGGGCGCAAC CATTTTGCTG TACCCGCCGA GCCAACTGCG GCGCGCGAGC CCGCGACACC GCGTTATTCC GGCGGCGCAT CGGGCGGTAA CGGCGGGCGT CAGTCCGCAG GTGGTTGGCC GCACGCGCAG CCAGGTTATC AGAAGCAGCA GGGCGAGGTT TATCGCGCGC TTTTACAGAC GCCGACGACG AGCCCCGCGC CGGAGCCGGT TGCGCCTGCG CTTGACGGAC ATAGCCAGAG TTTCGGTCGC GTACTGACAA TAGTCGGCGG TGACTGCGCG TTGCTGGAAC ACGCGGGGAC TATCCAGCTC TTGTCGCTGC CGGTTGCGGA GCGTTGGCTG CGTCAGGCGC AGCTTACACC GGGTCAAAGC CCGGTTTGCG CGCAGCCGTT GCTGATTCCG CTGCGTTTAA AAGTGAGCGC CGATGAAAAA GACGCGCTGC AACAAGCCCA ATCTTTATTG GGAGAATTGG GTATTGAATT TCAGTCAGAT GCGCAGCATG TGACCATTCG GGCGGTGCCT TTACCCTTAC GACAACAAAA TTTACAAATC TTGATTCCTG AACTGATAGG CTACCTGGCG CAACAGACCA CATTTGCAAC GGTCAATATT GCACAATGGA TAGCGCGTAA TGTGCAGAGC GAACATCCGC AGTGGTCGAT GGCGCAGGCC ATATCGCTGC TGGCGGATGT TGAGCGGCTA TGTCCGCAGC TGGTAAAAGC GCCGCCGGGT 
GGCCTGTTAC AACCTGTTGA TTTACATTCG GCGATGAACG CCCTGAAGCA TGAATGA
|
Protein sequence | MPIQVLPPQL ANQIAAGEVV ERPASVVKEL VENSLDAGAT RVDIDIERGG AKLIRIRDNG CGIKKEELAL ALARHATSKI ASLDDLEAII SLGFRGEALA SISSVSRLTL TSRTAEQAEA WQAYAEGRDM DVTVKPAAHP VGTTLEVLDL FYNTPARRKF MRTEKTEFNH IDEIIRRIAL ARFDVTFNLS HNGKLVRQYR AVAKDGQKER RLGAICGTPF LEQALAIEWQ HGDLTLRGWV ADPNHTTTAL TEIQYCYVNG RMMRDRLINH AIRQACEDKL GADQQPAFVL YLEIDPHQVD VNVHPAKHEV RFHQSRLVHD FIYQGVLSVL QQQTETTLPL EEIAPAPRHV PENRIAAGRN HFAVPAEPTA AREPATPRYS GGASGGNGGR QSAGGWPHAQ PGYQKQQGEV YRALLQTPTT SPAPEPVAPA LDGHSQSFGR VLTIVGGDCA LLEHAGTIQL LSLPVAERWL RQAQLTPGQS PVCAQPLLIP LRLKVSADEK DALQQAQSLL GELGIEFQSD AQHVTIRAVP LPLRQQNLQI LIPELIGYLA QQTTFATVNI AQWIARNVQS EHPQWSMAQA ISLLADVERL CPQLVKAPPG GLLQPVDLHS AMNALKHE
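The gene and protein sequences above are displayed in space-separated blocks of ten characters. As a rough sketch of how the two relate, the code below strips that spacing, translates codons with the standard amino acid assignments (which translation table 11 shares with table 1; the tables differ in their permitted start codons), and computes a GC percentage. It does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The function names `clean`, `translate`, and `gc_percent` are hypothetical helpers introduced for illustration.

```python
# Compact encoding of the standard codon assignments, in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGCCGATTC AGGTTCTGCC GCCGCAGCTT"
print(translate(first_blocks))  # prints: MPIQVLPPQL
```

The printed peptide matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.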