Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4701 |
Symbol | mutL |
ID | 6273284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 4392961 |
End bp | 4394808 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641728466 |
Product | DNA mismatch repair protein |
Protein accession | YP_001882861 |
Protein GI | 187733395 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0915673 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAATTC AGGTCTTACC GCCACAACTG GCGAACCAGA TTGCCGCAGG TGAGGTGGTC GAGCGACCTG CGTCGGTAGT CAAAGAACTG GTGGAAAACA GCCTCGATGC AGGTGCGACG CGTATCGATA TTGATATCGA ACGCGGTGGG GCGAAACTTA TCCGCATTCG TGATAACGGC TGCGGTATCA AAAAAGATGA GCTGGCGCTG GCGCTGGCTC GTCATGCCAC CAGTAAAATC GCCTCTCTGG ACGATCTCGA AGCCATTATC AGCCTGGGCT TTCGCGGTGA GGCGCTGGCG AGTATCAGTT CGGTTTCCCG CCTGACGCTC ACTTCACGCA CCGCAGAACA GCAGGAAGCC TGGCAGGCCT ATGCCGAAGG GCGCGATATG AACGTGACGG TAAAACCGGC GGCGCATCCT GTGGGGACGA CGCTGGAGGT GCTGGATCTG TTCTACAACA CCCCGGCGCG GCGCAAATTC CTGCGCACCG AGAAAACCGA ATTTAACCAC ATTGATGAGA TCATCCGCCG CATTGCGCTG GCGCGTTTCG ACGTCACGAT CAACCTGTCG CATAACGGTA AAATTGTGCG TCAGTACCGC GCAGTGCCGG AAGGCGGGCA AAAAGAACGG CGCTTAGGCG CGATTTGCGG CACTGCTTTT CTTGAACAAG CGCTGGCGAT TGAATGGCAA CACGGCGATC TCACGCTACG CGGCTGGGTG GCCGATCCAA ATCACACCAC GCCCGCACTG GCAGAAATTC AGTATTGCTA CGTGAACGGT CGCATGATGC GCGATCGCCT GATCAATCAC GCGATCCGCC AGGCCTGCGA AGACAAACTG GGGGCCGATC AGCAACCGGC ATTTGTGCTG TATCTGGAGA TCGATCCGCA TCAGGTGGAC GTCAACGTGC ACCCCGCCAA ACACGAAGTG CGTTTCCATC AGTCGCGTCT GGTGCATGAT TTTATCTATC AGGGCGTGCT GAGCGTGCTA CAACAGCAAC TGGAAACGCC GCTACCGCTG GACGATGAAC CCCAACCTGC ACCGCGTTCC ATTCCGGAAA ACCGCGTGGC GGCGGGGCGC AATCACTTTG CAGAACCGGC AGCTCGTGAG CCGGTAGCTC CGCGCTACAC TCCTGCGCCA GCATCAGGCA GTCGTCCGGC TGCCCCTTGG CCGAATGCGC AGCCAGGCTA CCAGAAACAG CAAGGTGAAG TGTATCGCCA GCTTTTGCAA ACGCCCGCGC CGATGCAAAA ATTAAAAGCG CCGGAACCGC AGGAACCTGC ACTTGCGGCG AACAGTCAGA GTTTTGGTCG GGTACTGACT ATCGTCCATT CCGACTGTGC GTTGCTGGAG CGCGACGGCA ACATTTCACT TTTAGCCTTG CCAGTGGCAG AACGTTGGCT ACGTCAGGTA CAACTGACGC CGGGTGAAGC GCCAGTTTGC GCTCAGCCGC TGCTTATTCC GTTGCGGCTA AAAGTTTCTG GCGAAGAAAA ATCGGCATTA GAAAAAGCGC AGTCTGCCCT GGCGGAATTG GGTATTGATT TCCAGTCAGA TGCACAGCAT GTGACCATCA GGGCCGTGCC TTTACCCTTA CGCCAACAAA ATTTACAAAT CTTGATTCCT GAACTGATAG GCTACCTGGC GAAGCAGTCC GTATTCGAAC CTGGCAATAT TGCGCAGTGG ATTGCACGAA ATCTGATGAG CGAACATGCG CAGTGGTCAA TGGCACAGGC CATAACCCTG CTGGCGGACG TGGAACGGTT ATGTCCGCAA CTTGTGAAAA CGCCGCCGGG TGGTCTGTTA CAATCTGTTG ATTTACATCC GGCGATAAAA GCCCTGAAAG ATGAGTGA
|
Protein sequence | MPIQVLPPQL ANQIAAGEVV ERPASVVKEL VENSLDAGAT RIDIDIERGG AKLIRIRDNG CGIKKDELAL ALARHATSKI ASLDDLEAII SLGFRGEALA SISSVSRLTL TSRTAEQQEA WQAYAEGRDM NVTVKPAAHP VGTTLEVLDL FYNTPARRKF LRTEKTEFNH IDEIIRRIAL ARFDVTINLS HNGKIVRQYR AVPEGGQKER RLGAICGTAF LEQALAIEWQ HGDLTLRGWV ADPNHTTPAL AEIQYCYVNG RMMRDRLINH AIRQACEDKL GADQQPAFVL YLEIDPHQVD VNVHPAKHEV RFHQSRLVHD FIYQGVLSVL QQQLETPLPL DDEPQPAPRS IPENRVAAGR NHFAEPAARE PVAPRYTPAP ASGSRPAAPW PNAQPGYQKQ QGEVYRQLLQ TPAPMQKLKA PEPQEPALAA NSQSFGRVLT IVHSDCALLE RDGNISLLAL PVAERWLRQV QLTPGEAPVC AQPLLIPLRL KVSGEEKSAL EKAQSALAEL GIDFQSDAQH VTIRAVPLPL RQQNLQILIP ELIGYLAKQS VFEPGNIAQW IARNLMSEHA QWSMAQAITL LADVERLCPQ LVKTPPGGLL QSVDLHPAIK ALKDE
|
| |