Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_3030 |
Symbol | mutS |
ID | 5587055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 3032953 |
End bp | 3035535 |
Gene Length | 2583 bp |
Protein Length | 860 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640926676 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001464052 |
Protein GI | 157156430 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAA TCAATATGAA TGAAAACATT GACAAGGACT TCTCCAGCCA TACGCCAATG ATGCAGCAGT ATCTCAAGCT GAAAGCCCAG CATCCCGAGA TCCTGCTGTT TTACCGGATG GGTGATTTTT ATGAACTGTT TTATGACGAC GCAAAACGCG CGTCGCAACT GCTGGATATT TCACTGACCA AACGCGGTGC TTCGGCGGGA GAGCCGATCC CGATGGCAGG GATCCCCTAC CATGCGGTGG AAAACTACCT CGCCAAACTG GTGAATCAGG GCGAGTCCGT TGCCATCTGC GAACAAATTG GCGATCCGGC GACCAGCAAA GGTCCGGTTG AGCGCAAAGT TGTGCGTATC GTTACGCCAG GCACCATCAG CGATGAAGCC CTGTTACAGG AGCGTCAGGA CAACCTGCTG GCGGCTATCT GGCAGGACAG CAAAGGTTTC GGCTACGCGA CGCTGGATAT CAGCTCCGGG CGTTTTCGCC TGAGCGAACC GGCTGACCGG GAAACGATGG CGGCAGAACT GCAACGCACT AATCCTGCGG AACTGCTGTA TGCAGAAGAT TTTGCTGAAA TGTCGTTAAT TGAAGGCCGT CGCGGCCTGC GCCGTCGCCC GCTGTGGGAG TTTGAAATCG ACACCGCGCG CCAGCAGTTG AATCTGCAAT TTGGCACTCG CGATCTGGTC GGTTTTGGCG TCGAGAACGC GCCGCGCGGA CTTTGTGCTG CCGGTTGTCT GTTGCAGTAT GCGAAAGATA CCCAACGTAC GACTCTGCCG CATATTCGTT CCATCACCAT GGAACGTGAG CAGGACAGCA TCATTATGGA TGCCGCGACG CGTCGTAATC TGGAAATCAC CCAGAACCTG GCGGGTGGTG CGGAAAATAC GCTGGCTTCT GTGCTCGACT GCACCGTCAC GCCGATGGGC AGCCGTATGC TGAAACGCTG GCTGCATATG CCAGTGCGCG ATACCCGCGT GTTGCTTGAG CGCCAGCAAA CTATCGGCGC ATTGCAGGAT TTCACCGCCG AGTTGCAGCC GGTACTGCGT CAGGTCGGCG ACCTGGAACG TATTCTGGCA CGTCTGGCTT TACGAACTGC TCGCCCACGC GATCTGGCCC GTATGCGCCA CGCTTTCCAG CAACTGCCGG AGCTGCGTGC ACAGTTAGAA ACTGTCGATA GTGCACCGGT ACAGGCGCTA CGTGAGAAGA TGGGCGAGTT TGCCGAGCTG CGCGATCTGC TGGAGCGAGC AATCATCGAC ACACCGCCAG TGCTGGTACG CGACGGTGGT GTTATCGCAT CGGGCTATAA CGAAGAGCTG GATGAGTGGC GCGCGCTGGC TGACGGCGCG ACCGATTATC TGGAGCGTCT GGAAGTCCGC GAGCGTGAAC GTACCGGCCT GGACACGCTA AAAGTTGGCT TTAATGCGGT GCACGGCTAC TACATTCAAA TCAGCCGTGG GCAAAGCCAT CTGGCACCCA TCAACTACAT GCGTCGCCAG ACGCTGAAAA ACGCCGAGCG CTACATCATT CCAGAGCTAA AAGAGTACGA AGATAAAGTT CTCACCTCAA AAGGCAAAGC ACTGGCTCTG GAAAAACAGC TTTATGAAGA ACTGTTCGAC CTGCTGTTGC CGCATCTGGA AGCGTTGCAA CAGAGCGCGA GCGCGCTGGC GGAACTCGAC GTGCTGGTGA ACCTGGCGGA ACGGGCCTAT ACCCTGAACT ACACCTGCCC GACCTTTATT GATAAACCGG GCATTCGCAT TACCGAAGGC CGCCATCCGG TGGTTGAACA GGTGCTGAAT GAGCCGTTTA TCGCCAACCC GCTGAATCTG TCGCCGCAGC GTCGCATGTT GATCATCACC GGTCCGAACA TGGGCGGTAA AAGTACCTAT ATGCGCCAGA CCGCGTTGAT TGCGCTAATG GCCTATATCG GCAGCTACGT ACCGGCGCAA AAAGTCGAGA TTGGCCCGAT TGACCGTATC TTTACCCGCG TAGGCGCGGC AGATGACCTG GCTTCCGGGC GTTCGACCTT TATGGTGGAG ATGACCGAAA CCGCCAATAT TTTACATAAT GCCACCGAAT ATAGTCTGGT ATTGATGGAT GAGATCGGGC GTGGAACGTC CACTTACGAT GGTCTGTCGC TGGCGTGGGC GTGCGCGGAA AATCTGGCGA ATAAGATTAA GGCATTGACG CTGTTTGCCA CCCACTATTT CGAGCTGACA CAGTTACCGG AGAAAATGGA AGGCGTCGCC AACGTGCATC TCGATGCACT GGAGCACGGC GACACCATTG CCTTTATGCA CAGCGTGCAG GATGGCGCGG CGAGCAAAAG CTACGGCCTG GCGGTTGCAG CTCTGGCCGG CGTGCCAAAA GAGGTTATTA AGCGCGCACG GCAAAAGCTG CGTGAGCTGG AAAGCATTTC GCCGAACGCC GCCGCTACGC AAGTGGATGG TACGCAAATG TCTTTGCTGT CCGTACCAGA AGAAATTTCG CCTGCGGTCG AGGCACTGGA AAACCTGGAC CCGGATTCAC TGACTCCGCG TCAGGCGCTG GAGTGGATTT ATCGCTTGAA GAGTCTGGTG TAA
|
Protein sequence | MSKINMNENI DKDFSSHTPM MQQYLKLKAQ HPEILLFYRM GDFYELFYDD AKRASQLLDI SLTKRGASAG EPIPMAGIPY HAVENYLAKL VNQGESVAIC EQIGDPATSK GPVERKVVRI VTPGTISDEA LLQERQDNLL AAIWQDSKGF GYATLDISSG RFRLSEPADR ETMAAELQRT NPAELLYAED FAEMSLIEGR RGLRRRPLWE FEIDTARQQL NLQFGTRDLV GFGVENAPRG LCAAGCLLQY AKDTQRTTLP HIRSITMERE QDSIIMDAAT RRNLEITQNL AGGAENTLAS VLDCTVTPMG SRMLKRWLHM PVRDTRVLLE RQQTIGALQD FTAELQPVLR QVGDLERILA RLALRTARPR DLARMRHAFQ QLPELRAQLE TVDSAPVQAL REKMGEFAEL RDLLERAIID TPPVLVRDGG VIASGYNEEL DEWRALADGA TDYLERLEVR ERERTGLDTL KVGFNAVHGY YIQISRGQSH LAPINYMRRQ TLKNAERYII PELKEYEDKV LTSKGKALAL EKQLYEELFD LLLPHLEALQ QSASALAELD VLVNLAERAY TLNYTCPTFI DKPGIRITEG RHPVVEQVLN EPFIANPLNL SPQRRMLIIT GPNMGGKSTY MRQTALIALM AYIGSYVPAQ KVEIGPIDRI FTRVGAADDL ASGRSTFMVE MTETANILHN ATEYSLVLMD EIGRGTSTYD GLSLAWACAE NLANKIKALT LFATHYFELT QLPEKMEGVA NVHLDALEHG DTIAFMHSVQ DGAASKSYGL AVAALAGVPK EVIKRARQKL RELESISPNA AATQVDGTQM SLLSVPEEIS PAVEALENLD PDSLTPRQAL EWIYRLKSLV
|
| |