Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3986 |
Symbol | mutS |
ID | 6971425 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3685222 |
End bp | 3687783 |
Gene Length | 2562 bp |
Protein Length | 853 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643387755 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_002272198 |
Protein GI | 209399182 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.724345 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGCAA TAGAAAATTT CGACGCCCAT ACGCCCATGA TGCAGCAGTA TCTCAAGCTG AAAGCCCAGC ATCCCGAGAT CCTGCTGTTT TACCGGATGG GTGATTTTTA TGAACTGTTT TATGACGACG CAAAACGCGC GTCGCAACTG CTGGATATTT CACTGACCAA ACGCGGTGCT TCGGCGGGAG AGCCGATCCC GATGGCGGGG ATTCCCTACC ATGCGGTGGA AAACTACCTC GCCAAACTGG TGAATCAGGG CGAGTCCGTT GCCATCTGCG AACAAATTGG CGATCCGGCG ACCAGCAAAG GTCCGGTTGA GCGCAAAGTT GTGCGTATCG TTACGCCAGG CACCATCAGC GATGAAGCCC TGTTGCAGGA GCGTCAGGAC AACCTGCTGG CGGCTATCTG GCAGGACAGC AAAGGTTTCG GCTACGCGAC GCTGGATATC AGTTCCGGTC GTTTTCGCCT GAGCGAACCG GCTGACCGGG AAACGATGGC GGCAGAACTG CAACGCACTA ATCCTGCGGA ACTGCTGTAT GCAGAAGATT TTGCTGAAAT GTCGTTAATT GAAGGCCGTC GCGGCCTGCG CCGTCGCCCG CTGTGGGAGT TTGAAATCGA CACCGCGCGC CAGCAGTTGA ATCTGCAATT TGGGACCCGC GATCTGGTCG GTTTTGGCGT CGAGAACGCG CCGCGCGGAC TTTGTGCTGC CGGTTGTCTG TTGCAGTATG CGAAAGATAC CCAACGTACG ACTCTGCCGC ATATTCGTTC CATCACCATG GAACGTGAGC AGGACAGCAT CATTATGGAT GCCGCGACGC GTCGTAATCT GGAAATCACC CAGAACCTGG CGGGTGGTGC GGAAAATACG CTGGCTTCTG TGCTCGACTG CACCGTCACG CCGATGGGCA GCCGTATGCT GAAACGCTGG CTGCATATGC CAGTGCGCGA TACCCGCGTG TTGCTTGAGC GCCAGCAAAC TATTGGCGCA TTGCAGGATT TCACCGCCGA GTTGCAGCCG GTACTACGTC AGGTCGGCGA CCTGGAACGT ATTCTGGCGC GTCTGGCGTT GCGTACCGCT CGCCCACGCG ATCTGGCCCG TATGCGTCAC GCTTTCCAGC AACTGCCGGA GCTGCGTGCG CAGTTAGAAA CTGTTGATAG TGCACCAGTA CAGGCGCTAC GTGAGAAGAT GGGCGAGTTT GCCGAGCTGC GCGATCTGCT GGAGCGAGCA ATCATCGACA CACCGCCGGT GCTGGTACGC GACGGTGGTG TTATCGCATC AGGCTATAAC GAAGAGCTGG ATGAGTGGCG CGCGCTGGCT GACGGCGCGA CCGATTATCT GGAGCGTCTG GAAGTCCGCG AGCGTGAACG TACCGGCCTG GACACGCTAA AAGTTGGCTT TAATGCGGTG CACGGCTACT ACATTCAAAT CAGCCGTGGG CAAAGCCATC TGGCACCTAT CAACTATATG CGTCGCCAGA CGCTGAAAAA CGCCGAGCGC TACATCATTC CAGAGCTAAA AGAGTACGAA GATAAAGTCC TCACTTCAAA AGGCAAAGCA CTGGCTCTGG AAAAACAGCT TTATGAAGAG CTGTTCGACC TGCTGTTGCC GCATCTGGAA GCGTTGCAAC AGAGCGCGAG CGCGCTGGCG GAACTCGACG TGCTGGTGAA CCTGGCGGAA CGGGCCTATA CCCTGAACTA CACCTGCCCG ACCTTCATTG ATAAACCGGG CATTCGCATT ACCGAAGGCC GCCATCCGGT GGTTGAACAG GTGCTGAACG AGCCATTTAT CGCCAACCCG CTGAATCTGT CACCGCAGCG CCGGATGTTG ATTATTACCG GTCCGAACAT GGGCGGTAAA AGTACCTATA TGCGCCAGAC CGCACTGATT GCGCTGATGG CCTATATCGG CAGCTACGTA CCGGCGCAAA AAGTCGAGAT TGGCCCGATT GACCGTATCT TTACCCGCGT AGGGGCAGCG GATGATCTGG CTTCCGGGCG TTCAACCTTT ATGGTGGAGA TGACCGAAAC CGCTAATATT CTGCATAACG CCACCGAGTA CAGTCTGGTG CTGATGGATG AGATTGGGCG CGGAACGTCC ACTTACGATG GTCTGTCGCT GGCGTGGGCG TGCGCGGAAA ATCTGGCGAA TAAGATTAAG GCGTTGACGC TGTTTGCCAC CCACTATTTC GAGCTGACCC AGTTACCGGA GAAAATGGAA GGCGTCGCCA ACGTGCATCT CGATGCACTG GAGCACGGCG ACACCATTGC CTTTATGCAT AGCGTGCAGG ATGGCGCGGC GAGCAAAAGC TACGGCCTGG CGGTTGCAGC TCTGGCCGGC GTGCCAAAAG AGGTTATTAA GCGCGCACGG CAAAAACTGC GTGAGCTGGA AAGCATTTCG CCGAACGCCG CCGCTACGCA AGTGGATGGT ACGCAAATGT CTTTGCTGTC CGTACCGGAA GAAACCTCGC CTGCAGTCGA GGCACTGGAA AACCTCGATC CGGATTCACT CACCCCGCGT CAGGCGCTGG AATGGATTTA TCGCCTGAAG AGTCTGGTGT AA
|
Protein sequence | MSAIENFDAH TPMMQQYLKL KAQHPEILLF YRMGDFYELF YDDAKRASQL LDISLTKRGA SAGEPIPMAG IPYHAVENYL AKLVNQGESV AICEQIGDPA TSKGPVERKV VRIVTPGTIS DEALLQERQD NLLAAIWQDS KGFGYATLDI SSGRFRLSEP ADRETMAAEL QRTNPAELLY AEDFAEMSLI EGRRGLRRRP LWEFEIDTAR QQLNLQFGTR DLVGFGVENA PRGLCAAGCL LQYAKDTQRT TLPHIRSITM EREQDSIIMD AATRRNLEIT QNLAGGAENT LASVLDCTVT PMGSRMLKRW LHMPVRDTRV LLERQQTIGA LQDFTAELQP VLRQVGDLER ILARLALRTA RPRDLARMRH AFQQLPELRA QLETVDSAPV QALREKMGEF AELRDLLERA IIDTPPVLVR DGGVIASGYN EELDEWRALA DGATDYLERL EVRERERTGL DTLKVGFNAV HGYYIQISRG QSHLAPINYM RRQTLKNAER YIIPELKEYE DKVLTSKGKA LALEKQLYEE LFDLLLPHLE ALQQSASALA ELDVLVNLAE RAYTLNYTCP TFIDKPGIRI TEGRHPVVEQ VLNEPFIANP LNLSPQRRML IITGPNMGGK STYMRQTALI ALMAYIGSYV PAQKVEIGPI DRIFTRVGAA DDLASGRSTF MVEMTETANI LHNATEYSLV LMDEIGRGTS TYDGLSLAWA CAENLANKIK ALTLFATHYF ELTQLPEKME GVANVHLDAL EHGDTIAFMH SVQDGAASKS YGLAVAALAG VPKEVIKRAR QKLRELESIS PNAAATQVDG TQMSLLSVPE ETSPAVEALE NLDPDSLTPR QALEWIYRLK SLV
|
| |