Gene Information |
Locus tag | EcSMS35_2860 |
Symbol | mutS |
ID | 6144403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2933451 |
End bp | 2936012 |
Gene Length | 2562 bp |
Protein Length | 853 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641617729 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001744884 |
Protein GI | 170680116 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
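
The length fields above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, which the record does not state but which reproduces the listed 2562 bp. It also assumes the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(2933451, 2936012)
print(length)                     # 2562
print(protein_length_aa(length))  # 853
```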
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGCAA TAGAAAATTT CGATGCCCAT ACGCCCATGA TGCAGCAATA TCTCAAGCTG AAAGCCCAGC ATCCCGAGAT CCTGCTGTTT TACCGGATGG GTGATTTTTA TGAACTATTT TATGACGATG CAAAACGCGC CTCGCAACTG CTGGATATTT CACTGACCAA ACGCGGTGCT TCAGCGGGAG AGCCGATCCC GATGGCGGGG ATCCCCTACC ATGCGGTGGA AAACTACCTC GCCAAACTGG TGAATCAGGG CGAGTCCGTT GCCATCTGCG AACAAATTGG CGATCCGGCG ACCAGCAAAG GTCCGGTTGA GCGCAAAGTT GTGCGTATCG TTACGCCGGG CACCATCAGC GATGAAGCGC TGTTGCAGGA GCGTCAGGAT AACCTGCTGG CGGCTATCTG GCAGGACAGC AAAGGTTTCG GCTACGCGAC GCTGGATATC AGTTCCGGGC GTTTTCGCCT GAGCGAACCG GCTGACCGCG AAACGATGGC GGCAGAGCTG CAACGCACTA ATCCGGCAGA GCTGTTGTAT GCAGAAGATT TCGCCGAGAT GTCGCTGATT GAAGGTCGTC GCGGACTGCG CCGTCGCCCG CTGTGGGAGT TTGAAATCGA CACCGCGCGC CAGCAGTTGA ATCTGCAATT TGGCACTCGC GATCTGGTCG GTTTTGGCGT CGAGAACGCG CCGCGCGGAC TTTGTGCTGC CGGTTGTCTG TTGCAGTATG CGAAAGATAC CCAACGCACG ACCCTGCCGC ATATTCGTTC TATCACCATG GAACGTGAGC AGGACAGCAT CATTATGGAT GCCGCGACGC GTCGTAATCT GGAAATCACC CAGAATCTGG CGGGTGGTGC GGAAAATACG CTGGCTTCTG TGCTCGACTG CACCGTCACG CCGATGGGCA GCCGTATGCT GAAACGCTGG CTGCATATGC CAGTGCGCGA TACCCGCGTG TTGCTTGAGC GCCAGCAAAC TATTGGCGCA TTGCAGGATT TCACCGCCGA GCTGCAGCCG GTACTGCGTC AGGTCGGCGA CCTGGAACGT ATTCTGGCAC GTCTGGCTTT ACGAACTGCT CGCCCACGCG ATCTGGCCCG TATGCGTCAC GCTTTCCAGC AACTGCCGGA GCTGCGTGCA CTATTAGAAA ATGTCGATAG CGCTCCGGTA CAGGCGCTGC GTGAAAAAAT GGGCGAGTTT GCCGAGCTGC GCGATCTGCT GGAGCGAGCA ATCATCGACA CACCGCCAGT GCTGGTACGC GACGGTGGTG TTATCGCACC GGGCTATAAC GAAGAGCTGG ATGAGTGGCG CGCGCTGGCT GACGGCGCGA CCGATTATCT GGAGCGTCTG GAGGTCCGCG AGCGTGAACG TACCGGCCTG GACACGCTGA AAGTTGGCTT TAATGCAGTG CACGGCTACT ACATTCAGAT CAGCCGTGGG CAAAGCCATC TGGCACCGAT TAACTACATG CGCCGCCAGA CGCTGAAAAA CGCCGAGCGT TACATTATTC CGGAGCTGAA AGAGTACGAA GATAAAGTTC TCACCTCAAA AGGCAAAGCA CTGGCTCTGG AAAAACAACT TTATGAAGAG CTGTTCGACT TGCTGTTGCC GCATCTGGAA GCGTTGCAAC AGAGCGCGAG CGCGCTGGCG GAACTCGACG TGCTGGTGAA CCTGGCGGAA CGGGCCTATA CCCTGAACTA CACCTGCCCG ACCTTTATTG ATAAACCTGG CATTCGCATT ACCGAAGGCC GCCATCCGGT GGTTGAACAG GTACTGAATG AGCCGTTTAT CGCCAACCCG 
CTGAACCTGT CGCCGCAGCG CCGCATGTTG ATCATCACCG GTCCGAACAT GGGCGGTAAA AGTACCTATA TGCGCCAGAC CGCGTTGATT GCGCTGATGG CCTATATCGG CAGCTACGTA CCGGCGCAAA AAGTCGAGAT TGGCCCGATC GATCGCATCT TTACCCGCGT AGGTGCCGCG GATGATCTGG CTTCCGGACG TTCAACCTTT ATGGTGGAGA TGACCGAAAC CGCCAATATT TTACATAACG CCACCGAATA CAGTCTGGTG TTGATGGACG AGATTGGGCG TGGAACGTCC ACTTACGATG GTCTGTCGCT GGCGTGGGCA TGCGCGGAAA ATCTGGCAAA TAAGATCAAA GCGTTGACGC TGTTTGCTAC CCACTATTTC GAGCTGACCC AACTGCCGGA GAAAATGGAA GGCGTCGCCA ACGTGCATCT CGATGCGCTG GAGCACGGCG ACACCATTGC CTTTATGCAT AGCGTGCAGG ATGGCGCGGC AAGCAAAAGC TACGGCCTGG CGGTTGCAGC GCTGGCAGGC GTGCCAAAAG AGGTTATTAA GCGCGCACGG CAAAAACTGC GTGAGCTGGA AAGCATTTCG CCGAACGCCG CTGCTACGCA AGTGGATGGT ACACAAATGT CTTTGCTATC CGTACCGGAA GAAACTTCGC CTGCGGTCGA GGCACTGGAA AACCTGGACC CAGATTCACT CACTCCGCGT CAGGCGCTGG AGTGGATTTA TCGCTTGAAG AGTCTGGTGT AA
|
Protein sequence | MSAIENFDAH TPMMQQYLKL KAQHPEILLF YRMGDFYELF YDDAKRASQL LDISLTKRGA SAGEPIPMAG IPYHAVENYL AKLVNQGESV AICEQIGDPA TSKGPVERKV VRIVTPGTIS DEALLQERQD NLLAAIWQDS KGFGYATLDI SSGRFRLSEP ADRETMAAEL QRTNPAELLY AEDFAEMSLI EGRRGLRRRP LWEFEIDTAR QQLNLQFGTR DLVGFGVENA PRGLCAAGCL LQYAKDTQRT TLPHIRSITM EREQDSIIMD AATRRNLEIT QNLAGGAENT LASVLDCTVT PMGSRMLKRW LHMPVRDTRV LLERQQTIGA LQDFTAELQP VLRQVGDLER ILARLALRTA RPRDLARMRH AFQQLPELRA LLENVDSAPV QALREKMGEF AELRDLLERA IIDTPPVLVR DGGVIAPGYN EELDEWRALA DGATDYLERL EVRERERTGL DTLKVGFNAV HGYYIQISRG QSHLAPINYM RRQTLKNAER YIIPELKEYE DKVLTSKGKA LALEKQLYEE LFDLLLPHLE ALQQSASALA ELDVLVNLAE RAYTLNYTCP TFIDKPGIRI TEGRHPVVEQ VLNEPFIANP LNLSPQRRML IITGPNMGGK STYMRQTALI ALMAYIGSYV PAQKVEIGPI DRIFTRVGAA DDLASGRSTF MVEMTETANI LHNATEYSLV LMDEIGRGTS TYDGLSLAWA CAENLANKIK ALTLFATHYF ELTQLPEKME GVANVHLDAL EHGDTIAFMH SVQDGAASKS YGLAVAALAG VPKEVIKRAR QKLRELESIS PNAAATQVDG TQMSLLSVPE ETSPAVEALE NLDPDSLTPR QALEWIYRLK SLV
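
The sequences above are printed in space-separated blocks of ten, so the whitespace has to be stripped before any computation. The following is a minimal sketch, not the database's own tooling, of how the GC content and the translation could be checked with the Python standard library alone. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the amino-acid string NCBI publishes for the standard code; translation table 11 uses the same codon-to-amino-acid assignments and differs only in the set of permitted start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# NCBI ordering: codons enumerated over T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks and last two blocks of the gene sequence listed above.
first_blocks = "ATGAGTGCAA TAGAAAATTT CGATGCCCAT"
last_blocks = "AGTCTGGTGT AA"

print(translate(first_blocks))  # MSAIENFDAH
print(translate(last_blocks))   # SLV*
```

Applying `gc_fraction` and `translate` to the full gene sequence would allow the listed GC content and protein sequence to be compared against the nucleotide data.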