Gene ECH74115_3986 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3986 
SymbolmutS 
ID6971425 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3685222 
End bp3687783 
Gene Length2562 bp 
Protein Length853 aa 
Translation table11 
GC content56% 
IMG OID643387755 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_002272198 
Protein GI209399182 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.724345 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCAA TAGAAAATTT CGACGCCCAT ACGCCCATGA TGCAGCAGTA TCTCAAGCTG 
AAAGCCCAGC ATCCCGAGAT CCTGCTGTTT TACCGGATGG GTGATTTTTA TGAACTGTTT
TATGACGACG CAAAACGCGC GTCGCAACTG CTGGATATTT CACTGACCAA ACGCGGTGCT
TCGGCGGGAG AGCCGATCCC GATGGCGGGG ATTCCCTACC ATGCGGTGGA AAACTACCTC
GCCAAACTGG TGAATCAGGG CGAGTCCGTT GCCATCTGCG AACAAATTGG CGATCCGGCG
ACCAGCAAAG GTCCGGTTGA GCGCAAAGTT GTGCGTATCG TTACGCCAGG CACCATCAGC
GATGAAGCCC TGTTGCAGGA GCGTCAGGAC AACCTGCTGG CGGCTATCTG GCAGGACAGC
AAAGGTTTCG GCTACGCGAC GCTGGATATC AGTTCCGGTC GTTTTCGCCT GAGCGAACCG
GCTGACCGGG AAACGATGGC GGCAGAACTG CAACGCACTA ATCCTGCGGA ACTGCTGTAT
GCAGAAGATT TTGCTGAAAT GTCGTTAATT GAAGGCCGTC GCGGCCTGCG CCGTCGCCCG
CTGTGGGAGT TTGAAATCGA CACCGCGCGC CAGCAGTTGA ATCTGCAATT TGGGACCCGC
GATCTGGTCG GTTTTGGCGT CGAGAACGCG CCGCGCGGAC TTTGTGCTGC CGGTTGTCTG
TTGCAGTATG CGAAAGATAC CCAACGTACG ACTCTGCCGC ATATTCGTTC CATCACCATG
GAACGTGAGC AGGACAGCAT CATTATGGAT GCCGCGACGC GTCGTAATCT GGAAATCACC
CAGAACCTGG CGGGTGGTGC GGAAAATACG CTGGCTTCTG TGCTCGACTG CACCGTCACG
CCGATGGGCA GCCGTATGCT GAAACGCTGG CTGCATATGC CAGTGCGCGA TACCCGCGTG
TTGCTTGAGC GCCAGCAAAC TATTGGCGCA TTGCAGGATT TCACCGCCGA GTTGCAGCCG
GTACTACGTC AGGTCGGCGA CCTGGAACGT ATTCTGGCGC GTCTGGCGTT GCGTACCGCT
CGCCCACGCG ATCTGGCCCG TATGCGTCAC GCTTTCCAGC AACTGCCGGA GCTGCGTGCG
CAGTTAGAAA CTGTTGATAG TGCACCAGTA CAGGCGCTAC GTGAGAAGAT GGGCGAGTTT
GCCGAGCTGC GCGATCTGCT GGAGCGAGCA ATCATCGACA CACCGCCGGT GCTGGTACGC
GACGGTGGTG TTATCGCATC AGGCTATAAC GAAGAGCTGG ATGAGTGGCG CGCGCTGGCT
GACGGCGCGA CCGATTATCT GGAGCGTCTG GAAGTCCGCG AGCGTGAACG TACCGGCCTG
GACACGCTAA AAGTTGGCTT TAATGCGGTG CACGGCTACT ACATTCAAAT CAGCCGTGGG
CAAAGCCATC TGGCACCTAT CAACTATATG CGTCGCCAGA CGCTGAAAAA CGCCGAGCGC
TACATCATTC CAGAGCTAAA AGAGTACGAA GATAAAGTCC TCACTTCAAA AGGCAAAGCA
CTGGCTCTGG AAAAACAGCT TTATGAAGAG CTGTTCGACC TGCTGTTGCC GCATCTGGAA
GCGTTGCAAC AGAGCGCGAG CGCGCTGGCG GAACTCGACG TGCTGGTGAA CCTGGCGGAA
CGGGCCTATA CCCTGAACTA CACCTGCCCG ACCTTCATTG ATAAACCGGG CATTCGCATT
ACCGAAGGCC GCCATCCGGT GGTTGAACAG GTGCTGAACG AGCCATTTAT CGCCAACCCG
CTGAATCTGT CACCGCAGCG CCGGATGTTG ATTATTACCG GTCCGAACAT GGGCGGTAAA
AGTACCTATA TGCGCCAGAC CGCACTGATT GCGCTGATGG CCTATATCGG CAGCTACGTA
CCGGCGCAAA AAGTCGAGAT TGGCCCGATT GACCGTATCT TTACCCGCGT AGGGGCAGCG
GATGATCTGG CTTCCGGGCG TTCAACCTTT ATGGTGGAGA TGACCGAAAC CGCTAATATT
CTGCATAACG CCACCGAGTA CAGTCTGGTG CTGATGGATG AGATTGGGCG CGGAACGTCC
ACTTACGATG GTCTGTCGCT GGCGTGGGCG TGCGCGGAAA ATCTGGCGAA TAAGATTAAG
GCGTTGACGC TGTTTGCCAC CCACTATTTC GAGCTGACCC AGTTACCGGA GAAAATGGAA
GGCGTCGCCA ACGTGCATCT CGATGCACTG GAGCACGGCG ACACCATTGC CTTTATGCAT
AGCGTGCAGG ATGGCGCGGC GAGCAAAAGC TACGGCCTGG CGGTTGCAGC TCTGGCCGGC
GTGCCAAAAG AGGTTATTAA GCGCGCACGG CAAAAACTGC GTGAGCTGGA AAGCATTTCG
CCGAACGCCG CCGCTACGCA AGTGGATGGT ACGCAAATGT CTTTGCTGTC CGTACCGGAA
GAAACCTCGC CTGCAGTCGA GGCACTGGAA AACCTCGATC CGGATTCACT CACCCCGCGT
CAGGCGCTGG AATGGATTTA TCGCCTGAAG AGTCTGGTGT AA
 
Protein sequence
MSAIENFDAH TPMMQQYLKL KAQHPEILLF YRMGDFYELF YDDAKRASQL LDISLTKRGA 
SAGEPIPMAG IPYHAVENYL AKLVNQGESV AICEQIGDPA TSKGPVERKV VRIVTPGTIS
DEALLQERQD NLLAAIWQDS KGFGYATLDI SSGRFRLSEP ADRETMAAEL QRTNPAELLY
AEDFAEMSLI EGRRGLRRRP LWEFEIDTAR QQLNLQFGTR DLVGFGVENA PRGLCAAGCL
LQYAKDTQRT TLPHIRSITM EREQDSIIMD AATRRNLEIT QNLAGGAENT LASVLDCTVT
PMGSRMLKRW LHMPVRDTRV LLERQQTIGA LQDFTAELQP VLRQVGDLER ILARLALRTA
RPRDLARMRH AFQQLPELRA QLETVDSAPV QALREKMGEF AELRDLLERA IIDTPPVLVR
DGGVIASGYN EELDEWRALA DGATDYLERL EVRERERTGL DTLKVGFNAV HGYYIQISRG
QSHLAPINYM RRQTLKNAER YIIPELKEYE DKVLTSKGKA LALEKQLYEE LFDLLLPHLE
ALQQSASALA ELDVLVNLAE RAYTLNYTCP TFIDKPGIRI TEGRHPVVEQ VLNEPFIANP
LNLSPQRRML IITGPNMGGK STYMRQTALI ALMAYIGSYV PAQKVEIGPI DRIFTRVGAA
DDLASGRSTF MVEMTETANI LHNATEYSLV LMDEIGRGTS TYDGLSLAWA CAENLANKIK
ALTLFATHYF ELTQLPEKME GVANVHLDAL EHGDTIAFMH SVQDGAASKS YGLAVAALAG
VPKEVIKRAR QKLRELESIS PNAAATQVDG TQMSLLSVPE ETSPAVEALE NLDPDSLTPR
QALEWIYRLK SLV