Gene EcSMS35_2860 details

Gene Information

Locus tag: EcSMS35_2860
Symbol: mutS
ID: 6144403
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2933451
End bp: 2936012
Gene Length: 2562 bp
Protein Length: 853 aa
Translation table: 11
GC content: 56%
IMG OID: 641617729
Product: DNA mismatch repair protein MutS
Protein accession: YP_001744884
Protein GI: 170680116
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
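
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2933451
end_bp = 2936012
reported_gene_length = 2562      # bp
reported_protein_length = 853    # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue plus one trailing stop codon.
protein_length = gene_length // 3 - 1

print("gene length:", gene_length, "bp")
print("protein length:", protein_length, "aa")
print("consistent with record:",
      gene_length == reported_gene_length
      and protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the reported gene and protein lengths.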


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGCAA TAGAAAATTT CGATGCCCAT ACGCCCATGA TGCAGCAATA TCTCAAGCTG 
AAAGCCCAGC ATCCCGAGAT CCTGCTGTTT TACCGGATGG GTGATTTTTA TGAACTATTT
TATGACGATG CAAAACGCGC CTCGCAACTG CTGGATATTT CACTGACCAA ACGCGGTGCT
TCAGCGGGAG AGCCGATCCC GATGGCGGGG ATCCCCTACC ATGCGGTGGA AAACTACCTC
GCCAAACTGG TGAATCAGGG CGAGTCCGTT GCCATCTGCG AACAAATTGG CGATCCGGCG
ACCAGCAAAG GTCCGGTTGA GCGCAAAGTT GTGCGTATCG TTACGCCGGG CACCATCAGC
GATGAAGCGC TGTTGCAGGA GCGTCAGGAT AACCTGCTGG CGGCTATCTG GCAGGACAGC
AAAGGTTTCG GCTACGCGAC GCTGGATATC AGTTCCGGGC GTTTTCGCCT GAGCGAACCG
GCTGACCGCG AAACGATGGC GGCAGAGCTG CAACGCACTA ATCCGGCAGA GCTGTTGTAT
GCAGAAGATT TCGCCGAGAT GTCGCTGATT GAAGGTCGTC GCGGACTGCG CCGTCGCCCG
CTGTGGGAGT TTGAAATCGA CACCGCGCGC CAGCAGTTGA ATCTGCAATT TGGCACTCGC
GATCTGGTCG GTTTTGGCGT CGAGAACGCG CCGCGCGGAC TTTGTGCTGC CGGTTGTCTG
TTGCAGTATG CGAAAGATAC CCAACGCACG ACCCTGCCGC ATATTCGTTC TATCACCATG
GAACGTGAGC AGGACAGCAT CATTATGGAT GCCGCGACGC GTCGTAATCT GGAAATCACC
CAGAATCTGG CGGGTGGTGC GGAAAATACG CTGGCTTCTG TGCTCGACTG CACCGTCACG
CCGATGGGCA GCCGTATGCT GAAACGCTGG CTGCATATGC CAGTGCGCGA TACCCGCGTG
TTGCTTGAGC GCCAGCAAAC TATTGGCGCA TTGCAGGATT TCACCGCCGA GCTGCAGCCG
GTACTGCGTC AGGTCGGCGA CCTGGAACGT ATTCTGGCAC GTCTGGCTTT ACGAACTGCT
CGCCCACGCG ATCTGGCCCG TATGCGTCAC GCTTTCCAGC AACTGCCGGA GCTGCGTGCA
CTATTAGAAA ATGTCGATAG CGCTCCGGTA CAGGCGCTGC GTGAAAAAAT GGGCGAGTTT
GCCGAGCTGC GCGATCTGCT GGAGCGAGCA ATCATCGACA CACCGCCAGT GCTGGTACGC
GACGGTGGTG TTATCGCACC GGGCTATAAC GAAGAGCTGG ATGAGTGGCG CGCGCTGGCT
GACGGCGCGA CCGATTATCT GGAGCGTCTG GAGGTCCGCG AGCGTGAACG TACCGGCCTG
GACACGCTGA AAGTTGGCTT TAATGCAGTG CACGGCTACT ACATTCAGAT CAGCCGTGGG
CAAAGCCATC TGGCACCGAT TAACTACATG CGCCGCCAGA CGCTGAAAAA CGCCGAGCGT
TACATTATTC CGGAGCTGAA AGAGTACGAA GATAAAGTTC TCACCTCAAA AGGCAAAGCA
CTGGCTCTGG AAAAACAACT TTATGAAGAG CTGTTCGACT TGCTGTTGCC GCATCTGGAA
GCGTTGCAAC AGAGCGCGAG CGCGCTGGCG GAACTCGACG TGCTGGTGAA CCTGGCGGAA
CGGGCCTATA CCCTGAACTA CACCTGCCCG ACCTTTATTG ATAAACCTGG CATTCGCATT
ACCGAAGGCC GCCATCCGGT GGTTGAACAG GTACTGAATG AGCCGTTTAT CGCCAACCCG
CTGAACCTGT CGCCGCAGCG CCGCATGTTG ATCATCACCG GTCCGAACAT GGGCGGTAAA
AGTACCTATA TGCGCCAGAC CGCGTTGATT GCGCTGATGG CCTATATCGG CAGCTACGTA
CCGGCGCAAA AAGTCGAGAT TGGCCCGATC GATCGCATCT TTACCCGCGT AGGTGCCGCG
GATGATCTGG CTTCCGGACG TTCAACCTTT ATGGTGGAGA TGACCGAAAC CGCCAATATT
TTACATAACG CCACCGAATA CAGTCTGGTG TTGATGGACG AGATTGGGCG TGGAACGTCC
ACTTACGATG GTCTGTCGCT GGCGTGGGCA TGCGCGGAAA ATCTGGCAAA TAAGATCAAA
GCGTTGACGC TGTTTGCTAC CCACTATTTC GAGCTGACCC AACTGCCGGA GAAAATGGAA
GGCGTCGCCA ACGTGCATCT CGATGCGCTG GAGCACGGCG ACACCATTGC CTTTATGCAT
AGCGTGCAGG ATGGCGCGGC AAGCAAAAGC TACGGCCTGG CGGTTGCAGC GCTGGCAGGC
GTGCCAAAAG AGGTTATTAA GCGCGCACGG CAAAAACTGC GTGAGCTGGA AAGCATTTCG
CCGAACGCCG CTGCTACGCA AGTGGATGGT ACACAAATGT CTTTGCTATC CGTACCGGAA
GAAACTTCGC CTGCGGTCGA GGCACTGGAA AACCTGGACC CAGATTCACT CACTCCGCGT
CAGGCGCTGG AGTGGATTTA TCGCTTGAAG AGTCTGGTGT AA
 
Protein sequence
MSAIENFDAH TPMMQQYLKL KAQHPEILLF YRMGDFYELF YDDAKRASQL LDISLTKRGA 
SAGEPIPMAG IPYHAVENYL AKLVNQGESV AICEQIGDPA TSKGPVERKV VRIVTPGTIS
DEALLQERQD NLLAAIWQDS KGFGYATLDI SSGRFRLSEP ADRETMAAEL QRTNPAELLY
AEDFAEMSLI EGRRGLRRRP LWEFEIDTAR QQLNLQFGTR DLVGFGVENA PRGLCAAGCL
LQYAKDTQRT TLPHIRSITM EREQDSIIMD AATRRNLEIT QNLAGGAENT LASVLDCTVT
PMGSRMLKRW LHMPVRDTRV LLERQQTIGA LQDFTAELQP VLRQVGDLER ILARLALRTA
RPRDLARMRH AFQQLPELRA LLENVDSAPV QALREKMGEF AELRDLLERA IIDTPPVLVR
DGGVIAPGYN EELDEWRALA DGATDYLERL EVRERERTGL DTLKVGFNAV HGYYIQISRG
QSHLAPINYM RRQTLKNAER YIIPELKEYE DKVLTSKGKA LALEKQLYEE LFDLLLPHLE
ALQQSASALA ELDVLVNLAE RAYTLNYTCP TFIDKPGIRI TEGRHPVVEQ VLNEPFIANP
LNLSPQRRML IITGPNMGGK STYMRQTALI ALMAYIGSYV PAQKVEIGPI DRIFTRVGAA
DDLASGRSTF MVEMTETANI LHNATEYSLV LMDEIGRGTS TYDGLSLAWA CAENLANKIK
ALTLFATHYF ELTQLPEKME GVANVHLDAL EHGDTIAFMH SVQDGAASKS YGLAVAALAG
VPKEVIKRAR QKLRELESIS PNAAATQVDG TQMSLLSVPE ETSPAVEALE NLDPDSLTPR
QALEWIYRLK SLV
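
As a quick sanity check that the gene and protein sequences correspond, the first 30 bases and the last line of the gene sequence can be translated by hand or with a short script. The sketch below is a minimal, dependency-free translator, not the database's own pipeline. It uses the standard amino-acid assignments, which translation table 11 shares (table 11 differs from the standard code only in its permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
# Translation table 11 uses these same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any incomplete trailing codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks and the last line of the gene sequence above.
first_block = "ATGAGTGCAA TAGAAAATTT CGATGCCCAT"
last_line = "CAGGCGCTGG AGTGGATTTA TCGCTTGAAG AGTCTGGTGT AA"

print(translate(first_block))
print(translate(last_line))
```

The first call should reproduce the opening residues of the protein sequence (MSAIENFDAH), and the second its closing residues followed by the stop codon. Each 60-base line of the gene sequence holds exactly 20 codons, so any full line can be checked the same way against the matching 20 residues of the protein.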