Gene EcHS_A2871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2871 
SymbolmutS 
ID5591359 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2874511 
End bp2877072 
Gene Length2562 bp 
Protein Length853 aa 
Translation table11 
GC content56% 
IMG OID640921988 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_001459499 
Protein GI157162181 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones49 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGCAA TAGAAAATTT CGACGCCCAT ACGCCCATGA TGCAGCAGTA TCTCAGGCTG 
AAAGCCCAGC ATCCCGAGAT CCTGCTGTTT TACCGGATGG GTGATTTTTA TGAACTGTTT
TATGACGACG CAAAACGCGC GTCGCAACTG CTGGATATTT CACTGACCAA ACGCGGTGCT
TCGGCGGGAG AGCCGATCCC GATGGCGGGG ATTCCCTACC ATGCGGTGGA AAACTATCTC
GCCAAACTGG TGAATCAGGG AGAGTCCGTT GCCATCTGCG AACAAATTGG CGATCCGGCG
ACCAGCAAAG GTCCGGTTGA GCGCAAAGTT GTGCGTATCG TTACGCCAGG CACCATCAGC
GATGAAGCCC TGTTGCAGGA GCGTCAGGAC AACCTGCTGG CGGCTATCTG GCAGGACAGC
AAAGGTTTCG GCTACGCGAC GCTGGATATC AGTTCCGGGC GTTTTCGCCT GAGCGAACCG
GCTGACCGCG AAACGATGGC GGCAGAACTG CAACGCACTA ATCCTGCGGA ACTGCTGTAT
GCAGAAGATT TTGCTGAAAT GTCGTTAATT GAAGGCCGTC GCGGCCTGCG CCGTCGCCCG
CTGTGGGAGT TTGAAATCGA CACCGCGCGC CAGCAGTTGA ATCTGCAATT TGGGACCCGC
GATCTGGTCG GTTTTGGCGT CGAGAACGCG CCGCGCGGAC TTTGTGCTGC CGGTTGTCTG
TTGCAGTATG CGAAAGATAC CCAACGTACG ACTCTGCCGC ATATTCGTTC CATCACCATG
GAACGTGAGC AGGACAGCAT CATTATGGAT GCCGCGACGC GTCGTAATCT GGAAATCACC
CAGAACCTGG CGGGTGGTGC GGAAAATACG CTGGCTTCTG TGCTCGACTG CACCGTCACG
CCGATGGGCA GCCGTATGCT GAAACGCTGG CTGCATATGC CAGTGCGCGA TACCCGCGTG
TTGCTTGAGC GCCAGCAAAC TATTGGCGCA TTGCAGGATT TCACCGCCGG GCTACAGCCG
GTACTGCGTC AGGTCGGCGA CCTGGAACGT ATTCTGGCAC GTCTGGCTTT ACGAACTGCT
CGCCCACGCG ATCTGGCCCG TATGCGCCAC GCTTTCCAGC AACTGCCGGA GCTGCGTGCG
CAGTTAGAAA CTGTCGATAG TGCACCGGTA CAGGCGCTAC GTGAGAAGAT GGGCGAGTTT
GCCGAGCTGC GCGATCTGCT GGAGCGAGCA ATCATCGACA CACCGCCGGT GCTGGTACGC
GACGGTGGTG TTATCGCATC GGGCTATAAC GAAGAGCTGG ATGAGTGGCG CGCGCTGGCT
GACGGCGCGA CCGATTATCT GGAGCGTCTG GAAGTCCGCG AGCGTGAACG TACCGGCCTG
GACACGCTGA AAGTTGGCTT TAATGCGGTG CACGGCTACT ACATTCAAAT CAGCCGTGGG
CAAAGCCATC TGGCACCCAT CAACTACATG CGTCGCCAGA CGCTGAAAAA CGCCGAGCGC
TACATCATTC CAGAGCTAAA AGAGTACGAA GATAAAGTTC TCACCTCAAA AGGCAAAGCA
CTGGCACTGG AAAAACAGCT TTATGAAGAG CTGTTCGACC TGCTGTTGCC GCATCTGGAA
GCGTTGCAAC AGAGCGCGAG CGCGCTGGCG GAACTCGACG TGCTGGTTAA CCTGGCGGAA
CGGGCCTATA CCCTGAACTA CACCTGCCCG ACCTTCATTG ATAAACCGGG CATTCGCATT
ACCGAAGGTC GCCATCCGGT AGTTGAACAA GTACTGAATG AGCCATTTAT CGCTAACCCG
CTGAATCTGT CGCCGCAGCG CCGTATGTTG ATCATCACCG GTCCGAACAT GGGCGGTAAA
AGTACCTATA TGCGCCAGAC CGCGTTGATT GCGCTGATGG CCTATATCGG CAGCTACGTA
CCGGCGCAAA AAGTCGAGAT TGGCCCGATT GACCGTATCT TTACCCGCGT AGGCGCGGCA
GATGACCTGG CTTCCGGGCG CTCAACCTTT ATGGTGGAGA TGACTGAAAC CGCCAATATT
TTACATAACG CCACCGAGTA CAGTCTGGTG CTGATGGATG AGATTGGGCG CGGAACTTCC
ACTTACGATG GTCTGTCGCT GGCGTGGGCG TGCGCGGAAA ATCTGGCGAA TAAGATTAAA
GCGTTGACGC TGTTTGCCAC CCACTATTTC GAGCTGACAC AGTTACCGGA GAAAATGGAA
GGCGTCGCCA ACGTGCATCT CGATGCACTG GAGCACGGCG ACACCATTGC CTTTATGCAC
AGCGTGCAGG ATGGCGCGGC GAGCAAAAGC TACGGCCTGG CGGTTGCAGC TCTGGCAGGC
GTGCCAAAAG AGGTGATTAA GCGCGCACGG CAAAAACTGC GTGAGCTGGA AAGCATTTCG
CCGAACGCCG CCGCTACGCA AGTGGATGGT ACGCAAATGT CTTTGCTGTC AGTACCGGAA
GAAACCTCGC CAGCGGTCGA AGCTCTGGAA AACCTGGACC CGGATTCACT GACTCCGCGT
CAGGCGCTGG AATGGATTTA TCGCTTGAAG AGCCTGGTGT AA
 
Protein sequence
MSAIENFDAH TPMMQQYLRL KAQHPEILLF YRMGDFYELF YDDAKRASQL LDISLTKRGA 
SAGEPIPMAG IPYHAVENYL AKLVNQGESV AICEQIGDPA TSKGPVERKV VRIVTPGTIS
DEALLQERQD NLLAAIWQDS KGFGYATLDI SSGRFRLSEP ADRETMAAEL QRTNPAELLY
AEDFAEMSLI EGRRGLRRRP LWEFEIDTAR QQLNLQFGTR DLVGFGVENA PRGLCAAGCL
LQYAKDTQRT TLPHIRSITM EREQDSIIMD AATRRNLEIT QNLAGGAENT LASVLDCTVT
PMGSRMLKRW LHMPVRDTRV LLERQQTIGA LQDFTAGLQP VLRQVGDLER ILARLALRTA
RPRDLARMRH AFQQLPELRA QLETVDSAPV QALREKMGEF AELRDLLERA IIDTPPVLVR
DGGVIASGYN EELDEWRALA DGATDYLERL EVRERERTGL DTLKVGFNAV HGYYIQISRG
QSHLAPINYM RRQTLKNAER YIIPELKEYE DKVLTSKGKA LALEKQLYEE LFDLLLPHLE
ALQQSASALA ELDVLVNLAE RAYTLNYTCP TFIDKPGIRI TEGRHPVVEQ VLNEPFIANP
LNLSPQRRML IITGPNMGGK STYMRQTALI ALMAYIGSYV PAQKVEIGPI DRIFTRVGAA
DDLASGRSTF MVEMTETANI LHNATEYSLV LMDEIGRGTS TYDGLSLAWA CAENLANKIK
ALTLFATHYF ELTQLPEKME GVANVHLDAL EHGDTIAFMH SVQDGAASKS YGLAVAALAG
VPKEVIKRAR QKLRELESIS PNAAATQVDG TQMSLLSVPE ETSPAVEALE NLDPDSLTPR
QALEWIYRLK SLV