Gene SeSA_A3059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A3059 
SymbolmutS 
ID6517413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp2955616 
End bp2958183 
Gene Length2568 bp 
Protein Length855 aa 
Translation table11 
GC content57% 
IMG OID642748081 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_002115858 
Protein GI194737683 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.725135 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGAGT CATTTGATAA GGACTTCTCC AACCACACAC CAATGATGCA GCAGTATCTC 
AAGCTGAAAG CCCAGCATCC GGAGATCCTG CTCTTTTATC GTATGGGAGA CTTTTACGAG
CTGTTTTATG ACGACGCAAA GCGTGCGTCG CAGTTGCTCG ATATTTCGCT GACCAAACGC
GGCGCATCGG CTGGCGAACC TATCCCTATG GCGGGTATCC CGCACCACGC CGTAGAAAAC
TACCTCGCGA AACTGGTCAA TCAGGGCGAA TCCGTCGCTA TTTGCGAACA GATTGGCGAT
CCGGCCACCA GCAAAGGTCC CGTTGAACGT AAGGTCGTGC GTATCGTTAC GCCTGGCACC
ATCAGCGATG AAGCCCTGTT ACAGGAGCGT CAGGATAACC TGCTGGCGGC TATCTGGCAG
GATGGTAAAG GCTACGGTTA CGCCACGCTG GATATCAGCT CCGGGCGCTT TCGTCTGAGC
GAACCCGCCG ACCGTGAAAC GATGGCCGCA GAGTTGCAGC GCACCAATCC CGCCGAACTG
TTGTATGCCG AAGATTTTGC TGAAATGGCG TTAATAGAGG GACGCCGGGG TTTGCGCCGT
CGCCCGCTGT GGGAGTTTGA GATTGATACC GCCCGCCAGC AGCTAAATCT GCAATTCGGT
ACGCGGGATC TGGTCGGTTT CGGCGTCGAA AATGCCTCGC GTGGATTATG TGCGGCAGGC
TGCCTGTTAC AGTACGTAAA AGATACCCAG CGCACCTCCC TGCCGCATAT TCGTTCCATT
ACGATGGAAC GCCAGCAGGA CAGCATCATT ATGGATGCCG CCACCCGCCG CAATCTGGAA
ATTACCCAGA ACCTGGCCGG CGGCGTCGAA AATACCCTCG CCGCGGTGCT TGACTGTACC
GTGACGCCAA TGGGTAGCCG AATGCTTAAA CGCTGGCTGC ATATGCCGGT ACGAAATACC
GACATATTGC GTGAACGCCA GCAGACCATC GGCGCCTTGC AGGACACCGT CAGCGAACTG
CAACCGGTGC TGCGTCAGGT CGGCGATCTG GAGCGTATCC TTGCGCGTCT GGCGCTGCGC
ACCGCGCGTC CGCGCGATCT CGCCCGAATG CGTCACGCCT TCCAGCAGCT GCCGGAGTTG
CACGCTCAGC TTGAAACCGT TGACAGCGCG CCGGTACAGG CGTTGCGTAA AAAAATGGGC
GATTTCGCCG AGCTGCGCGA CCTCCTGGAA CGCGCCATTA TTGACGCGCC GCCGGTACTG
GTCCGCGACG GCGGCGTTAT TGCGCCCGGC TACCATGAAG AGCTGGACGA ATGGCGCGCG
CTGGCGGACG GCGCCACCGA TTATCTCGAT CGTCTGGAAA TTCGCGAGCG CGAGCGTACC
GGGCTGGATA CGCTGAAAGT CGGCTATAAC GCGGTACATG GTTATTACAT TCAGATTAGC
CGTGGTCAAA GCCATCTGGC GCCCATCAAT TATGTGCGTC GCCAGACGCT GAAAAATGCC
GAACGCTACA TTATTCCTGA GCTAAAAGAG TACGAAGATA AAGTTCTGAC CTCGAAAGGC
AAAGCGCTGG CGCTGGAAAA ACAGCTATAT GACGAATTGT TTGATCTGCT GTTGCCGCAT
CTGGCCGATT TACAGCAGAG CGCCAACGCG CTGGCGGAGC TGGATGTGCT GGTTAACCTG
TCCGAACGCG CCTGGACGCT GAATTACACC TGCCCGACAT TTACCGATAA ACCCGGTATC
CGTATTACCG AAGGTCGCCA CCCGGTGGTT GAACAGGTAC TGAATGAGCC GTTTATCGCT
AACCCGCTCA ACCTGTCGCC GCAGCGCCGG ATGTTGATCA TTACCGGCCC CAATATGGGC
GGTAAAAGTA CCTATATGCG GCAAACAGCA TTGATCGCCC TGCTGGCCTA TATCGGCAGT
TACGTTCCGG CGCAAAACGT GGAAATCGGC CCCATCGACC GTATTTTTAC CCGCGTCGGC
GCAGCGGACG ACCTGGCTTC CGGGCGTTCG ACCTTTATGG TGGAGATGAC CGAAACCGCG
AACATTCTGC ATAATGCCAC GGAAAACAGT CTGGTATTGA TGGATGAAAT CGGGCGCGGT
ACCTCCACGT ATGACGGGCT GTCGTTGGCC TGGGCCTGCG CGGAGAATCT CGCGAATAAG
ATTAAAGCGT TAACGCTGTT CGCTACCCAC TACTTCGAGC TGACCCAATT ACCGGAGAAA
ATGGAAGGCG TGGCGAACGT CCACCTGGAT GCGCTGGAAC ACGGCGATAC TATCGCGTTT
ATGCACAGCG TGCAGGACGG CGCGGCAAGT AAGAGCTATG GCCTGGCGGT CGCGGCGCTT
GCGGGCGTCC CTAAAGAAGT CATCAAACGC GCGCGCCAAA AACTCCGCGA GCTGGAGAGC
ATCTCGCCCA ATGCGGCGGC AACGCAGGTG GACGGCACGC AAATGTCGTT GCTTGCCGCC
CCGGAAGAGA CATCGCCCGC CGTTGAAGCG CTGGAGAATC TCGATCCGGA CTCCCTGACG
CCACGCCAGG CGCTGGAGTG GATCTATCGG CTGAAAAGTC TGGTGTAA
 
Protein sequence
MNESFDKDFS NHTPMMQQYL KLKAQHPEIL LFYRMGDFYE LFYDDAKRAS QLLDISLTKR 
GASAGEPIPM AGIPHHAVEN YLAKLVNQGE SVAICEQIGD PATSKGPVER KVVRIVTPGT
ISDEALLQER QDNLLAAIWQ DGKGYGYATL DISSGRFRLS EPADRETMAA ELQRTNPAEL
LYAEDFAEMA LIEGRRGLRR RPLWEFEIDT ARQQLNLQFG TRDLVGFGVE NASRGLCAAG
CLLQYVKDTQ RTSLPHIRSI TMERQQDSII MDAATRRNLE ITQNLAGGVE NTLAAVLDCT
VTPMGSRMLK RWLHMPVRNT DILRERQQTI GALQDTVSEL QPVLRQVGDL ERILARLALR
TARPRDLARM RHAFQQLPEL HAQLETVDSA PVQALRKKMG DFAELRDLLE RAIIDAPPVL
VRDGGVIAPG YHEELDEWRA LADGATDYLD RLEIRERERT GLDTLKVGYN AVHGYYIQIS
RGQSHLAPIN YVRRQTLKNA ERYIIPELKE YEDKVLTSKG KALALEKQLY DELFDLLLPH
LADLQQSANA LAELDVLVNL SERAWTLNYT CPTFTDKPGI RITEGRHPVV EQVLNEPFIA
NPLNLSPQRR MLIITGPNMG GKSTYMRQTA LIALLAYIGS YVPAQNVEIG PIDRIFTRVG
AADDLASGRS TFMVEMTETA NILHNATENS LVLMDEIGRG TSTYDGLSLA WACAENLANK
IKALTLFATH YFELTQLPEK MEGVANVHLD ALEHGDTIAF MHSVQDGAAS KSYGLAVAAL
AGVPKEVIKR ARQKLRELES ISPNAAATQV DGTQMSLLAA PEETSPAVEA LENLDPDSLT
PRQALEWIYR LKSLV