Gene SeD_A3218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3218 
SymbolmutS 
ID6872537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3094770 
End bp3097337 
Gene Length2568 bp 
Protein Length855 aa 
Translation table11 
GC content57% 
IMG OID642786235 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_002216876 
Protein GI198242699 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.864156 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones76 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGAGT CATTTGATAA GGACTTCTCC AACCACACCC CGATGATGCA GCAGTATCTC 
AAGCTGAAAG CCCAGCATCC GGAGATCCTG CTCTTTTATC GTATGGGAGA CTTTTACGAG
CTGTTTTATG ACGACGCAAA GCGTGCGTCG CAGTTGCTCG ATATTTCGCT GACCAAACGC
GGCGCATCGG CTGGCGAACC TATCCCTATG GCGGGTATCC CGCACCACGC CGTAGAAAAC
TACCTCGCGA AACTGGTCAA TCAGGGCGAA TCCGTCGCTA TTTGCGAACA GATTGGCGAT
CCGGCCACCA GCAAAGGTCC CGTTGAACGT AAGGTCGTGC GTATCGTTAC GCCTGGCACT
ATCAGCGATG AAGCCCTGTT ACAGGAGCGT CAGGATAACC TACTGGCGGC TATCTGGCAG
GATGGTAAAG GTTACGGTTA CGCCACGCTG GATATCAGCT CCGGGCGCTT TCGTCTGAGC
GAACCCGCCG ACCGTGAAAC GATGGCCGCA GAGCTGCAGC GCACCAATCC CGCCGAACTG
TTGTATGCCG AAGATTTTGC TGAAATGGCG TTAATAGAGG GACGCCGGGG TCTGCGCCGT
CGCCCGTTGT GGGAGTTTGA GATTGATACC GCCCGCCAGC AGTTAAATCT GCAGTTCGGT
ACGCGGGATC TGGTCGGTTT CGGCGTCGAA AATGCCTCGC GTGGATTATG TGCGGCAGGC
TGCCTGTTAC AGTACGTAAA AGATACCCAG CGCACCTCCC TGCCGCATAT TCGTTCCATT
ACGATGGAAC GCCAGCAGGA CAGCATCATT ATGGATGCCG CGACCCGCCG CAATCTGGAA
ATTACCCAGA ACCTGGCCGG TGGTGTCGAA AATACCCTCG CCGCGGTGCT TGACTGTACC
GTGACGCCAA TGGGTAGCCG AATGCTTAAA CGCTGGCTGC ATATGCCGGT ACGAAATACC
GACATATTAC GTGAACGCCA GCAGACCATC GGCGCCTTGC AGGACACCGT CAGCGAACTG
CAACCGGTGC TGCGTCAGGT CGGCGATTTG GAGCGTATCC TTGCGCGTCT GGCGCTGCGC
ACCGCGCGTC CGCGCGATCT CGCCCGAATG CGTCACGCCT TCCAGCAACT GCCGGAATTG
CACGCTCAGC TTGAAACCGT TGACAGCGCG CCGGTACAGG CGTTGCGTAA AAAAATGGGC
GATTTCGCCG AGCTGCGCGA CCTCCTGGAA CGCGCCATTA TTGACGCGCC GCCGGTACTG
GTCCGCGACG GCGGCGTTAT TGCGCCCGGC TACCATGAAG AGCTGGACGA ATGGCGCGCG
CTGGCGGACG GCGCCACCGA TTATCTCGAT TGTCTGGAAA TTCGCGAGCG CGAGCGTACC
GGGCTGGATA CGCTGAAAGT CGGCTATAAC GCGGTACATG GTTATTACAT TCAGATTAGC
CGTGGTCAAA GCCATCTGGC GCCCATCAAT TATGTGCGTC GCCAGACGCT GAAAAATGCC
GAACGCTACA TTATTCCTGA GCTTAAAGAG TACGAAGATA AAGTTCTGAC CTCGAAAGGC
AAAGCGCTGG CGCTGGAAAA ACAGCTATAT GACGAATTGT TTGATCTGCT GTTGCCGCAT
CTGGCCGATT TACAGCAGAG CGCCAACGCG CTGGCGGAGC TGGATGTGCT GGTTAACCTG
GCCGAACGCG CCTGGACGCT GAATTACACC TGCCCGACAT TTACCGATAA ACCCGGTATC
CGTATTACCG AAGGTCGCCA CCCGGTGGTT GAACAGGTAC TGAATGAGCC GTTTATCGCT
AACCCGCTCA ATTTATCGCC GCAGCGCCGG ATGTTGATCA TTACCGGCCC CAATATGGGC
GGTAAAAGTA CCTATATGCG ACAAACGGCA TTGATTGCCC TGCTGGCCTA TATCGGCAGT
TACGTTCCGG CGCAAAACGT GGAAATCGGC CCCATCGACC GTATTTTTAC CCGCGTCGGC
GCAGCGGACG ACCTGGCCTC CGGGCGTTCG ACCTTTATGG TGGAGATGAC CGAAACCGCG
AACATTCTGC ATAATGCCAC GGAAAACAGT CTGGTATTGA TGGATGAAAT CGGGCGCGGT
ACCTCTACGT ATGACGGGCT GTCGTTGGCC TGGGCCTGCG CGGAGAATCT CGCGAATAAG
ATTAAAGCGT TAACGCTGTT CGCTACCCAC TACTTCGAGC TGACCCAATT ACCGGAGAAA
ATGGAAGGCG TGGCGAACGT CCACCTGGAT GCGCTGGAAC ACGGCGATAC TATCGCGTTT
ATGCACAGCG TGCAGGACGG CGCAGCAAGT AAGAGCTATG GCCTGGCGGT CGCGGCGCTT
GCGGGCGTCC CTAAAGAAGT CATCAAACGC GCGCGCCAAA AACTCCGCGA GCTGGAGAGC
ATCTCGCCCA ATGCGGCGGC AACGCAGGTG GACGGCACGC AAATGTCGTT GCTTGCCGCC
CCGGAGGAGG CATCGCCTGC CGTTGAAGCG CTGGAGAATC TCGATCCGGA CTCCCTGACG
CCACGCCAGG CGCTGGAGTG GATCTATCGG CTGAAAAGTC TGGTGTAA
 
Protein sequence
MNESFDKDFS NHTPMMQQYL KLKAQHPEIL LFYRMGDFYE LFYDDAKRAS QLLDISLTKR 
GASAGEPIPM AGIPHHAVEN YLAKLVNQGE SVAICEQIGD PATSKGPVER KVVRIVTPGT
ISDEALLQER QDNLLAAIWQ DGKGYGYATL DISSGRFRLS EPADRETMAA ELQRTNPAEL
LYAEDFAEMA LIEGRRGLRR RPLWEFEIDT ARQQLNLQFG TRDLVGFGVE NASRGLCAAG
CLLQYVKDTQ RTSLPHIRSI TMERQQDSII MDAATRRNLE ITQNLAGGVE NTLAAVLDCT
VTPMGSRMLK RWLHMPVRNT DILRERQQTI GALQDTVSEL QPVLRQVGDL ERILARLALR
TARPRDLARM RHAFQQLPEL HAQLETVDSA PVQALRKKMG DFAELRDLLE RAIIDAPPVL
VRDGGVIAPG YHEELDEWRA LADGATDYLD CLEIRERERT GLDTLKVGYN AVHGYYIQIS
RGQSHLAPIN YVRRQTLKNA ERYIIPELKE YEDKVLTSKG KALALEKQLY DELFDLLLPH
LADLQQSANA LAELDVLVNL AERAWTLNYT CPTFTDKPGI RITEGRHPVV EQVLNEPFIA
NPLNLSPQRR MLIITGPNMG GKSTYMRQTA LIALLAYIGS YVPAQNVEIG PIDRIFTRVG
AADDLASGRS TFMVEMTETA NILHNATENS LVLMDEIGRG TSTYDGLSLA WACAENLANK
IKALTLFATH YFELTQLPEK MEGVANVHLD ALEHGDTIAF MHSVQDGAAS KSYGLAVAAL
AGVPKEVIKR ARQKLRELES ISPNAAATQV DGTQMSLLAA PEEASPAVEA LENLDPDSLT
PRQALEWIYR LKSLV