Gene SNSL254_A3114 details


Gene Information

Locus tag: SNSL254_A3114
Symbol: mutS
ID: 6485637
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3028527
End bp: 3031094
Gene Length: 2568 bp
Protein Length: 855 aa
Translation table: 11
GC content: 57%
IMG OID: 642738426
Product: DNA mismatch repair protein MutS
Protein accession: YP_002042150
Protein GI: 194445735
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
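
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The record does not state either convention, so both are assumptions.

```python
# Values copied from the Gene Information fields above.
start_bp = 3028527
end_bp = 3031094

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (2568 bp) and Protein Length (855 aa).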


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value: 0.0090493
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAATGAGT CATTTGATAA GGACTTCTCC AACCACACCC CGATGATGCA GCAGTATCTC 
AAGCTGAAAG CCCAGCATCC GGAGATCCTG CTCTTTTATC GTATGGGAGA CTTTTACGAG
CTGTTTTATG ACGACGCAAA GCGTGCGTCG CAGTTGCTCG ATATTTCGCT GACCAAACGC
GGCGCATCGG CTGGCGAACC TATCCCTATG GCGGGTATCC CGCACCACGC CGTAGAAAAC
TACCTCGCGA AACTGGTCAA TCAGGGCGAA TCCGTCGCTA TTTGCGAACA GATTGGCGAT
CCGGCCACCA GCAAAGGTCC CGTTGAACGT AAGGTCGTGC GTATCGTTAC GCCTGGCACC
ATCAGCGATG AAGCCCTGTT ACAGGAGCGT CAGGATAACC TGCTGGCGGC TATCTGGCAG
GATGGTAAAG GCTACGGTTA CGCCACGCTG GATATCAGCT CCGGGCGCTT TCGTCTGAGC
GAACCCGCCG ACCGTGAAAC GATGGCCGCA GAGCTGCAGC GCACCAATCC CGCCGAACTG
TTGTATGCCG AAGATTTTGC TGAAATGGCG TTAATAGAGG GACGCCGGGG TCTGCGCCGT
CGCCCGTTGT GGGAGTTTGA GATTGATACC GCCCGCCAGC AGTTAAATCT GCAATTCGGT
ACGCGGGATC TGGTCGGTTT CGGCGTCGAA AATGCCTCGC GTGGATTATG TGCGGCAGGC
TGCCTGTTAC AGTACGTAAA AGATACCCAG CGCACCTCCC TGCCGCATAT TCGTTCCATT
ACGATGGAAC GCCAGCAGGA CAGCATCATT ATGGATGCCG CGACCCGCCG CAATCTGGAA
ATTACCCAGA ACCTGGCCGG CGGTGTCGAA AATACCCTCG CCGCGGTGCT TGACTGTACC
GTGACGCCAA TGGGTAGCCG AATGCTTAAA CGCTGGCTGC ATATGCCGGT ACGAAATACC
AACATATTGC GTGAACGCCA GCAGACCATC GGCGCCTTGC AGGACACCGT CAGCGAACTG
CAACCGGTGC TGCGTCAGGT CGGCGATCTG GAGCGTATCC TTGCGCGTCT GGCGCTGCGC
ACCGCGCGTC CGCGCGATCT CGCCCGAATG CGTCACGCCT TCCAGCAACT GCCGGAATTG
CACGCTCAGC TTGAAACCGT TGACAGCGCG CCGGTACAGG CGTTGCGTAA AAAAATGGGC
GATTTCGCCG AGCTGCGCGA CCTCCTGGAA CGCGCCATTA TTGACGCGCC GCCGGTACTG
GTCCGCGACG GCGGCGTTAT TGCGCCCGGC TACCATGAAG AGCTGGACGA ATGGCGCGCG
CTGGCGGACG GCGCCACCGA TTATCTCGAT CGTCTGGAAA TTCGCGAGCG CGAGCGTACC
GGGCTGGATA CGCTAAAAGT CGGCTATAAC GCGGTACATG GTTATTACAT TCAGATTAGC
CGTGGTCAAA GCCATCTGGC GCCCATCAAT TATGTGCGTC GCCAGACGCT GAAAAATGCC
GAACGCTACA TTATTCCTGA GCTTAAAGAG TACGAAGATA AAGTTCTGAC CTCGAAAGGC
AAAGCGCTGG CGCTGGAAAA ACAGCTATAT GACGAATTGT TTGATCTGCT GTTGCCGCAT
CTGGCCGATT TACAGCAGAG CGCCAACGCG CTGGCGGAGC TGGATGTGCT GGTTAACCTG
GCCGAACGCG CCTGGACGCT GAATTACACC TGCCCGACAT TTACCGATAA ACCCGGTATC
CGTATTACCG AAGGTCGCCA CCCGGTGGTT GAACAGGTAC TGAATGAGCC GTTTATCGCT
AACCCGCTCA ATCTGTCGCC GCAGCGCCGA ATGTTGATCA TTACCGGCCC CAATATGGGC
GGTAAAAGTA CCTATATGCG ACAAACGGCA TTGATCGCCC TGCTGGCCTA TATCGGCAGT
TACGTTCCGG CGCAAAACGT GGAAATCGGC CCCATCGACC GTATTTTTAC CCGCGTCGGC
GCAGCGGACG ACCTGGCCTC CGGGCGTTCG ACCTTTATGG TGGAGATGAC CGAAACCGCG
AACATTCTGC ATAATGCCAC GGAAAACAGT CTGGTATTGA TGGATGAAAT CGGGCGCGGT
ACCTCCACGT ATGACGGGCT GTCGTTGGCC TGGGCCTGCG CGGAGAATCT CGCGAATAAG
ATTAAAGCGT TAACGCTGTT CGCTACCCAC TACTTCGAGC TGACCCAATT ACCGGAGAAA
ATGGAAGGCG TGGCGAACGT CCACCTGGAT GCGCTGGAAC ACGGCGATAC TATCGCGTTT
ATGCACAGCG TGCAGGACGG CGCGGCAAGT AAGAGCTATG GCCTGGCGGT CGCGGCGCTT
GCGGGCGTCC CTAAAGAAGT CATCAAACGC GCGCGCCAGA AACTCCGCGA GCTGGAAAGC
ATCTCGCCCA ATGCGGCGGC AACGCAGGTG GACGGCACGC AAATGTCGTT GCTTGCGGCC
CCGGAGGAGA CATCGCCTGC CGTTGAAGCG CTGGAAAATC TCGATCCGGA CTCTCTGACG
CCACGCCAGG CGCTGGAGTG GATCTATCGG CTGAAAAGTC TGGTGTAA
 
Protein sequence
MNESFDKDFS NHTPMMQQYL KLKAQHPEIL LFYRMGDFYE LFYDDAKRAS QLLDISLTKR 
GASAGEPIPM AGIPHHAVEN YLAKLVNQGE SVAICEQIGD PATSKGPVER KVVRIVTPGT
ISDEALLQER QDNLLAAIWQ DGKGYGYATL DISSGRFRLS EPADRETMAA ELQRTNPAEL
LYAEDFAEMA LIEGRRGLRR RPLWEFEIDT ARQQLNLQFG TRDLVGFGVE NASRGLCAAG
CLLQYVKDTQ RTSLPHIRSI TMERQQDSII MDAATRRNLE ITQNLAGGVE NTLAAVLDCT
VTPMGSRMLK RWLHMPVRNT NILRERQQTI GALQDTVSEL QPVLRQVGDL ERILARLALR
TARPRDLARM RHAFQQLPEL HAQLETVDSA PVQALRKKMG DFAELRDLLE RAIIDAPPVL
VRDGGVIAPG YHEELDEWRA LADGATDYLD RLEIRERERT GLDTLKVGYN AVHGYYIQIS
RGQSHLAPIN YVRRQTLKNA ERYIIPELKE YEDKVLTSKG KALALEKQLY DELFDLLLPH
LADLQQSANA LAELDVLVNL AERAWTLNYT CPTFTDKPGI RITEGRHPVV EQVLNEPFIA
NPLNLSPQRR MLIITGPNMG GKSTYMRQTA LIALLAYIGS YVPAQNVEIG PIDRIFTRVG
AADDLASGRS TFMVEMTETA NILHNATENS LVLMDEIGRG TSTYDGLSLA WACAENLANK
IKALTLFATH YFELTQLPEK MEGVANVHLD ALEHGDTIAF MHSVQDGAAS KSYGLAVAAL
AGVPKEVIKR ARQKLRELES ISPNAAATQV DGTQMSLLAA PEETSPAVEA LENLDPDSLT
PRQALEWIYR LKSLV
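
The sequences above are printed in space-separated blocks of ten residues, so they need cleaning before any downstream use. The following is a minimal sketch of that cleanup, together with a simple GC-percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any database tool, and the sample is only the first line of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Drop all whitespace (spaces, newlines) and upper-case the residues."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence above, used here as a small sample.
sample = "ATGAATGAGT CATTTGATAA GGACTTCTCC AACCACACCC CGATGATGCA GCAGTATCTC"
dna = clean_sequence(sample)
print(len(dna), dna[:3], round(gc_percent(dna), 1))
```

Applied to the full gene sequence, the cleaned length would be expected to match the listed Gene Length of 2568 bp, and the GC percentage could be compared against the listed GC content.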