Gene YpsIP31758_3292 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_3292 
SymbolmutS 
ID5387491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp3701698 
End bp3704253 
Gene Length2556 bp 
Protein Length851 aa 
Translation table11 
GC content51% 
IMG OID640866307 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_001402249 
Protein GI153948692 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATA ACGATAAACT CGACTCCCAC ACCCCGATGA TGCAGCAGTA TCTTCGGTTA 
AAAGCCCAGC ATCCTGAAAT ACTCCTGTTC TATCGAATGG GGGATTTTTA TGAACTGTTC
TATAGTGATG CCAAGCGAGC CTCACAACTG TTGGATATCT CACTGACTAA ACGTGGTGCT
TCAGCCGGTG AACCCATACC GATGGCAGGC GTCCCCTATC ATTCGATAGA AAACTATCTG
GCTAAGCTAG TGCAATTAGG TGAGTCAGCC GCCATCTGTG AGCAAATTGG TGATCCAGCC
ACCAGCAAAG GGCCAGTTGA ACGTAAGGTT GTCCGTATCG TTACACCGGG CACAATCAGC
GATGAAGCGC TGCTACAAGA GCGGCAAGAT AACTTACTGG CGGCTATCTG GCAGGATGCC
AAAGGGTTTG GGTATGCCAC TCTGGATATC AGCTCTGGCC GCTTCCGGGT TGCAGAACCC
GCCGATCTTG AAACGATGGC TGCCGAGTTA CAACGCACCA ATCCTGCCGA GTTACTGTAT
CCGGAAAACT TCGAGCCTAT GTCGTTGATC GAGCATCGAC ATGGCTTACG CCGCCGGCCT
TTATGGGAGT TTGAGCTGGA TACCGCCAAA CAACAGCTTA ATCTGCAATT CGGGACCCGT
GATTTAATTG GTTTCGGCGT TGAGCAAGCC CATCTGGCAT TGCGGGCGGC GGGCTGCCTG
CTGCAATATG TCAAAGATAC CCAACGCACA TCCCTGCCGC ATATCCGTGG CCTGACCATG
GAGCGCCAGC AAGATGGCAT CATTATGGAT GCCGCTACCC GTCGTAATCT CGAATTGACG
CAGAACCTAT CCGGTGGCAG TGAAAATACG CTGGCAGCCA TCCTCGATTG CAGCGTGACG
CCAATGGGTA GCCGGATGCT AAAACGCTGG TTACATATGC CAATCCGCGA TATTCGCGTG
CTCACGGATC GGCAACAAGC CATTGGTGGC CTACAAGATA TCGCCGCCGA GTTACAAACC
CCCTTGAGAC AAGTGGGCGA TTTAGAACGT ATTTTGGCAC GCTTAGCTCT GCGAACTGCG
CGTCCACGCG ATTTGGCCAG AATGCGTCAT GCTTTCCAGC AACTGCCAGA AATCCACCGT
TTATTGCAAC CTATTGATGT TCCTCATGTA CAGAACTTGT TATCACAGGT GGGCCAATTC
GACGAATTGC AAGACTTATT GGAGCGGGCC ATTGTCGAGA CGCCACCAGT ATTAGTCCGC
GATGGCGGCG TTATTGCATC AGGTTATAAC GCTGAGTTAG ATGAATGGCG GGCGCTGGCC
GATGGTGCAA CCGATTATCT GGATCGGTTG GAAATCCGTG AGCGAGAGAA GTTAGGGCTG
GACACACTAA AAGTGGGCTT TAATGGTGTA CATGGCTATT ACATTCAGGT TAGCCGTGGT
CAGAGCCATC TGGTGCCTAT TCATTACGTC CGTCGGCAAA CACTGAAAAA TGCCGAGCGC
TACATTATTC CGGAGCTGAA AGAGTACGAA GATAAGGTTC TGACCTCAAA AGGTAAGGCA
CTGGCAATTG AGAAAGGGTT GTACGAAGAA ATTTTCGATT TGCTGCTGCC GCATCTGCCA
GAATTACAAC TCAGTGCTAA TGCACTGGCT GAATTAGATG TACTGGCTAA TCTGGCGGAA
AGAGCTGAAA CACTCAACTA CTCTTGCCCA ACCCTGAGCG ATAAGCCGGG GATTAAGATT
ATGGGTGGCC GTCACCCGGT TGTGGAACAA GTCCTCAAAG AACCCTTTAT TTCTAACCCG
TTGACGTTAT CTCCTCAGCG ACGGATGTTG ATCATTACTG GGCCGAACAT GGGCGGCAAA
AGTACCTATA TGCGCCAAAC GGCGCTGATT GTGCTCTTGG CACACCTGGG AAGCTATGTC
CCTGCGGATC AGGCAACCAT CGGGCCTATT GACCGCATAT TTACCCGCGT CGGTGCCGCT
GACGATCTGG CCTCTGGTCG TTCGACCTTT ATGGTGGAAA TGACCGAGAC CGCGAATATT
CTGCATAACG CCACCGAACA AAGCCTGGTA TTGATGGATG AGATTGGCCG TGGCACATCC
ACCTATGATG GTTTGTCATT GGCCTGGGCT TGTGCAGAAA ATCTGGCCAG CCGTATCAAA
GCAATGACGC TATTTGCGAC GCATTACTTT GAATTAACGA CATTGCCAGA AAAAATGGAA
GGTGTGGCAA ATGTTCATCT TGATGCATTG GAACACGGCG AAACCATCGC GTTTATGCAC
AGTGTACAAG AAGGCGCAGC CAGTAAAAGT TATGGCCTGG CAGTAGCCGC ACTGGCCGGT
GTGCCGCGCG ATGTCATTAA GCGAGCACGA CAAAAACTGA AAGAGCTGGA ATCACTCTCT
AATAACGCCG CCGCCAGTAC GATTGATGGC TCACAAATGA CACTGTTAAA TGAAGAGATC
CCCCCCGCCG TGGAAGCGCT GGAAGCGCTG GATCCGGATT CATTGTCACC GCGTCAGGCA
CTGGAGTGGA TCTATCGCTT GAAGAACATG GTGTAA
 
Protein sequence
MKNNDKLDSH TPMMQQYLRL KAQHPEILLF YRMGDFYELF YSDAKRASQL LDISLTKRGA 
SAGEPIPMAG VPYHSIENYL AKLVQLGESA AICEQIGDPA TSKGPVERKV VRIVTPGTIS
DEALLQERQD NLLAAIWQDA KGFGYATLDI SSGRFRVAEP ADLETMAAEL QRTNPAELLY
PENFEPMSLI EHRHGLRRRP LWEFELDTAK QQLNLQFGTR DLIGFGVEQA HLALRAAGCL
LQYVKDTQRT SLPHIRGLTM ERQQDGIIMD AATRRNLELT QNLSGGSENT LAAILDCSVT
PMGSRMLKRW LHMPIRDIRV LTDRQQAIGG LQDIAAELQT PLRQVGDLER ILARLALRTA
RPRDLARMRH AFQQLPEIHR LLQPIDVPHV QNLLSQVGQF DELQDLLERA IVETPPVLVR
DGGVIASGYN AELDEWRALA DGATDYLDRL EIREREKLGL DTLKVGFNGV HGYYIQVSRG
QSHLVPIHYV RRQTLKNAER YIIPELKEYE DKVLTSKGKA LAIEKGLYEE IFDLLLPHLP
ELQLSANALA ELDVLANLAE RAETLNYSCP TLSDKPGIKI MGGRHPVVEQ VLKEPFISNP
LTLSPQRRML IITGPNMGGK STYMRQTALI VLLAHLGSYV PADQATIGPI DRIFTRVGAA
DDLASGRSTF MVEMTETANI LHNATEQSLV LMDEIGRGTS TYDGLSLAWA CAENLASRIK
AMTLFATHYF ELTTLPEKME GVANVHLDAL EHGETIAFMH SVQEGAASKS YGLAVAALAG
VPRDVIKRAR QKLKELESLS NNAAASTIDG SQMTLLNEEI PPAVEALEAL DPDSLSPRQA
LEWIYRLKNM V