Gene YpAngola_A0956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A0956 
SymbolmutS 
ID5799419 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp976914 
End bp979469 
Gene Length2556 bp 
Protein Length851 aa 
Translation table11 
GC content51% 
IMG OID641338948 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_001605520 
Protein GI162418943 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAATA ACGATAAACT CGACTCCCAC ACCCCGATGA TGCAGCAGTA TCTTCGGTTA 
AAAGCCCAGC ATCCTGAAAT ACTCCTGTTC TATCGAATGG GGGATTTTTA TGAACTGTTC
TATAGTGATG CCAAGCGAGC CTCACAACTG TTGGATATCT CACTGACTAA ACGTGGTGCT
TCAGCCGGTG AACCCATACC GATGGCAGGC GTCCCCTATC ATTCGATAGA AAACTATCTG
GCTAAGCTAG TGCAATTAGG TGAGTCAGCC GCCATCTGTG AGCAAATTGG TGATCCAGCC
ACCAGCAAAG GGCCAGTTGA ACGGAAGGTT GTCCGTATCG TTACACCGGG CACAATCAGC
GATGAAGCGC TGCTACAAGA GCGGCAAGAT AACTTACTGG CGGCTATCTG GCAGGATGCC
AAAGGGTTTG GGTATGCCAC TCTGGATATC AGCTCTGGCC GCTTCCGGGT TGCAGAACCC
GCCGATCTTG AAACGATGGC TGCCGAGTTA CAACGCACCA ATCCTGCCGA GTTACTGTAT
CCGGAAAACT TCGAGCCTAT GTCGTTGATC GAGCATCGAC ATGGCTTACG CCGCCGGCCT
TTATGGGAGT TTGAGCTGGA TACCGCCAAA CAACAGCTTA ATCTGCAATT CGGGACCCGT
GATTTAATTG GTTTCGGCGT TGAGCAAGCC CATCTGGCAC TGCGGGCGGC GGGCTGCCTG
CTGCAATATG TCAAAGATAC CCAACGCACA TCCCTGCCGC ATATCCGTGG CCTGACCATG
GAGCGCCAGC AAGATGGCAT CATTATGGAT GCTGCTACCC GTCGTAATCT CGAATTGACG
CAGAACCTAT CCGGTGGCAG TGAAAATACG CTGGCAGCCA TCCTCGATTG CAGCGTGACG
CCAATGGGTA GCCGGATGCT AAAACGCTGG TTACATATGC CAATCCGCGA TATTCGCGTG
CTCACGGATC GGCAACAAGC CATTGGTGGC CTACAAGATA TCGCCGCCGA GTTACAAACC
CCCTTGAGAC AAGTGGGCGA TTTAGAACGT ATTTTGGCAC GCTTAGCTCT GCGAACTGCC
CGTCCACGCG ATTTGGCCAG AATGCGTCAT GCTTTCCAGC AACTGCCAGA AATCCACCGT
TTATTGCAAC CTATTGATGT TCCTCATGTA CAGAACTTGT TATCACAGGT GGGCCAATTC
GACGAATTGC AAGACTTATT GGAGCGGGCC ATTGTCGAGA CGCCACCAGT ATTAGTCCGC
GATGGCGGCG TTATTGCATC AGGTTATAAC GCTGAGTTAG ATGAATGGCG GGCGCTGGCC
GATGGTGCAA CCGATTATCT GGATCGGTTG GAAATCCGTG AGCGGGAGAA GTTAGGGCTG
GACACACTAA AAGTGGGCTT TAATGGTGTA CATGGCTATT ACATTCAGGT TAGCCGTGGT
CAGAGCCATC TGGTGCCTAT TCATTATGTC CGTCGGCAAA CACTGAAAAA TGCCGAGCGC
TACATTATTC CGGAGCTGAA AGAGTACGAA GATAAGGTTC TGACCTCAAA AGGTAAGGCA
CTGGCAATTG AGAAAGGGTT GTACGAAGAA ATTTTCGATT TGCTGCTGCC GCATCTGCCA
GAATTACAAC TCAGTGCTAA TGCACTGGCT GAATTAGATG TACTGGCTAA TCTGGCGGAA
AGAGCTGAAA CACTCAACTA CTCTTGCCCA ACCCTGAGCG ATAAGCCGGG GATTAAGATT
ATGGGTGGCC GTCACCCGGT TGTGGAACAG GTCCTCAAAG AACCCTTTAT TTCTAACCCG
TTGACGTTAT CTCCTCAGCG ACGGATGTTG ATCATTACTG GGCCGAACAT GGGCGGCAAA
AGTACCTATA TGCGCCAAAC GGCGCTGATT GTGCTCTTGG CACACCTGGG AAGCTATGTC
CCTGCGGATC AGGCAACCAT CGGGCCTATT GACCGCATAT TTACCCGCGT CGGTGCCGCT
GACGATCTGG CCTCTGGTCG TTCGACCTTT ATGGTGGAAA TGACCGAGAC CGCGAATATT
CTGCATAACG CCACCGAACA AAGCCTGGTA TTGATGGATG AGATTGGCCG TGGCACATCC
ACCTATGATG GTTTGTCATT GGCCTGGGCT TGTGCAGAAA ATCTGGCCAG CCGTATCAAA
GCAATGACGC TATTTGCGAC GCATTACTTT GAATTAACGA CATTGCCAGA AAAAATGGAA
GGTGTGGTAA ATGTTCATCT TGATGCATTG GAGCACGGCG AAACCATCGC GTTTATGCAC
AGTGTACAAG AGGGTGCAGC CAGTAAAAGT TATGGCCTGG CAGTAGCCGC ACTGGCTGGT
GTGCCACGCG ATGTCATTAA GCGAGCACGA CAAAAACTGA AAGAGCTGGA ATCACTCTCT
AATAACGCCG CCGCCAGTAC GATTGATGGC TCACAAATGA CGTTGTTAAA TGAAGAAATC
CCTCCCGCAG TGGAAGCGCT GGAAGCGCTG GATCCGGATT CATTGTCACC GCGTCAGGCA
CTGGAGTGGA TCTATCGCTT GAAGAACATG GTGTAA
 
Protein sequence
MKNNDKLDSH TPMMQQYLRL KAQHPEILLF YRMGDFYELF YSDAKRASQL LDISLTKRGA 
SAGEPIPMAG VPYHSIENYL AKLVQLGESA AICEQIGDPA TSKGPVERKV VRIVTPGTIS
DEALLQERQD NLLAAIWQDA KGFGYATLDI SSGRFRVAEP ADLETMAAEL QRTNPAELLY
PENFEPMSLI EHRHGLRRRP LWEFELDTAK QQLNLQFGTR DLIGFGVEQA HLALRAAGCL
LQYVKDTQRT SLPHIRGLTM ERQQDGIIMD AATRRNLELT QNLSGGSENT LAAILDCSVT
PMGSRMLKRW LHMPIRDIRV LTDRQQAIGG LQDIAAELQT PLRQVGDLER ILARLALRTA
RPRDLARMRH AFQQLPEIHR LLQPIDVPHV QNLLSQVGQF DELQDLLERA IVETPPVLVR
DGGVIASGYN AELDEWRALA DGATDYLDRL EIREREKLGL DTLKVGFNGV HGYYIQVSRG
QSHLVPIHYV RRQTLKNAER YIIPELKEYE DKVLTSKGKA LAIEKGLYEE IFDLLLPHLP
ELQLSANALA ELDVLANLAE RAETLNYSCP TLSDKPGIKI MGGRHPVVEQ VLKEPFISNP
LTLSPQRRML IITGPNMGGK STYMRQTALI VLLAHLGSYV PADQATIGPI DRIFTRVGAA
DDLASGRSTF MVEMTETANI LHNATEQSLV LMDEIGRGTS TYDGLSLAWA CAENLASRIK
AMTLFATHYF ELTTLPEKME GVVNVHLDAL EHGETIAFMH SVQEGAASKS YGLAVAALAG
VPRDVIKRAR QKLKELESLS NNAAASTIDG SQMTLLNEEI PPAVEALEAL DPDSLSPRQA
LEWIYRLKNM V