Gene SbBS512_E3143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E3143 
SymbolmutS 
ID6272547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp2933350 
End bp2935911 
Gene Length2562 bp 
Protein Length853 aa 
Translation table11 
GC content56% 
IMG OID641727064 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_001881523 
Protein GI187734248 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGCAA TAGAAAATTT CGACGCCCAT ACGCCCATGA TGCAGCAGTA TCTCAGGCTG 
AAAGCCCAGC ATCCCGAGAT CCTGCTGTTT TACCGGATGG GTGATTTTTA TGAACTGTTT
TATGACGACG CAAAACGCGC GTCGCAACTG CTGGATATTT CACTGACCAA ACGCGGTGCT
TCGGCGGGAG AGCCGATCCC GATGGCGGGG ATTCCCTACC ATGCGGTGGA AAACTATCTC
GCCAAACTGG TGAATCAGGG AGAGTCCGTT GCCATCTGCG AACAAATTGG CGATCCGGCG
ACCAGCAAAG GTCCGGTTGA GCGCAAAGTT GTGCGTATCG TTACGCCAGG CACCATCAGC
GATGAAGCCC TGTTACAGGA GCGTCAGGAC AACCTGCTGG CGGCTATCTG GCAGGACAGC
AAAGGTTTCG GCTACGCGAC GCTGGATATC AGCTCCGGGC GTTTTCGCCT GAGCGAACCG
GCCGACCGCG AAACGATGGC GGCAGAGCTG CAACGCACTA ATCCGGCGGA ACTGCTGTAT
GCAGAAGATT TCGCCGAGAT GTCGCTGATT GAAGGCCGTC GCGGCCTGCG CCGTCGCCCG
CTGTGGGAGT TTGAAATCGA CACCGCGCGC CAGCAGTTGA ATCTGCAATT TGGCACCCGC
GATCTGGTCG GTTTTGGCGT CGAGAACGCG CCGCGCGGAC TTTGTGCTGC CGGTTGTCTG
TTGCAGTATG CGAAAGATAC CCAACGCACG ACCCTGCCGC ATATTCGTTC TATCACCATG
GAACGTGAGC AGGACAGCAT CATTATGGAT GCCGCGACAC GTCGTAACCT GGAAATTACT
CAGAACCTGG CGGGCGGTGC GGAAAATACG CTGGCTTCTG TGCTCGACTG CACCGTCACG
CCGATGGGTA GTCGTATGCT GAAACGCTGG CTGCATATGC CAGTGCGCGA TACCCGCGTG
TTGCTTGAGC GCCAGCAAAC TATTGGCGCA TTGCAGGATT TCACCGCCGG GCTACAGCCG
GTACTGCGTC AGGTCGGCGA CCTTGAACGT ATTCTGGCTC GTCTGGCTTT ACGAACTGCT
CGCCCACGCG ATCTGGCCCG TATGCGCCAC GCTTTCCAGC AACTGCCGGA GCTGCGTGCA
CAGTTAGAAA CTGTCGATAG TGCACCGGTA CAGGCGCTAC GTGAGAAGAT GGGCGAGTTT
GCCGAGCTGC GCGATCTGCT GGAGCGAGCA ATCATCGACA CACCGCCGGT GCTGGTACGC
GACGGTGGTG TTATCGCATC GGGCTATAAC GAAGAGCTGG ATGAGTGGCG CGCGCTGGCT
GACGGCGCGA CCGATTATCT GGAGCGTCTG GAAGTCCGCG AGCGTGAACG TACCGGCCTG
GACACGCTGA AAGTTGGCTT TAATGCGGTG CACGGCTACT ACATTCAAAT CAGCCGTGGG
CAAAGCCATC TGGCACCCAT CAACTACATG CGTCGCCAGA CGCTGAAAAA CGCCGAGCGC
TACATCATTC CAGAGCTAAA AGAGTACGAA GATAAAGTTC TCACCTCAAA AGGCAAAGCA
CTGGCACTGG AAAAACAGCT TTATGAAGAG CTGTTCGACC TGCTGTTGCC GCATCTGGAA
GCGTTGCAAC AGAGCGCGAG CGCGCTGGCG GAACTCGACG TGCTGGTTAA CCTGGCGGAA
CGGGCCTATA CCCTGAACTA CACCTGCCCG ACCTTCATTG ATAAACCGGG CATTCGCATT
ACCGAAGGTC GCCATCCGGT AGTTGAACAA GTACTGAATG AGCCATTTAT CGCTAACCCG
CTGAATCTGT CGCCGCAGCG CCGTATGTTG ATTATTACCG GCCCGAACAT GGGCGGTAAA
AGTACCTATA TGCGCCAGAC CGCACTGATT GCGCTGATGG CCTACATCGG CAGCTATGTA
CCGGCACAAA AAGTCGAGAT TGGACCTATC GATCGCATCT TTACCCGCGT AGGCGCGGCA
GATGACCTGG CTTCCGGGCG CTCAACCTTT ATGGTGGAGA TGACTGAAAC CGCCAATATT
TTACATAACG CCACCGAATA CAGTCTGGTG TTAATGGATG AGATCGGGCG TGGAACGTCC
ACCTACGATG GTCTGTCGCT GGCGTGGGCG TGCGCAGAAA ATCTGGCGAA TAAGATTAAG
GCATTGACGC TGTTTGCCAC CCACTATTTC GAGCTGACCC AGCTACCGGA GAAAATGGAA
GACGTCGCCA ACGTGCATCT CGATGCGCTG GAGCACGGAG ACACCATTGC CTTTATGCAT
AGCGTGCAGG ATGGCGCGGC GAGCAAAAGC TACGGCCTGG CGGTTGCAGC TCTGGCAGGC
GTGCCAAAAG AGGTGATTAA GCGCGCACGG CAAAAACTGC GTGAGCTGGA AAGCATTTCG
CCGAACGCCG CCGCTACGCA AGTGGATGGT ACGCAAATGT CTTTGCTGTC AGTACCGGAA
GAAACCTCGC CAGCGGTCGA AGCTCTGGAA AACCTGGACC CGGATTCACT GACTCCGCGT
CAGGCGCTGG AATGGATTTA TCGCTTGAAG AGCCTGGTGT AA
 
Protein sequence
MSAIENFDAH TPMMQQYLRL KAQHPEILLF YRMGDFYELF YDDAKRASQL LDISLTKRGA 
SAGEPIPMAG IPYHAVENYL AKLVNQGESV AICEQIGDPA TSKGPVERKV VRIVTPGTIS
DEALLQERQD NLLAAIWQDS KGFGYATLDI SSGRFRLSEP ADRETMAAEL QRTNPAELLY
AEDFAEMSLI EGRRGLRRRP LWEFEIDTAR QQLNLQFGTR DLVGFGVENA PRGLCAAGCL
LQYAKDTQRT TLPHIRSITM EREQDSIIMD AATRRNLEIT QNLAGGAENT LASVLDCTVT
PMGSRMLKRW LHMPVRDTRV LLERQQTIGA LQDFTAGLQP VLRQVGDLER ILARLALRTA
RPRDLARMRH AFQQLPELRA QLETVDSAPV QALREKMGEF AELRDLLERA IIDTPPVLVR
DGGVIASGYN EELDEWRALA DGATDYLERL EVRERERTGL DTLKVGFNAV HGYYIQISRG
QSHLAPINYM RRQTLKNAER YIIPELKEYE DKVLTSKGKA LALEKQLYEE LFDLLLPHLE
ALQQSASALA ELDVLVNLAE RAYTLNYTCP TFIDKPGIRI TEGRHPVVEQ VLNEPFIANP
LNLSPQRRML IITGPNMGGK STYMRQTALI ALMAYIGSYV PAQKVEIGPI DRIFTRVGAA
DDLASGRSTF MVEMTETANI LHNATEYSLV LMDEIGRGTS TYDGLSLAWA CAENLANKIK
ALTLFATHYF ELTQLPEKME DVANVHLDAL EHGDTIAFMH SVQDGAASKS YGLAVAALAG
VPKEVIKRAR QKLRELESIS PNAAATQVDG TQMSLLSVPE ETSPAVEALE NLDPDSLTPR
QALEWIYRLK SLV