Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E3143 |
Symbol | mutS |
ID | 6272547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2933350 |
End bp | 2935911 |
Gene Length | 2562 bp |
Protein Length | 853 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641727064 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001881523 |
Protein GI | 187734248 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGCAA TAGAAAATTT CGACGCCCAT ACGCCCATGA TGCAGCAGTA TCTCAGGCTG AAAGCCCAGC ATCCCGAGAT CCTGCTGTTT TACCGGATGG GTGATTTTTA TGAACTGTTT TATGACGACG CAAAACGCGC GTCGCAACTG CTGGATATTT CACTGACCAA ACGCGGTGCT TCGGCGGGAG AGCCGATCCC GATGGCGGGG ATTCCCTACC ATGCGGTGGA AAACTATCTC GCCAAACTGG TGAATCAGGG AGAGTCCGTT GCCATCTGCG AACAAATTGG CGATCCGGCG ACCAGCAAAG GTCCGGTTGA GCGCAAAGTT GTGCGTATCG TTACGCCAGG CACCATCAGC GATGAAGCCC TGTTACAGGA GCGTCAGGAC AACCTGCTGG CGGCTATCTG GCAGGACAGC AAAGGTTTCG GCTACGCGAC GCTGGATATC AGCTCCGGGC GTTTTCGCCT GAGCGAACCG GCCGACCGCG AAACGATGGC GGCAGAGCTG CAACGCACTA ATCCGGCGGA ACTGCTGTAT GCAGAAGATT TCGCCGAGAT GTCGCTGATT GAAGGCCGTC GCGGCCTGCG CCGTCGCCCG CTGTGGGAGT TTGAAATCGA CACCGCGCGC CAGCAGTTGA ATCTGCAATT TGGCACCCGC GATCTGGTCG GTTTTGGCGT CGAGAACGCG CCGCGCGGAC TTTGTGCTGC CGGTTGTCTG TTGCAGTATG CGAAAGATAC CCAACGCACG ACCCTGCCGC ATATTCGTTC TATCACCATG GAACGTGAGC AGGACAGCAT CATTATGGAT GCCGCGACAC GTCGTAACCT GGAAATTACT CAGAACCTGG CGGGCGGTGC GGAAAATACG CTGGCTTCTG TGCTCGACTG CACCGTCACG CCGATGGGTA GTCGTATGCT GAAACGCTGG CTGCATATGC CAGTGCGCGA TACCCGCGTG TTGCTTGAGC GCCAGCAAAC TATTGGCGCA TTGCAGGATT TCACCGCCGG GCTACAGCCG GTACTGCGTC AGGTCGGCGA CCTTGAACGT ATTCTGGCTC GTCTGGCTTT ACGAACTGCT CGCCCACGCG ATCTGGCCCG TATGCGCCAC GCTTTCCAGC AACTGCCGGA GCTGCGTGCA CAGTTAGAAA CTGTCGATAG TGCACCGGTA CAGGCGCTAC GTGAGAAGAT GGGCGAGTTT GCCGAGCTGC GCGATCTGCT GGAGCGAGCA ATCATCGACA CACCGCCGGT GCTGGTACGC GACGGTGGTG TTATCGCATC GGGCTATAAC GAAGAGCTGG ATGAGTGGCG CGCGCTGGCT GACGGCGCGA CCGATTATCT GGAGCGTCTG GAAGTCCGCG AGCGTGAACG TACCGGCCTG GACACGCTGA AAGTTGGCTT TAATGCGGTG CACGGCTACT ACATTCAAAT CAGCCGTGGG CAAAGCCATC TGGCACCCAT CAACTACATG CGTCGCCAGA CGCTGAAAAA CGCCGAGCGC TACATCATTC CAGAGCTAAA AGAGTACGAA GATAAAGTTC TCACCTCAAA AGGCAAAGCA CTGGCACTGG AAAAACAGCT TTATGAAGAG CTGTTCGACC TGCTGTTGCC GCATCTGGAA GCGTTGCAAC AGAGCGCGAG CGCGCTGGCG GAACTCGACG TGCTGGTTAA CCTGGCGGAA CGGGCCTATA CCCTGAACTA CACCTGCCCG ACCTTCATTG ATAAACCGGG CATTCGCATT ACCGAAGGTC GCCATCCGGT AGTTGAACAA GTACTGAATG AGCCATTTAT CGCTAACCCG CTGAATCTGT CGCCGCAGCG CCGTATGTTG ATTATTACCG GCCCGAACAT GGGCGGTAAA AGTACCTATA TGCGCCAGAC CGCACTGATT GCGCTGATGG CCTACATCGG CAGCTATGTA CCGGCACAAA AAGTCGAGAT TGGACCTATC GATCGCATCT TTACCCGCGT AGGCGCGGCA GATGACCTGG CTTCCGGGCG CTCAACCTTT ATGGTGGAGA TGACTGAAAC CGCCAATATT TTACATAACG CCACCGAATA CAGTCTGGTG TTAATGGATG AGATCGGGCG TGGAACGTCC ACCTACGATG GTCTGTCGCT GGCGTGGGCG TGCGCAGAAA ATCTGGCGAA TAAGATTAAG GCATTGACGC TGTTTGCCAC CCACTATTTC GAGCTGACCC AGCTACCGGA GAAAATGGAA GACGTCGCCA ACGTGCATCT CGATGCGCTG GAGCACGGAG ACACCATTGC CTTTATGCAT AGCGTGCAGG ATGGCGCGGC GAGCAAAAGC TACGGCCTGG CGGTTGCAGC TCTGGCAGGC GTGCCAAAAG AGGTGATTAA GCGCGCACGG CAAAAACTGC GTGAGCTGGA AAGCATTTCG CCGAACGCCG CCGCTACGCA AGTGGATGGT ACGCAAATGT CTTTGCTGTC AGTACCGGAA GAAACCTCGC CAGCGGTCGA AGCTCTGGAA AACCTGGACC CGGATTCACT GACTCCGCGT CAGGCGCTGG AATGGATTTA TCGCTTGAAG AGCCTGGTGT AA
|
Protein sequence | MSAIENFDAH TPMMQQYLRL KAQHPEILLF YRMGDFYELF YDDAKRASQL LDISLTKRGA SAGEPIPMAG IPYHAVENYL AKLVNQGESV AICEQIGDPA TSKGPVERKV VRIVTPGTIS DEALLQERQD NLLAAIWQDS KGFGYATLDI SSGRFRLSEP ADRETMAAEL QRTNPAELLY AEDFAEMSLI EGRRGLRRRP LWEFEIDTAR QQLNLQFGTR DLVGFGVENA PRGLCAAGCL LQYAKDTQRT TLPHIRSITM EREQDSIIMD AATRRNLEIT QNLAGGAENT LASVLDCTVT PMGSRMLKRW LHMPVRDTRV LLERQQTIGA LQDFTAGLQP VLRQVGDLER ILARLALRTA RPRDLARMRH AFQQLPELRA QLETVDSAPV QALREKMGEF AELRDLLERA IIDTPPVLVR DGGVIASGYN EELDEWRALA DGATDYLERL EVRERERTGL DTLKVGFNAV HGYYIQISRG QSHLAPINYM RRQTLKNAER YIIPELKEYE DKVLTSKGKA LALEKQLYEE LFDLLLPHLE ALQQSASALA ELDVLVNLAE RAYTLNYTCP TFIDKPGIRI TEGRHPVVEQ VLNEPFIANP LNLSPQRRML IITGPNMGGK STYMRQTALI ALMAYIGSYV PAQKVEIGPI DRIFTRVGAA DDLASGRSTF MVEMTETANI LHNATEYSLV LMDEIGRGTS TYDGLSLAWA CAENLANKIK ALTLFATHYF ELTQLPEKME DVANVHLDAL EHGDTIAFMH SVQDGAASKS YGLAVAALAG VPKEVIKRAR QKLRELESIS PNAAATQVDG TQMSLLSVPE ETSPAVEALE NLDPDSLTPR QALEWIYRLK SLV
|
| |