Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_2125 |
Symbol | |
ID | 8429107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 2299456 |
End bp | 2302149 |
Gene Length | 2694 bp |
Protein Length | 897 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 645034445 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_003191576 |
Protein GI | 258515354 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0565829 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.043993 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATCATA CACCGATGAT CAAACAATAC CTGCAGATTA AAGAGCGGCA TACTGATGCT ATTCTCTTTT TCCGGTTAGG CGATTTTTAT GAGATGTTTT TTGAAGACGC CCACTGTGCC TCCCGTGAGC TGGATATAAC CCTGACCGGT CGCGAAGGTG GCCGGGAAGA ACGCATTCCC ATGTGCGGTG TTCCCTATCA CGCTGCGGAG GGCTACATAG CCCGCCTGGT GGAAAAAGGT TATAAAGTTG CCATATGTGA GCAGGTAGAA GACCCTAAGG CTGTGAAAGG CATTGTGCGC AGGGAGGTAG TCCGTGTAAT CACGCCGGGC ACTATATTGT CAGGCAGTTT TATAGAAGAC AAAAGAAATA ATTTTATTAT CTCTGTCAGC CGGGAAAAAG AGCATTATGG ACTGGCAGTT GTTGATCTGG GAACCGGTCT TTTTATGGTA ACTGAATTTG CTATTGATGA CACCGCTCTG GCGGAGGAAA TATCTCGCCT GCAGCCCTCT GAAGCAGTTG TGGCGCGGGA TAGCTTTAGT AAATCAGAAT TAGGCATTAT TTTCTCTGGA ATCTTTAATA TAACTATCAG CCAACAGCCA CTTGATTTTT TCACCTTTAC CCAGGCCTAC AAGTCGCTGG TAGATAACAT TGGAGAGGAA AAATTAACTG AAAACAAGCT GCTTGAGATG ACTGCTGCCG TGTGTGCGGC AGGAGGTCTT TTAACATATC TTAAAGAAAC GCAAAAAACA GCTTTGCAGC ATTTGAATGC AATTACAGTA TACAAACCCT CAAGGTTTAT GGTTTTAGAC GCCACTACCA GGAAAAATCT GGAGCTTACA AAATCGCTGA GAGAAGGTAC TAAATGGGGT AGTTTGCTCT GGGTGCTCGA TCGAACAGTA ACGGCTATGG GCGGTAGATT GCTCAAAAAC TGGCTGGAAC AGCCTCTCTT ATCTGCTGCT AAAATCAATC TAAGACTGGA TGCTGTAGCT GAGCTGATAA AGGACGGCTT TATCAGGTAT GATTTAAAAG AAATACTGTC TAAGCTCTAC GATTTGGAGC GCTTGACCGG CAGGATCGCC TATGGTACAG CGGGGGCCCG TGATTTAAAT GCCATAAAAG TCTCACTGGC AGTTCTGCCG CAAATCAAAG AGATACTTGC CAGAACCAGC TCCGTACTGT TAAGCAAGCT GAGTGAACAA ATTGAGATAC TGGACGAGCT TTATGATTTA TTAGAAAGAG CTGTTATTGA TAATCCTCCT GTTTCTGTAC GTGAGGGAGG TATGATTAAG ACCGGGTTTA ATGAAAAAGT TGATTACCTG AGAAGTGCCG GTAAAGACGC AAAAAATTGG GTGGCGGAAA TGGAAGCCCG AGAAAGAGAG CGTACAGGTA TAAAGTCCCT TAAGGTAGGC TTTAACAAAG TCTTCGGCTA CTATTTGGAA GTAACTAAAA GCAATATTCA TCTGGTGCCG GAAGAGTATA TCAGAAAACA GACTTTAGCC AATGCTGAAA GATATATTAC GCCACAACTG AAAGAATATG AAAATATGAT ATTAGGGGCA CGGGATAAGT TAAATGAACT GGAATACAGT ATTTTTATCG ATCTAAGAAA TATTGTTGCT GATAAAATAC CCGCCCTCCA GAAGTCCGCC CGGGCAGCTG CCAGGGCGGA TGCGTTGATG GCACTGGCTG AAACTGCGGT GGAAGAAAGA TATTTACGCC CGTCAATTAA CTCTAACGGT TTAATCAAGA TTAAAGAGGG ACGCCACCCT GTGGTGGAAC GTGTCCTAAA AACCGGAGAG TTTGTGCCAA ATGATACGGA TATTAATGAG CAGGACAGGC GCCTTGTTCT CTTGACCGGA CCTAATATGG CAGGCAAAAG TACCTACATG AGACAGGTGG CCCTGATTGT TCTTTTAGCC CAGGTAGGCA GCTTTGTGCC GGCTGATTAC GCTGAAATAG GCATTGTGGA TAGGATTTTC ACCAGGGTCG GGGCTGCTGA TGATCTGGCC GGTGGGCAGA GCACCTTCAT GGTGGAAATG AACGAGTGCA AAGTTATCGT AGAAAATGCT ACGGCCCAAA GCCTGATCAT CATGGATGAA GTAGGTCGCG GCACCAGTAC TTATGACGGC ATTAGTATTG CCAGGGCCTT AATTGAATAT ATCAACAGGG ATCTGAAGGC CAAGACACTT TTTTCCACCC ACTATCATGA GTTGACTGAC CTGGATCAGT TGCATGGTGT AGTAAATCAT ACTGTCGCTG TGCAGGAAAC CGGAGATGGA ATCGTATTCC TGAGGAAGGT CATCCCAGGT AAAGCTGATC GCAGTTATGG CATACATGTG GCTAGGCTTG CCGGGATTCC CGAAAATATT TTGCTGAGGG CAGACGAGGT ATTAAACAGC TTGGAAGCCT GTTCTCCAAC CGGGCAGAGT CAGGCGGCAG CTTCTGTGGA TTTAAGCGAG ATATCAGCAT TTGCGGAGCT GTCGGTTATA AAAGAAGAAA AAAAAGACGT CAGCTCTATA ATGAGAGACA GTGAGGCTAA CCGGCTTTCC TGCTCTCAAG AGCCAGTACC TTGGCTAAGG CCGAAAATGC TGGCGATTTT AGAGGAGCTG GAGGGTTTAG ATATTTTAGC TATGAACCCC CTGCAAGCAA TGAACAAACT CTTTGAATTG CAGGGGAAGC TAAAAGATTA CTAA
|
Protein sequence | MNHTPMIKQY LQIKERHTDA ILFFRLGDFY EMFFEDAHCA SRELDITLTG REGGREERIP MCGVPYHAAE GYIARLVEKG YKVAICEQVE DPKAVKGIVR REVVRVITPG TILSGSFIED KRNNFIISVS REKEHYGLAV VDLGTGLFMV TEFAIDDTAL AEEISRLQPS EAVVARDSFS KSELGIIFSG IFNITISQQP LDFFTFTQAY KSLVDNIGEE KLTENKLLEM TAAVCAAGGL LTYLKETQKT ALQHLNAITV YKPSRFMVLD ATTRKNLELT KSLREGTKWG SLLWVLDRTV TAMGGRLLKN WLEQPLLSAA KINLRLDAVA ELIKDGFIRY DLKEILSKLY DLERLTGRIA YGTAGARDLN AIKVSLAVLP QIKEILARTS SVLLSKLSEQ IEILDELYDL LERAVIDNPP VSVREGGMIK TGFNEKVDYL RSAGKDAKNW VAEMEARERE RTGIKSLKVG FNKVFGYYLE VTKSNIHLVP EEYIRKQTLA NAERYITPQL KEYENMILGA RDKLNELEYS IFIDLRNIVA DKIPALQKSA RAAARADALM ALAETAVEER YLRPSINSNG LIKIKEGRHP VVERVLKTGE FVPNDTDINE QDRRLVLLTG PNMAGKSTYM RQVALIVLLA QVGSFVPADY AEIGIVDRIF TRVGAADDLA GGQSTFMVEM NECKVIVENA TAQSLIIMDE VGRGTSTYDG ISIARALIEY INRDLKAKTL FSTHYHELTD LDQLHGVVNH TVAVQETGDG IVFLRKVIPG KADRSYGIHV ARLAGIPENI LLRADEVLNS LEACSPTGQS QAAASVDLSE ISAFAELSVI KEEKKDVSSI MRDSEANRLS CSQEPVPWLR PKMLAILEEL EGLDILAMNP LQAMNKLFEL QGKLKDY
|
| |