Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2697 |
Symbol | |
ID | 4071599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3187411 |
End bp | 3190059 |
Gene Length | 2649 bp |
Protein Length | 882 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637984714 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_591772 |
Protein GI | 94969724 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.60859 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.57313 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGACC CGACAACGCC GTTGATGAAG CAGTGGGCCC AGGTCAAACG CGACCATCCC AATGCCCTGC TCTTCTTCCG ACTCGGGGAT TTCTACGAGC TGTTCTTTGA CGACGCCGTG ATCGCCGCGC GCGAATTGCA GATCACGCTG ACGGCGCGCA ACAAGGAAAA GGGCAACAGC GTTCCGATGT GCGGCGTGCC CTATCACGCC GCCGAGAACT ACATTTCCAA ACTGATTCGT CGCGGCTTCA AGGTTGCGGT CTGCGACCAG GTGGAAGACC CGAAACTCGC GAAGAAGTTG GTGAAACGCG AAGTGACGCG GGTGATGACA CCCGGCACAA CGGCCGATTC GCAGCTTGGT TCGGAAGAAA ACAACTTCCT TGCGGCGGTC GCGAGCCACG GAGATTTCGT CGGTTTTGCA GCACTTGATT TGTCAACCGG CGAATTTCGC GCGACCGAGT TCAAGGGTAG CGATGCCCGG CGGCGGATCC AGGAAGAGTT ACTGACGCTG CGTCCGCGAG AAATACTCTA CGGCTCTTCC CTGCCGTTGT TCGACGCCGC ACGACCGGCC AACGCTGTGA GTGGCGCGAT GCCGAAATTG GCAACGGTTG AGGGTGCGAG TTGGGCGGAG ACGCCGCTGG AAGACTGGGT GTTTGCTCCG GACTACGCGA TCCCGCTTGT TGAGAATCAT TTCGGCGTGT TGTCGCTAGA GGGATTCGGG CTGGCGAACA AGGCGTCGGC CGCGAGCGCA GCGGGAGCGA TCCTGCACTA CGTTCGCCAG ACGCAGCGCG GCTCGCTGCA TCACGTGGAC CGTATCGGCT TCTACGAACG GCAGAACTGC CTGGTGCTCG ATGCGGTGAC GGTGCGCAAC CTGGAGTTGA TTGAGCCGCT GTTCACAAAT ACTGGCGAGG GTGTAACTCT CTTTCGCGCA CTGGACGCGA CGATGACGCC GATGGGCAAG CGCCTGCTGC GGGCGTGGAT GCTGCGGCCT TCGATTGATA CGTCGGAGAT CAATGCGCGG CTGGATGCGA TCGAGATGCA GGTAGTGGAT ACACTCGGCC GCGAAGAACT GCGCCGGGCG ATGGACGGAA TTCTCGACAT CGAACGCTTG CTCAGCCGGG TGACGCTGGA GACCGCAAAT CCACGTGATC TGCTGGCGCT GGCCCAATGT TTTGGACGAC TGCCAAAGGT GCGGGCGGCC ATGCAGCGGT TCACGTCGGC ACGATTCCTG GTGCTCCATG GACTACTTGA TGACCTTGCT GACCTTCGCG ATCGCATCTT CACCACGCTG GTAGATGAGC CGCCTATCAC ACTGAACGAT GGCGGGGTGG TGCGCGAGGG ACTTGATGCT GCGCTCGATG AACTTCGCAA CCTGAGCCAC AACAGCAAGC AATTCATCGC GCAGATCGAA GAGCGCGAAC GCAAGCGCAC CGGGATCGGC TCGCTGAAAA TCAAGTTCAA CAATGTCTTC GGGTACTACC TCGAAATCTC AAATGCGAAT AAGCACCTTG CGCCGGCGGA CTACGAGCGC AAGCAGACGC TTGTAAACGC CGAGCGCTTC ACGACGCCAG AATTAAAGGA ATACGAGGCG AAGGTGCTGG ATGCGCAGGA GAAGATCGTC GAGATTGAAC GGCGGATCTT CGGCGAGCTG CGCACAGCGA TTGCGGCCGA GGCGCGGCGG GTGCGACAAA CGGGCTTGGC CCTAGCCGAA GTGGATGTGC TGGCGAATTT CGCACACCTG GCTGCAACGA GGAATTATTG TCGTCCGAAG TTCGATCAGA GCGGTGAGTT TGAGTTGATA GAGGCGCGGC ATCCGGTGAT TGAGTTGCCG GAATTGACGG GTAGCGCGGA CCGCTTCGTG CCGAACGATC TGTATTTGAA TGCGACAACC CATACGGTGA TTGTGCTTAC CGGACCGAAC ATGGGTGGCA AGTCTACCTA TCTTCGCCAA GCAGCGTTGG TTGCCGTGAT GGCGCAGATG GGCAGCTTCG TTCCAGCCCG CTCCGCGCGT CTGAGCGTGG TGGACCGGGT GTTCACACGC ATCGGCGCCG CCGACAATCT CGCGCGCGGA CGATCGACGT TCATGGTCGA GATGACCGAG ACGGCCGCGA TCCTGAATAC GGCGACGGAC CGTTCGCTGA TTCTGCTCGA CGAAGTAGGC CGGGGCACTT CCACCTATGA CGGGCTGGCG ATTGCGTGGG CGTGTATCGA GTTCCTGCAT GCACGAACGC GAGCGAAGGC TCTCTTTGCT ACGCATTACC ACGAGTTGAC CGTGCTCGCC GATGAGTTGA GCGGTGTGAA GAACTATCAC GTGTCGGTGA AAGAGAGCGG CGGGAATGTC GTGTTCCTAC GCAGGGTGGA ACCGGGCGCT GCGGACAAGA GCTACGGCAT CGAGGTCGCG AAGCTTGCGG GATTACCTGC AGAAGTCATC GAGCGTGCGC GGGCGGTGCT GAAGGAGCAT GAATCGGTCG AGCGGCAGGC GACCTCGCAT CTGTCGAAAG ACGAACGTGG ATCCGACTCT ATGCAATTGA CGATCTTCAC TCCTCTGTCG CAAAAGATTG TGGACCAACT GAAGGAGACG GACTTGAACC GCCTGACGCC GATCGAAGCG CTGAACCTAC TGCATGAGTT AAAGAAGCAG TTGGACTAA
|
Protein sequence | MNDPTTPLMK QWAQVKRDHP NALLFFRLGD FYELFFDDAV IAARELQITL TARNKEKGNS VPMCGVPYHA AENYISKLIR RGFKVAVCDQ VEDPKLAKKL VKREVTRVMT PGTTADSQLG SEENNFLAAV ASHGDFVGFA ALDLSTGEFR ATEFKGSDAR RRIQEELLTL RPREILYGSS LPLFDAARPA NAVSGAMPKL ATVEGASWAE TPLEDWVFAP DYAIPLVENH FGVLSLEGFG LANKASAASA AGAILHYVRQ TQRGSLHHVD RIGFYERQNC LVLDAVTVRN LELIEPLFTN TGEGVTLFRA LDATMTPMGK RLLRAWMLRP SIDTSEINAR LDAIEMQVVD TLGREELRRA MDGILDIERL LSRVTLETAN PRDLLALAQC FGRLPKVRAA MQRFTSARFL VLHGLLDDLA DLRDRIFTTL VDEPPITLND GGVVREGLDA ALDELRNLSH NSKQFIAQIE ERERKRTGIG SLKIKFNNVF GYYLEISNAN KHLAPADYER KQTLVNAERF TTPELKEYEA KVLDAQEKIV EIERRIFGEL RTAIAAEARR VRQTGLALAE VDVLANFAHL AATRNYCRPK FDQSGEFELI EARHPVIELP ELTGSADRFV PNDLYLNATT HTVIVLTGPN MGGKSTYLRQ AALVAVMAQM GSFVPARSAR LSVVDRVFTR IGAADNLARG RSTFMVEMTE TAAILNTATD RSLILLDEVG RGTSTYDGLA IAWACIEFLH ARTRAKALFA THYHELTVLA DELSGVKNYH VSVKESGGNV VFLRRVEPGA ADKSYGIEVA KLAGLPAEVI ERARAVLKEH ESVERQATSH LSKDERGSDS MQLTIFTPLS QKIVDQLKET DLNRLTPIEA LNLLHELKKQ LD
|
| |