Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_0366 |
Symbol | |
ID | 8409864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 361109 |
End bp | 363853 |
Gene Length | 2745 bp |
Protein Length | 914 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645018691 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_003176210 |
Protein GI | 257386437 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0942134 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.39879 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGAGG CGACGGGTAT CGTCGGCGAG TTCCTCTCGC TGAAGGCGGA CAGTGACGCG GATCTGCTGG CGATGCAGTG TGGCGACTTC TACGAGTTCT TCGACGAGGA CGCCGAGATC GTCGCCGACG AGCTCGACCT GAAGGTGAGT CAGAAGTCCT CCCACGGCGC TTCGTACCCG ATGGCGGGCG TGCCGGTCGA CGACCTGACC CCCTACGTCT CCGCGCTGGT CGAGCGGGGC TACCGGGTCG CCGTCGCCGA CCAGCACGAG ACGGCGGACG GACACGCCCG CGAGATCACG CGCGTCGTGA CGCCGGGAAC CCACCTGGAG ACGACCGACG ACGCCGCCCA GTACCTCGCT GCCGTCGTCG GTACCGGCGG CGACTCGTCG GACCCAGTGG CGGGAGCCGA CGGCAGCTAC GGGCTGGCCG TGGCCGACGT GACTACCGGC CAGTTCCACG TGACCCAACT CGACGGCCCG GACGCTCGCA CGCGGATTCA GACGGAGCTG TACAAGTTCG ACCCCGCGGA GGTGTTGCCC GGCCCCGACC TGCGTGGCGA CGACGACTTC CTCGATCGCC TGCGCGAGCG GACCGACGCG GCGCTGACGG TCCACACCGA GGAGTCGTTC GCGCCCGGAC GAGCGCGCCA CCGCGTTCGC GACCACTTCG GCCCCGAGAC CGTCGACAGC GTCGGTATCG CCGACGACGA CGCGGCCATC TGTGCCGCCG GTGCCGTCCT GTCCTACGTC GAGGACACCG GCGTCGGGAC GCTGGCCTCG ATCACGCGCC TGCAGTCCTA CGGCGCGACC GACCACGTCA ATCTGGACGC GACGACCCAG CGCAACCTCG AACTGACCGA GACGATGCAG GGCGGGACCG ACGGCTCGCT GTTCGATACG ATCGACCACA CGGCGACGGC TGCCGGCGCA CGCCTGCTCC GGGAGTGGCT CCAGCGGCCA CGCAGAGATC GCGCCGAGCT GACGCGACGC CAGTCCTGTA TCGCCGCGCT GACCGAGGCA GCGATGGCTC GCGAACAGCT CTGTGAGACG CTCTCGGACG CGTACGACCT CGAACGACTC GCGTCGAAGG CGGTCTCGGG CAGCGCCGAC GCCCGCGACC TGCGCGCGGT CGCGGGGACG CTGTCCCTGC TGGAAACGGT CGAAGACGTG ATCGACGAGG ACGACCGACT GGCCGACTCG CCGCTGGCCG ACGCCCTCGA TCGGCTCGAC CGCGAGGTCG TCGCCGACCT CGCCGCGGAA CTGGACGCCG CCCTCGTCGA CGACCCGCCG GGGACGGTCC GCCAGGGAGG CCTCTTCTGC CGGGGGTACG ACGAGGAACT GGACGCGGTG ATCGAGCGCC ACGAGGACGC GCTGGAGTGG CTGGAGACGC TGCCCGACCG CGAGAAAGAG ACGACCGGCA TCACGCACCT CTCGGTCGAC CGCAACAAGA CCGACGGCTA CTACATCCAG GTCGGCAAGA GCGAGGCCGA CGACGTGCCC GACCGCTACG AGGGGATCAA GGAACTGAAG AACTCGAAGC GGTTCACGAC CGACGAACTG CAGGAGAGAG AGCGGGCGGT GTTCCGCCTC GAAGAACAGC GCCACGAGAT GGAGTACGAA CTGTTCGGCG ACCTGCGGGA GCGAGTCGCC GAGGACGCCA CGCTCCTGCA GGACGCGGGC CGCGTGCTGG CCGAACTGGA CGCGGTGGCG TCGCTGGCGA TCCACGCCGT CCGCAACGAC TGGACGCGTC CGGAACTCAC CGACGGCCGC GAACTCGACG TGGAAGCGGG CCGGCACCCC GTCGTCGAGC AGACCACGGA GTTCGTTCCC AACGACCTCC GGATGGACGG GGCGACTTCG CCGCCGGACC AAGAGCGTGC CAGCGGGCAC GCGGACGCGG ATCGTCAGTT CCTGATCGTC ACCGGCCCCA ACATGAGCGG CAAGTCGACG TACATGCGAC AGGCCGCGCT GATCACGCTG CTGGCACAGG TCGGCAGTTT CGTCCCCGCA CGCGCGGCGA CGGTCGGACT CGTCGACGGG ATCTACACCC GCGTCGGCGC GCTGGACGAA CTCGCACAGG GCCGGTCGAC GTTCATGGTC GAGATGCAGG AACTCAGCAA CATCCTCCAC TCCGCGACCG ACGACTCGCT GGTGATACTC GACGAGGTGG GCCGTGGCAC CGCGACCTAC GACGGCATCT CGATCGCCTG GGCGGCGACC GAGTACCTGC ACAACGAGGT CCGGGCGAAG ACCCTCTTTG CGACACACTA CCACGAGCTG ACCTCGCTCG CCGACCACCT CGACCGCGTC CACAACGTCC ACGTCGCTGC CGACGAGTCC GACGGTGACG TGACCTTCCT CCGGACGGTG CGAGACGGGC CGACCGACCG CTCCTACGGG ATCCACGTCG CCGACCTGGC GGGCGTCCCA GAGCCCGTCG TCGACCGCTC CCGGACGGTG CTCGATCGCC TGCGCGAAGA GAAGGCCATC GAGGCGAAAG GCAGCGGGTC GTCCGAACCC GTCCAGACCG TCTTCGACGT GAACGCGGGC GGGTTCAAAC GCGCCGACGA GAGCGAGACG GCCACCGCCG ACGGTGGAAC GGCGGCCGAG GCGTCGGACG GGGGACTGGA CCCCGAAACC GAGGCCGTCG TCGAGGAACT GACCGAACTC GACGTCAACG AGACGCCGCC GGTCGAGCTG CTGGCGACGG TCCAGCAGTG GCAAGAGCAA CTGGACGACG GCTGA
|
Protein sequence | MTEATGIVGE FLSLKADSDA DLLAMQCGDF YEFFDEDAEI VADELDLKVS QKSSHGASYP MAGVPVDDLT PYVSALVERG YRVAVADQHE TADGHAREIT RVVTPGTHLE TTDDAAQYLA AVVGTGGDSS DPVAGADGSY GLAVADVTTG QFHVTQLDGP DARTRIQTEL YKFDPAEVLP GPDLRGDDDF LDRLRERTDA ALTVHTEESF APGRARHRVR DHFGPETVDS VGIADDDAAI CAAGAVLSYV EDTGVGTLAS ITRLQSYGAT DHVNLDATTQ RNLELTETMQ GGTDGSLFDT IDHTATAAGA RLLREWLQRP RRDRAELTRR QSCIAALTEA AMAREQLCET LSDAYDLERL ASKAVSGSAD ARDLRAVAGT LSLLETVEDV IDEDDRLADS PLADALDRLD REVVADLAAE LDAALVDDPP GTVRQGGLFC RGYDEELDAV IERHEDALEW LETLPDREKE TTGITHLSVD RNKTDGYYIQ VGKSEADDVP DRYEGIKELK NSKRFTTDEL QERERAVFRL EEQRHEMEYE LFGDLRERVA EDATLLQDAG RVLAELDAVA SLAIHAVRND WTRPELTDGR ELDVEAGRHP VVEQTTEFVP NDLRMDGATS PPDQERASGH ADADRQFLIV TGPNMSGKST YMRQAALITL LAQVGSFVPA RAATVGLVDG IYTRVGALDE LAQGRSTFMV EMQELSNILH SATDDSLVIL DEVGRGTATY DGISIAWAAT EYLHNEVRAK TLFATHYHEL TSLADHLDRV HNVHVAADES DGDVTFLRTV RDGPTDRSYG IHVADLAGVP EPVVDRSRTV LDRLREEKAI EAKGSGSSEP VQTVFDVNAG GFKRADESET ATADGGTAAE ASDGGLDPET EAVVEELTEL DVNETPPVEL LATVQQWQEQ LDDG
|
| |