Gene Hmuk_0366 details


Gene Information

Locus tag: Hmuk_0366
Symbol:
ID: 8409864
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 361109
End bp: 363853
Gene Length: 2745 bp
Protein Length: 914 aa
Translation table: 11
GC content: 69%
IMG OID: 645018691
Product: DNA mismatch repair protein MutS
Protein accession: YP_003176210
Protein GI: 257386437
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
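
The start, end, and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is not part of the record. It assumes the coordinates are 1-based and inclusive, and that the reported gene length includes the terminal stop codon; the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(361109, 363853)
print(length)                  # 2745
print(protein_length(length))  # 914
```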


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0942134
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.39879
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGAGG CGACGGGTAT CGTCGGCGAG TTCCTCTCGC TGAAGGCGGA CAGTGACGCG 
GATCTGCTGG CGATGCAGTG TGGCGACTTC TACGAGTTCT TCGACGAGGA CGCCGAGATC
GTCGCCGACG AGCTCGACCT GAAGGTGAGT CAGAAGTCCT CCCACGGCGC TTCGTACCCG
ATGGCGGGCG TGCCGGTCGA CGACCTGACC CCCTACGTCT CCGCGCTGGT CGAGCGGGGC
TACCGGGTCG CCGTCGCCGA CCAGCACGAG ACGGCGGACG GACACGCCCG CGAGATCACG
CGCGTCGTGA CGCCGGGAAC CCACCTGGAG ACGACCGACG ACGCCGCCCA GTACCTCGCT
GCCGTCGTCG GTACCGGCGG CGACTCGTCG GACCCAGTGG CGGGAGCCGA CGGCAGCTAC
GGGCTGGCCG TGGCCGACGT GACTACCGGC CAGTTCCACG TGACCCAACT CGACGGCCCG
GACGCTCGCA CGCGGATTCA GACGGAGCTG TACAAGTTCG ACCCCGCGGA GGTGTTGCCC
GGCCCCGACC TGCGTGGCGA CGACGACTTC CTCGATCGCC TGCGCGAGCG GACCGACGCG
GCGCTGACGG TCCACACCGA GGAGTCGTTC GCGCCCGGAC GAGCGCGCCA CCGCGTTCGC
GACCACTTCG GCCCCGAGAC CGTCGACAGC GTCGGTATCG CCGACGACGA CGCGGCCATC
TGTGCCGCCG GTGCCGTCCT GTCCTACGTC GAGGACACCG GCGTCGGGAC GCTGGCCTCG
ATCACGCGCC TGCAGTCCTA CGGCGCGACC GACCACGTCA ATCTGGACGC GACGACCCAG
CGCAACCTCG AACTGACCGA GACGATGCAG GGCGGGACCG ACGGCTCGCT GTTCGATACG
ATCGACCACA CGGCGACGGC TGCCGGCGCA CGCCTGCTCC GGGAGTGGCT CCAGCGGCCA
CGCAGAGATC GCGCCGAGCT GACGCGACGC CAGTCCTGTA TCGCCGCGCT GACCGAGGCA
GCGATGGCTC GCGAACAGCT CTGTGAGACG CTCTCGGACG CGTACGACCT CGAACGACTC
GCGTCGAAGG CGGTCTCGGG CAGCGCCGAC GCCCGCGACC TGCGCGCGGT CGCGGGGACG
CTGTCCCTGC TGGAAACGGT CGAAGACGTG ATCGACGAGG ACGACCGACT GGCCGACTCG
CCGCTGGCCG ACGCCCTCGA TCGGCTCGAC CGCGAGGTCG TCGCCGACCT CGCCGCGGAA
CTGGACGCCG CCCTCGTCGA CGACCCGCCG GGGACGGTCC GCCAGGGAGG CCTCTTCTGC
CGGGGGTACG ACGAGGAACT GGACGCGGTG ATCGAGCGCC ACGAGGACGC GCTGGAGTGG
CTGGAGACGC TGCCCGACCG CGAGAAAGAG ACGACCGGCA TCACGCACCT CTCGGTCGAC
CGCAACAAGA CCGACGGCTA CTACATCCAG GTCGGCAAGA GCGAGGCCGA CGACGTGCCC
GACCGCTACG AGGGGATCAA GGAACTGAAG AACTCGAAGC GGTTCACGAC CGACGAACTG
CAGGAGAGAG AGCGGGCGGT GTTCCGCCTC GAAGAACAGC GCCACGAGAT GGAGTACGAA
CTGTTCGGCG ACCTGCGGGA GCGAGTCGCC GAGGACGCCA CGCTCCTGCA GGACGCGGGC
CGCGTGCTGG CCGAACTGGA CGCGGTGGCG TCGCTGGCGA TCCACGCCGT CCGCAACGAC
TGGACGCGTC CGGAACTCAC CGACGGCCGC GAACTCGACG TGGAAGCGGG CCGGCACCCC
GTCGTCGAGC AGACCACGGA GTTCGTTCCC AACGACCTCC GGATGGACGG GGCGACTTCG
CCGCCGGACC AAGAGCGTGC CAGCGGGCAC GCGGACGCGG ATCGTCAGTT CCTGATCGTC
ACCGGCCCCA ACATGAGCGG CAAGTCGACG TACATGCGAC AGGCCGCGCT GATCACGCTG
CTGGCACAGG TCGGCAGTTT CGTCCCCGCA CGCGCGGCGA CGGTCGGACT CGTCGACGGG
ATCTACACCC GCGTCGGCGC GCTGGACGAA CTCGCACAGG GCCGGTCGAC GTTCATGGTC
GAGATGCAGG AACTCAGCAA CATCCTCCAC TCCGCGACCG ACGACTCGCT GGTGATACTC
GACGAGGTGG GCCGTGGCAC CGCGACCTAC GACGGCATCT CGATCGCCTG GGCGGCGACC
GAGTACCTGC ACAACGAGGT CCGGGCGAAG ACCCTCTTTG CGACACACTA CCACGAGCTG
ACCTCGCTCG CCGACCACCT CGACCGCGTC CACAACGTCC ACGTCGCTGC CGACGAGTCC
GACGGTGACG TGACCTTCCT CCGGACGGTG CGAGACGGGC CGACCGACCG CTCCTACGGG
ATCCACGTCG CCGACCTGGC GGGCGTCCCA GAGCCCGTCG TCGACCGCTC CCGGACGGTG
CTCGATCGCC TGCGCGAAGA GAAGGCCATC GAGGCGAAAG GCAGCGGGTC GTCCGAACCC
GTCCAGACCG TCTTCGACGT GAACGCGGGC GGGTTCAAAC GCGCCGACGA GAGCGAGACG
GCCACCGCCG ACGGTGGAAC GGCGGCCGAG GCGTCGGACG GGGGACTGGA CCCCGAAACC
GAGGCCGTCG TCGAGGAACT GACCGAACTC GACGTCAACG AGACGCCGCC GGTCGAGCTG
CTGGCGACGG TCCAGCAGTG GCAAGAGCAA CTGGACGACG GCTGA
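
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before any calculation. The following is a minimal sketch of that clean-up plus a GC-fraction calculation, the quantity reported as "GC content" above. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGACAGAGG CGACGGGTAT CGTCGGCGAG TTCCTCTCGC TGAAGGCGGA CAGTGACGCG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{100 * gc_fraction(seq):.1f}% GC")
```

Passing the full sequence block through `clean_sequence` should give a string whose length matches the reported gene length, which is a quick check that nothing was lost in copying.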
 
Protein sequence
MTEATGIVGE FLSLKADSDA DLLAMQCGDF YEFFDEDAEI VADELDLKVS QKSSHGASYP 
MAGVPVDDLT PYVSALVERG YRVAVADQHE TADGHAREIT RVVTPGTHLE TTDDAAQYLA
AVVGTGGDSS DPVAGADGSY GLAVADVTTG QFHVTQLDGP DARTRIQTEL YKFDPAEVLP
GPDLRGDDDF LDRLRERTDA ALTVHTEESF APGRARHRVR DHFGPETVDS VGIADDDAAI
CAAGAVLSYV EDTGVGTLAS ITRLQSYGAT DHVNLDATTQ RNLELTETMQ GGTDGSLFDT
IDHTATAAGA RLLREWLQRP RRDRAELTRR QSCIAALTEA AMAREQLCET LSDAYDLERL
ASKAVSGSAD ARDLRAVAGT LSLLETVEDV IDEDDRLADS PLADALDRLD REVVADLAAE
LDAALVDDPP GTVRQGGLFC RGYDEELDAV IERHEDALEW LETLPDREKE TTGITHLSVD
RNKTDGYYIQ VGKSEADDVP DRYEGIKELK NSKRFTTDEL QERERAVFRL EEQRHEMEYE
LFGDLRERVA EDATLLQDAG RVLAELDAVA SLAIHAVRND WTRPELTDGR ELDVEAGRHP
VVEQTTEFVP NDLRMDGATS PPDQERASGH ADADRQFLIV TGPNMSGKST YMRQAALITL
LAQVGSFVPA RAATVGLVDG IYTRVGALDE LAQGRSTFMV EMQELSNILH SATDDSLVIL
DEVGRGTATY DGISIAWAAT EYLHNEVRAK TLFATHYHEL TSLADHLDRV HNVHVAADES
DGDVTFLRTV RDGPTDRSYG IHVADLAGVP EPVVDRSRTV LDRLREEKAI EAKGSGSSEP
VQTVFDVNAG GFKRADESET ATADGGTAAE ASDGGLDPET EAVVEELTEL DVNETPPVEL
LATVQQWQEQ LDDG
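
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a codon table built from the standard NCBI ordering; translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as starts. This gene begins with ATG, so alternative start codons are not handled here. The function and variable names are illustrative, not part of the record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# The amino-acid assignments are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence from the record.
print(translate("ATGACAGAGG CGACGGGTAT CGTCGGCGAG"))  # MTEATGIVGE
print(translate("GACGACG GCTGA"))                     # DDG
```

The two outputs match the first ten and last three residues of the protein sequence listed above.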