Gene Hmuk_0358 details


Gene Information

Locus tag: Hmuk_0358
Symbol:
ID: 8409856
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 350703
End bp: 353459
Gene Length: 2757 bp
Protein Length: 918 aa
Translation table: 11
GC content: 71%
IMG OID: 645018683
Product: DNA mismatch repair protein MutS
Protein accession: YP_003176202
Protein GI: 257386429
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
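
The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 350703
end_bp = 353459
gene_length_bp = 2757
protein_length_aa = 918

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon
# is a stop codon that does not contribute an amino acid.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print(span_bp, total_codons, coding_codons)  # prints: 2757 919 918
```

Under these assumptions the three fields agree: the span equals the gene length, and 919 codons minus one stop codon give the 918 aa protein length.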


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.613152
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGCGG CGCTGGGACC TCCCGAGAAG ATGGCCGAGC GGGCCGACGA GCTGACGCCG 
ATGATGCGCC AGTACTACGA GCTCTGTCGG GCCTACGACG ACTCGCTGGT CCTGTTTCAG
GTCGGGGACT TCTACGAGGC CTTCTGCGGT GCCGCCGAGC GGGTCGCCCG GCTCTGTGAG
ATCACGCTGA CCAAGCGCGA GGACTCCACT GGGCAGTACG CGATGGCCGG AGTCCCCATC
GACAACGCCG AGAGCTACGT CGAGACGCTG CTCGACGCCG GCTACCGCGT CGCCATCGCG
GACCAGGTCG AGGACCCCGA CGCGGTCAGC GGCGTCGTCG ACCGGGCGGT GACGCGGATC
ATCACGCCGG GGACGCTGAC CGAGGACGAA CTGCTCGACT CGCCGGACAA CAACTACGTC
GCCGCGCTGA CCGCCGACGG CCGCCGCTAC GGGCTGGCGC TGCTGGACGT GTCCACGGGC
GATTTCTACG CGACCAGCGC CGACGCCGTC GACGCCGTCG CCGACGAGGT GGGTCGGTTC
GCACCCGCGG AGGCGATCGT CGGCCCCGGC GTCGACGTGG ACGAGGATCG GGTCTTCGAG
GCGGCGGCGA TGGTGACGCC GTACGACGAG TCGGTGTTCG CCTTCGAGGA CGCCGCCGAC
CGAGTGCGGA CGTACTTCGG CGACCCGGAC GCGCTGCTGG CCGACGACCT GGAGGTGCGA
GCCTGCGGCG CGCTGCTTGC GTACGCCGAG TACACCCGCG GGGGCGGTGC GGGCACGAGC
GACGAGATCG CAGACGACGA CGGCGGGCAA CTGACGTATC TCACGCACCT CACGCGTTAC
GACCCCCGCG AGTACATGCT GCTGGACGCC GTCGCGCTCG ACAGCCTCGA ACTGTTCGAG
CGCCGGGCGG TGCGGGGCCA CGAGGGGCGC ACGCTGGTCG ACACCGTCGA CGAGACCGCC
TGCGCGCTCG GCCGGCGACG GCTCGGCGAC TGGCTGCGTC GGCCGCTGCT CGACGCCGAC
CGAATCGAAC GCCGCCACGA GGCGGTCGCC GAACTCGTCG AAGCGCTCCA GCGCCGCGAG
CGACTCCACG CCCTGCTCGC GGACGTGTAC GACCTCGAAC GGCTGATCTC TCGCGTCTCG
CGCGGGCGAG CCAACGCGCG GGACCTGCGC TCGCTGGCGG CCACGCTCGC AGTCGTCCCC
GACGTGCGTG AGCAACTGGC CGACGCCGAC AGCGCCCTGC TCGCGGACCT TCACGAGGGG
CTCGATCCGC TGACGGACGT GCGCGAGGAG ATCGAGGCGG CGATCTGTCC GGACCCGCCC
CAGGAAGTCA CCGAGGGTGA CGTGATCCGC GAGGGGTACG ACGACGACCT CGACGCGCTG
CGCGAGACCG AGCGGTCGGG CAAGCGATGG ATCGACGACC TGGAGATCAA CGAGCGCGAA
CGCACCGGAA TCGACTCGCT GAAGGTCGGG CACAACTCCG TCCACGGCTA CTACATCGAG
GTGACCGACC CCAACCTCGA CAGCGTCCCC GACGACTACG AGCGCCGCCA GACGCTGAAA
AACTCCGAGC GCTTCGTCAC GCCCGAACTC AGGGAGCGCG AAGAGGAGAT CGTCCGGGCA
GAGACGGCCG CCGACGACCT CGAATACGAC CTGTTCTGTG AGGTACGAGC GGCGATCGCG
GCCGAGGCCG AGCGCGTGCA GGCGCTGGCC GACCGCCTGG CGACGCTCGA CGCGCTCGTG
GCCTTCGGCG AGGTGGCGGC GACCCACGAC TATTGCCGAC CCAGCGTCGG CGGGGACGCC
ATCGACGTGA CGGCCGGCCG CCACCCCGTT GTCGAGCGCG CCGAGGCGTC GTTCGTTCCC
AACGACGCCT GCCTCACTCC GGACTCGTTT TTCACGATCC TCACTGGCCC CAACATGAGC
GGGAAATCGA CGTACATGCG CCAGATCGCG CTCATCTGCG TGCTGGCACA GGCCGGGAGT
TTCGTCCCCG CTCGCGAGGC GAACCTGCCG ATCGTCGACC GCGTGTTCAC CCGCGTCGGT
GCGAGCGACG ACATCGCCGG CGGGCGCTCG ACGTTCATGA TCGAGATGAC CGAACTCGCA
GACATCCTCC AGGGCGCGAC CAGCGACTCG CTGATCCTGC TGGACGAGGT CGGCCGGGGG
ACCTCGACGG CCGACGGGCT CGCCATCGCC CGCGCCGTCA CCGAACACGT CCACGACGAA
ATCGGGGCGT ACACGCTCTT TGCGACCCAC CACCACGAGC TGACGGCCGT CGCCGACGAA
CTGCCCGGCG TCCGCAACCG CCACTTCGAG ACGCGCCACG ACGGCGACGG CGTCGTCTTC
GAGCACAGCG TCGCCCCCGG CGCGGCCGCG GCGTCCTACG GGATCGAGGT CGCGGCCCTG
GCCGGCGTGC CCGATTCGGT GGTCGAGCGT TCTCGGACGG TGTTGGCCAG CGAGGACGAG
CGAAGCGAGT CCTCGGAAAC GAGAGCGGGA GCGGAGCGAC ACGCGGGCGG CGAGGACGAG
CGCAGCGAGT CCTCGGACGC CGAAAACGGC GCGGTGGCCC AGGCGTCGGC TCCGGCGAGC
GAGCCGCCGT CGGCGTCGGC CGACGGCCAC GCCGTCGTCG AGGCGGCGAC CGGTGACGAG
AGCGGACCTG ACGCCGACCC GCTCCGCGAG CGGCTGGCAC AGCTGGACGT GGCGACGATG
ACGCCCATCG AAGCGATGAA CGCGCTCGCC CGACTACAGG ACGACATCGC GGACTGA
 
Protein sequence
MDAALGPPEK MAERADELTP MMRQYYELCR AYDDSLVLFQ VGDFYEAFCG AAERVARLCE 
ITLTKREDST GQYAMAGVPI DNAESYVETL LDAGYRVAIA DQVEDPDAVS GVVDRAVTRI
ITPGTLTEDE LLDSPDNNYV AALTADGRRY GLALLDVSTG DFYATSADAV DAVADEVGRF
APAEAIVGPG VDVDEDRVFE AAAMVTPYDE SVFAFEDAAD RVRTYFGDPD ALLADDLEVR
ACGALLAYAE YTRGGGAGTS DEIADDDGGQ LTYLTHLTRY DPREYMLLDA VALDSLELFE
RRAVRGHEGR TLVDTVDETA CALGRRRLGD WLRRPLLDAD RIERRHEAVA ELVEALQRRE
RLHALLADVY DLERLISRVS RGRANARDLR SLAATLAVVP DVREQLADAD SALLADLHEG
LDPLTDVREE IEAAICPDPP QEVTEGDVIR EGYDDDLDAL RETERSGKRW IDDLEINERE
RTGIDSLKVG HNSVHGYYIE VTDPNLDSVP DDYERRQTLK NSERFVTPEL REREEEIVRA
ETAADDLEYD LFCEVRAAIA AEAERVQALA DRLATLDALV AFGEVAATHD YCRPSVGGDA
IDVTAGRHPV VERAEASFVP NDACLTPDSF FTILTGPNMS GKSTYMRQIA LICVLAQAGS
FVPAREANLP IVDRVFTRVG ASDDIAGGRS TFMIEMTELA DILQGATSDS LILLDEVGRG
TSTADGLAIA RAVTEHVHDE IGAYTLFATH HHELTAVADE LPGVRNRHFE TRHDGDGVVF
EHSVAPGAAA ASYGIEVAAL AGVPDSVVER SRTVLASEDE RSESSETRAG AERHAGGEDE
RSESSDAENG AVAQASAPAS EPPSASADGH AVVEAATGDE SGPDADPLRE RLAQLDVATM
TPIEAMNALA RLQDDIAD
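
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation such as recomputing the GC content field. The following is a minimal sketch of that clean-up and a GC fraction calculation. The function names clean_sequence and gc_fraction are hypothetical helpers written for this example, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Strip all whitespace (spaces, newlines) and uppercase the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, as printed in this record.
first_line = "ATGGACGCGG CGCTGGGACC TCCCGAGAAG ATGGCCGAGC GGGCCGACGA GCTGACGCCG"

seq = clean_sequence(first_line)
print(len(seq))                    # six blocks of ten bases
print(round(gc_fraction(seq), 2))  # GC fraction of this first line only
```

To compare against the GC content field, the whole gene sequence would have to be passed through clean_sequence first. The value for a single line can differ from the gene-wide figure.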