Gene Hmuk_1006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1006 
Symbol 
ID8410523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp958999 
End bp961065 
Gene Length2067 bp 
Protein Length688 aa 
Translation table11 
GC content71% 
IMG OID645019341 
ProductDNA mismatch repair protein MutS domain protein 
Protein accessionYP_003176841 
Protein GI257387068 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCTCG AATCGGTCTC CGGCGTTGGG GAGAAGACGG CGGCGGCACT GGCCGAACTC 
GACGACCCCG AGCGGGCGCT GCGCGAGGGC GACGTGGCGA CGCTGGCCCA GGCCCCGGGG
ATCAGCGAGG GACGGGCCGC CCGAATCGCG CGAGCGGCCA TCCGAGACGA ACACGACGAC
CCCGGCGGGT GGGCCGCGAC CAGTCGCGCA CGCGAGATTT ACCGCGATGC GCTGGGACTG
ATACAGGACC GGACCGTCAC CGACTACGCC CGGAAGCGTC TGGAGACGAT CTATCCCAGC
GGCGTCCCCG AACGCATCGA GAGCGTCCGG GAGCGAGCCC GAGCGGCGAT GGAGCGAGAG
CCGGACGACG CCGTCCTCGA AGCCCTGGCG GGCGTCGAAC CGCTCGCCGA CCCGAGCGAC
GTGCGCGTCC GAGACCGGTG TCTGGCGACG ACCGACGCCG AGCGCTACGC CGAGGCACAG
GCGGCGATCC CGGAAGTCAC CGTCGAGGTG GTCGACGACG CCAGACAGCT CGCAGAGCTG
GGTCGCAGCT ACGCGACCGT CGTCGCCCTC GACGAGTCCT TCGCCGGCCT CGACGTGGAG
GGCGACGTGC GCGTCCAGCC CGACGCACTG GAGAACCCGG CCGACGTGGT CCCCGAGCGC
CCGCTGGCGT TTTTCACGCG CAACCGGGAC CGAATCCGGG CGGCCGTCGC AGTACACCGC
GTGGCCGACC TCGATTCCCC CTGTGATCTG GACGCCCTGG AGAGCGCGCT GGACCGGCTC
GCCGAAGACG GCAGCGTGCG GGGCGACGAC GAACTCGATC GCCTGACCGT CGCCGTCGAC
GATCTCGATG CGGCGGTCTC GACGGCCGAA TCCGCCGCCA ACGACCACCT CCGGGCGGCC
ATCGAAGAGC GAGACGTGAC CATCGAGGGG ACAGACCTGC TCTCGCTCGT CGAGCGAGGG
GCCGGGGTGG ACTCGCTGCT CTCCAGAGAG CTGGCCGACG AGTACGATGC CGCCATCTCG
AAGGCTCGCG ACCGCCTGAT CGAGACGCTG GGGCTGACAG ACACCGAGGC GGTCGCACGG
CGGGCCTTCC CCGACGACCC GACCTACCCC GTCGAACACA ACGAGGAGGC CGTCTCGCGC
CTGCGCGAGG AGCTGACCGC CGCACGGGAC CAGCGCGCCA CGCGACTGAA ACGAGAGCTG
GCCGACGATC TCGCCGAGAT GCGCGGGGCC GCCGAGAGCC TCGTCGAGAC CGCGCTCGAA
CTGGACGTCG AGCTCGCGAT CGCCCGGTTC GCCGCCGACT TCGACTGTAC GATGCCCGTG
GTCGGCGGGG TAGAGCCCGA GGGACGGAGC GCCTCGGAGC GGTCGGACGG GAGCGAGCCC
CGAGACGGCG CGGACGATCC CGGCTTCGCC ATCGAGGGCG GGCGCTCGCC GCTGCTGGAC
GTGCCCTTCG AAGCCGTCGA ACCGGTCGAC TACCGCGTCG ACGGCGTGGC GCTGCTGTCG
GGGGTCAACA GCGGCGGGAA GACCTCGACG CTGGACCTGG TGGCGCTGGT GACGACGCTT
GCCCACATGG GCCTGCCCGT CCCGGCCGAG TCCGCCCGGA TCGGCCGGGT GCGAGAACTC
CACTACCACG CCAAGACCCA GGGGACGCTG GACGCGGGGG CCTTCGAGGC GACGCTGCGG
GATTTCGGCG CGCTGGTAGC GGGCGTCGAC GAGAGTCCGG AAGCCACTCG CGCCGACCGC
GTGATGGTGC TGGTCGACGA ACTGGAGTCG ATCACGGAAC CCGGCGCGGC GGCGACGATC
GTCGCCGGCA TCCTCGAAGC GCTGGCAGAG CGCGACGCGA CGGGCGTGTT CGTCTCTCAC
CTCGCGGGCG ACATCATCGA CGCCGCCGAC GCCGATTTGA CCGTCGACGG GATTCAGGCC
GAGGGACTGG TCGACGGCGA ACTCCGGGTC AATCGCTCGC CCGTGAAGGG CCAGCTCGCC
CGGTCGACGC CGGAGCTGAT CGTCGAGAAG CTGGCAGACG ACAGCGAGAG CGACTTTTAC
GGCGACTTAC TCGGGAAGTT CGAGTAG
 
Protein sequence
MELESVSGVG EKTAAALAEL DDPERALREG DVATLAQAPG ISEGRAARIA RAAIRDEHDD 
PGGWAATSRA REIYRDALGL IQDRTVTDYA RKRLETIYPS GVPERIESVR ERARAAMERE
PDDAVLEALA GVEPLADPSD VRVRDRCLAT TDAERYAEAQ AAIPEVTVEV VDDARQLAEL
GRSYATVVAL DESFAGLDVE GDVRVQPDAL ENPADVVPER PLAFFTRNRD RIRAAVAVHR
VADLDSPCDL DALESALDRL AEDGSVRGDD ELDRLTVAVD DLDAAVSTAE SAANDHLRAA
IEERDVTIEG TDLLSLVERG AGVDSLLSRE LADEYDAAIS KARDRLIETL GLTDTEAVAR
RAFPDDPTYP VEHNEEAVSR LREELTAARD QRATRLKREL ADDLAEMRGA AESLVETALE
LDVELAIARF AADFDCTMPV VGGVEPEGRS ASERSDGSEP RDGADDPGFA IEGGRSPLLD
VPFEAVEPVD YRVDGVALLS GVNSGGKTST LDLVALVTTL AHMGLPVPAE SARIGRVREL
HYHAKTQGTL DAGAFEATLR DFGALVAGVD ESPEATRADR VMVLVDELES ITEPGAAATI
VAGILEALAE RDATGVFVSH LAGDIIDAAD ADLTVDGIQA EGLVDGELRV NRSPVKGQLA
RSTPELIVEK LADDSESDFY GDLLGKFE