Gene Hmuk_2432 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2432 
Symbol 
ID8411976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp2334292 
End bp2336061 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content72% 
IMG OID645020775 
ProductDNA mismatch repair protein MutS domain protein 
Protein accessionYP_003178249 
Protein GI257388476 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.00495271 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGACTGG AGGAATACTG GGGAATCGGG CCGAAGACGA GCGCGCTGCT CTCGTCGGAA 
CTGGGGGTCG ACCGCGCGGT TGCGGCCATC GAATCGGCCG ACACTCGCGA ACTGACGAAG
GCGGGTCTCT CTCGGGGCCG AGCGACGCGG ATCCTCAGGC GGGCGACGGG GGCCGAGGCG
ATGGATCTGC TGGCGACCCG CGACACCCGC GACGTGTACA AGGCGCTGCT GGACCTCGCC
GAAGAGCACG CGGTCACGGA ACACGCGGCC GACCGCATCC GAGTGCTGAC GCCGCTGCCG
GACCGGGCGG CGATGGAAGA GCGGCTCTCG TCCGTCCTCG ACGCGCGCGA CGCCTGGGCC
AGCACCGACG AGGCGACGCG AGACGACGTG CTGGCGGCGT TCGACCGCTA CGACAGCGTC
GCCGGGGGCG AGCTGTCTGC CGTCCGCACG GCGCTTGCGC TACAGGAGAC CGGCGTCGAC
GACGGGGTGT TCGAGCCGAT CGCGACACTT GACGGGGACC GACTCGGCGA CGCGGCACGT
GCGCTGGCCG GACTGCAAGG CGAGGGCGAC CGTGTCGGCG ACGGGGCGGA CGACCAACTC
GACGGACTGC GCGAGCGGCT CGGACAGGTG GAGGATCTCG CTGTGACCTC CGCGGCGGTC
GTCGAGGAAC TCCAGTCCGA GGCCCGCCGA CCCGACGAGT TTCGGGACGC GCTGGCTCGC
TACCTGACCA GCGAGACCGG CGTCGACGCC GCTCGCGTTC GCGACGCCGT CCCGCGGGAG
GCCGCCGACG CACAGGACTT CGTCGAGAGC GCGCTCCGGT CGCTCGCTGG CGATCTCCGC
AGCGCGGTAG ACGAACGCGA AGCGACCGTC GCCGCGACCC TGGAGGAGGA CCTCGCGGCC
GCCCGCGAGA CCGTCGACGC GGCCGTCGAG GCCGTCGACG CTGCCGCGCT GTACGTCTCG
CTGGCACGCT TCGCGCTGGC GTACGAGCTC GGACGTCCGA CGTTCGTCGC GGACCGGGAG
ACCATCGCCG TCCGTGGCGC GCGCAACCTC GCACTGCAGG ACGCCGGGGA CGACGTACAG
CCGGTGACCT ACGCCGTCGG AGACCACGAC CTCCCCGCCG GCCGCGAGCC GCCCACGGGT
GACCGGGTCG CCGTCCTGAC CGGCGCGAAC TCCGGTGGGA AGACGACGCT GCTGGAGACA
CTGTGTCAGG TGCAGCTGCT GGCCCAGATG GGGCTGCCCG TGCCGGCCGA GGACGCCGAA
GTGGGGCTCG TCGACGCGAT CGTCTTCCAC CGCCGTCACG CGTCGTTCAA CGCGGGCGTC
CTGGAGTCGA CGCTCCGATC CGTGGTTCCG CCGCTGACCG ACGCCGGGCG GACGCTGATG
CTGGTCGACG AGTTCGAGGC GATCACGGAA CCGGGCAGCG CCGCCAACCT CCTGCACGGG
CTGGTGACCC TGACCGTCGA GCGCGACGCG CTCGGCGTGT TCGTCACGCA CCTCGCGGAC
GATCTTGAGC CGCTCCCCGA GGCCGCACGG ACCGACGGGA TCTTCGCCGA GGGGCTGTCG
CCGGACCTCG ATCTCGAAGT CGACTACCAG CCGCGTTTCG AGACGGTCGG CAAGTCGACG
CCGGAGTTCA TCGTCTCTCG ACTGGTCGCA AACGCCGCCG ATCCGGTCGA ACGCACGGGC
TTCGAGACGC TGGCACAGGC GGTCGGCGAG GAGGCCGTCC AGCGAACCCT CTCTGACGCC
CGCTGGTCCG AGGGCGACGG CGACGACTGA
 
Protein sequence
MRLEEYWGIG PKTSALLSSE LGVDRAVAAI ESADTRELTK AGLSRGRATR ILRRATGAEA 
MDLLATRDTR DVYKALLDLA EEHAVTEHAA DRIRVLTPLP DRAAMEERLS SVLDARDAWA
STDEATRDDV LAAFDRYDSV AGGELSAVRT ALALQETGVD DGVFEPIATL DGDRLGDAAR
ALAGLQGEGD RVGDGADDQL DGLRERLGQV EDLAVTSAAV VEELQSEARR PDEFRDALAR
YLTSETGVDA ARVRDAVPRE AADAQDFVES ALRSLAGDLR SAVDEREATV AATLEEDLAA
ARETVDAAVE AVDAAALYVS LARFALAYEL GRPTFVADRE TIAVRGARNL ALQDAGDDVQ
PVTYAVGDHD LPAGREPPTG DRVAVLTGAN SGGKTTLLET LCQVQLLAQM GLPVPAEDAE
VGLVDAIVFH RRHASFNAGV LESTLRSVVP PLTDAGRTLM LVDEFEAITE PGSAANLLHG
LVTLTVERDA LGVFVTHLAD DLEPLPEAAR TDGIFAEGLS PDLDLEVDYQ PRFETVGKST
PEFIVSRLVA NAADPVERTG FETLAQAVGE EAVQRTLSDA RWSEGDGDD