Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_2432 |
Symbol | |
ID | 8411976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 2334292 |
End bp | 2336061 |
Gene Length | 1770 bp |
Protein Length | 589 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645020775 |
Product | DNA mismatch repair protein MutS domain protein |
Protein accession | YP_003178249 |
Protein GI | 257388476 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.00495271 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGACTGG AGGAATACTG GGGAATCGGG CCGAAGACGA GCGCGCTGCT CTCGTCGGAA CTGGGGGTCG ACCGCGCGGT TGCGGCCATC GAATCGGCCG ACACTCGCGA ACTGACGAAG GCGGGTCTCT CTCGGGGCCG AGCGACGCGG ATCCTCAGGC GGGCGACGGG GGCCGAGGCG ATGGATCTGC TGGCGACCCG CGACACCCGC GACGTGTACA AGGCGCTGCT GGACCTCGCC GAAGAGCACG CGGTCACGGA ACACGCGGCC GACCGCATCC GAGTGCTGAC GCCGCTGCCG GACCGGGCGG CGATGGAAGA GCGGCTCTCG TCCGTCCTCG ACGCGCGCGA CGCCTGGGCC AGCACCGACG AGGCGACGCG AGACGACGTG CTGGCGGCGT TCGACCGCTA CGACAGCGTC GCCGGGGGCG AGCTGTCTGC CGTCCGCACG GCGCTTGCGC TACAGGAGAC CGGCGTCGAC GACGGGGTGT TCGAGCCGAT CGCGACACTT GACGGGGACC GACTCGGCGA CGCGGCACGT GCGCTGGCCG GACTGCAAGG CGAGGGCGAC CGTGTCGGCG ACGGGGCGGA CGACCAACTC GACGGACTGC GCGAGCGGCT CGGACAGGTG GAGGATCTCG CTGTGACCTC CGCGGCGGTC GTCGAGGAAC TCCAGTCCGA GGCCCGCCGA CCCGACGAGT TTCGGGACGC GCTGGCTCGC TACCTGACCA GCGAGACCGG CGTCGACGCC GCTCGCGTTC GCGACGCCGT CCCGCGGGAG GCCGCCGACG CACAGGACTT CGTCGAGAGC GCGCTCCGGT CGCTCGCTGG CGATCTCCGC AGCGCGGTAG ACGAACGCGA AGCGACCGTC GCCGCGACCC TGGAGGAGGA CCTCGCGGCC GCCCGCGAGA CCGTCGACGC GGCCGTCGAG GCCGTCGACG CTGCCGCGCT GTACGTCTCG CTGGCACGCT TCGCGCTGGC GTACGAGCTC GGACGTCCGA CGTTCGTCGC GGACCGGGAG ACCATCGCCG TCCGTGGCGC GCGCAACCTC GCACTGCAGG ACGCCGGGGA CGACGTACAG CCGGTGACCT ACGCCGTCGG AGACCACGAC CTCCCCGCCG GCCGCGAGCC GCCCACGGGT GACCGGGTCG CCGTCCTGAC CGGCGCGAAC TCCGGTGGGA AGACGACGCT GCTGGAGACA CTGTGTCAGG TGCAGCTGCT GGCCCAGATG GGGCTGCCCG TGCCGGCCGA GGACGCCGAA GTGGGGCTCG TCGACGCGAT CGTCTTCCAC CGCCGTCACG CGTCGTTCAA CGCGGGCGTC CTGGAGTCGA CGCTCCGATC CGTGGTTCCG CCGCTGACCG ACGCCGGGCG GACGCTGATG CTGGTCGACG AGTTCGAGGC GATCACGGAA CCGGGCAGCG CCGCCAACCT CCTGCACGGG CTGGTGACCC TGACCGTCGA GCGCGACGCG CTCGGCGTGT TCGTCACGCA CCTCGCGGAC GATCTTGAGC CGCTCCCCGA GGCCGCACGG ACCGACGGGA TCTTCGCCGA GGGGCTGTCG CCGGACCTCG ATCTCGAAGT CGACTACCAG CCGCGTTTCG AGACGGTCGG CAAGTCGACG CCGGAGTTCA TCGTCTCTCG ACTGGTCGCA AACGCCGCCG ATCCGGTCGA ACGCACGGGC TTCGAGACGC TGGCACAGGC GGTCGGCGAG GAGGCCGTCC AGCGAACCCT CTCTGACGCC CGCTGGTCCG AGGGCGACGG CGACGACTGA
|
Protein sequence | MRLEEYWGIG PKTSALLSSE LGVDRAVAAI ESADTRELTK AGLSRGRATR ILRRATGAEA MDLLATRDTR DVYKALLDLA EEHAVTEHAA DRIRVLTPLP DRAAMEERLS SVLDARDAWA STDEATRDDV LAAFDRYDSV AGGELSAVRT ALALQETGVD DGVFEPIATL DGDRLGDAAR ALAGLQGEGD RVGDGADDQL DGLRERLGQV EDLAVTSAAV VEELQSEARR PDEFRDALAR YLTSETGVDA ARVRDAVPRE AADAQDFVES ALRSLAGDLR SAVDEREATV AATLEEDLAA ARETVDAAVE AVDAAALYVS LARFALAYEL GRPTFVADRE TIAVRGARNL ALQDAGDDVQ PVTYAVGDHD LPAGREPPTG DRVAVLTGAN SGGKTTLLET LCQVQLLAQM GLPVPAEDAE VGLVDAIVFH RRHASFNAGV LESTLRSVVP PLTDAGRTLM LVDEFEAITE PGSAANLLHG LVTLTVERDA LGVFVTHLAD DLEPLPEAAR TDGIFAEGLS PDLDLEVDYQ PRFETVGKST PEFIVSRLVA NAADPVERTG FETLAQAVGE EAVQRTLSDA RWSEGDGDD
|
| |