Gene Information |
Locus tag | Hmuk_0358 |
Symbol | |
ID | 8409856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 350703 |
End bp | 353459 |
Gene Length | 2757 bp |
Protein Length | 918 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 645018683 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_003176202 |
Protein GI | 257386429 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
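The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: the function names `gene_length` and `protein_length` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon.

```python
# Values copied from the Gene Information table.
START_BP = 350703
END_BP = 353459


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the results agree with the listed Gene Length (2757 bp) and Protein Length (918 aa).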
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.613152 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGCGG CGCTGGGACC TCCCGAGAAG ATGGCCGAGC GGGCCGACGA GCTGACGCCG ATGATGCGCC AGTACTACGA GCTCTGTCGG GCCTACGACG ACTCGCTGGT CCTGTTTCAG GTCGGGGACT TCTACGAGGC CTTCTGCGGT GCCGCCGAGC GGGTCGCCCG GCTCTGTGAG ATCACGCTGA CCAAGCGCGA GGACTCCACT GGGCAGTACG CGATGGCCGG AGTCCCCATC GACAACGCCG AGAGCTACGT CGAGACGCTG CTCGACGCCG GCTACCGCGT CGCCATCGCG GACCAGGTCG AGGACCCCGA CGCGGTCAGC GGCGTCGTCG ACCGGGCGGT GACGCGGATC ATCACGCCGG GGACGCTGAC CGAGGACGAA CTGCTCGACT CGCCGGACAA CAACTACGTC GCCGCGCTGA CCGCCGACGG CCGCCGCTAC GGGCTGGCGC TGCTGGACGT GTCCACGGGC GATTTCTACG CGACCAGCGC CGACGCCGTC GACGCCGTCG CCGACGAGGT GGGTCGGTTC GCACCCGCGG AGGCGATCGT CGGCCCCGGC GTCGACGTGG ACGAGGATCG GGTCTTCGAG GCGGCGGCGA TGGTGACGCC GTACGACGAG TCGGTGTTCG CCTTCGAGGA CGCCGCCGAC CGAGTGCGGA CGTACTTCGG CGACCCGGAC GCGCTGCTGG CCGACGACCT GGAGGTGCGA GCCTGCGGCG CGCTGCTTGC GTACGCCGAG TACACCCGCG GGGGCGGTGC GGGCACGAGC GACGAGATCG CAGACGACGA CGGCGGGCAA CTGACGTATC TCACGCACCT CACGCGTTAC GACCCCCGCG AGTACATGCT GCTGGACGCC GTCGCGCTCG ACAGCCTCGA ACTGTTCGAG CGCCGGGCGG TGCGGGGCCA CGAGGGGCGC ACGCTGGTCG ACACCGTCGA CGAGACCGCC TGCGCGCTCG GCCGGCGACG GCTCGGCGAC TGGCTGCGTC GGCCGCTGCT CGACGCCGAC CGAATCGAAC GCCGCCACGA GGCGGTCGCC GAACTCGTCG AAGCGCTCCA GCGCCGCGAG CGACTCCACG CCCTGCTCGC GGACGTGTAC GACCTCGAAC GGCTGATCTC TCGCGTCTCG CGCGGGCGAG CCAACGCGCG GGACCTGCGC TCGCTGGCGG CCACGCTCGC AGTCGTCCCC GACGTGCGTG AGCAACTGGC CGACGCCGAC AGCGCCCTGC TCGCGGACCT TCACGAGGGG CTCGATCCGC TGACGGACGT GCGCGAGGAG ATCGAGGCGG CGATCTGTCC GGACCCGCCC CAGGAAGTCA CCGAGGGTGA CGTGATCCGC GAGGGGTACG ACGACGACCT CGACGCGCTG CGCGAGACCG AGCGGTCGGG CAAGCGATGG ATCGACGACC TGGAGATCAA CGAGCGCGAA CGCACCGGAA TCGACTCGCT GAAGGTCGGG CACAACTCCG TCCACGGCTA CTACATCGAG GTGACCGACC CCAACCTCGA CAGCGTCCCC GACGACTACG AGCGCCGCCA GACGCTGAAA AACTCCGAGC GCTTCGTCAC GCCCGAACTC AGGGAGCGCG AAGAGGAGAT CGTCCGGGCA GAGACGGCCG CCGACGACCT CGAATACGAC CTGTTCTGTG AGGTACGAGC GGCGATCGCG GCCGAGGCCG AGCGCGTGCA GGCGCTGGCC GACCGCCTGG CGACGCTCGA CGCGCTCGTG GCCTTCGGCG AGGTGGCGGC GACCCACGAC TATTGCCGAC CCAGCGTCGG CGGGGACGCC 
ATCGACGTGA CGGCCGGCCG CCACCCCGTT GTCGAGCGCG CCGAGGCGTC GTTCGTTCCC AACGACGCCT GCCTCACTCC GGACTCGTTT TTCACGATCC TCACTGGCCC CAACATGAGC GGGAAATCGA CGTACATGCG CCAGATCGCG CTCATCTGCG TGCTGGCACA GGCCGGGAGT TTCGTCCCCG CTCGCGAGGC GAACCTGCCG ATCGTCGACC GCGTGTTCAC CCGCGTCGGT GCGAGCGACG ACATCGCCGG CGGGCGCTCG ACGTTCATGA TCGAGATGAC CGAACTCGCA GACATCCTCC AGGGCGCGAC CAGCGACTCG CTGATCCTGC TGGACGAGGT CGGCCGGGGG ACCTCGACGG CCGACGGGCT CGCCATCGCC CGCGCCGTCA CCGAACACGT CCACGACGAA ATCGGGGCGT ACACGCTCTT TGCGACCCAC CACCACGAGC TGACGGCCGT CGCCGACGAA CTGCCCGGCG TCCGCAACCG CCACTTCGAG ACGCGCCACG ACGGCGACGG CGTCGTCTTC GAGCACAGCG TCGCCCCCGG CGCGGCCGCG GCGTCCTACG GGATCGAGGT CGCGGCCCTG GCCGGCGTGC CCGATTCGGT GGTCGAGCGT TCTCGGACGG TGTTGGCCAG CGAGGACGAG CGAAGCGAGT CCTCGGAAAC GAGAGCGGGA GCGGAGCGAC ACGCGGGCGG CGAGGACGAG CGCAGCGAGT CCTCGGACGC CGAAAACGGC GCGGTGGCCC AGGCGTCGGC TCCGGCGAGC GAGCCGCCGT CGGCGTCGGC CGACGGCCAC GCCGTCGTCG AGGCGGCGAC CGGTGACGAG AGCGGACCTG ACGCCGACCC GCTCCGCGAG CGGCTGGCAC AGCTGGACGT GGCGACGATG ACGCCCATCG AAGCGATGAA CGCGCTCGCC CGACTACAGG ACGACATCGC GGACTGA
|
Protein sequence | MDAALGPPEK MAERADELTP MMRQYYELCR AYDDSLVLFQ VGDFYEAFCG AAERVARLCE ITLTKREDST GQYAMAGVPI DNAESYVETL LDAGYRVAIA DQVEDPDAVS GVVDRAVTRI ITPGTLTEDE LLDSPDNNYV AALTADGRRY GLALLDVSTG DFYATSADAV DAVADEVGRF APAEAIVGPG VDVDEDRVFE AAAMVTPYDE SVFAFEDAAD RVRTYFGDPD ALLADDLEVR ACGALLAYAE YTRGGGAGTS DEIADDDGGQ LTYLTHLTRY DPREYMLLDA VALDSLELFE RRAVRGHEGR TLVDTVDETA CALGRRRLGD WLRRPLLDAD RIERRHEAVA ELVEALQRRE RLHALLADVY DLERLISRVS RGRANARDLR SLAATLAVVP DVREQLADAD SALLADLHEG LDPLTDVREE IEAAICPDPP QEVTEGDVIR EGYDDDLDAL RETERSGKRW IDDLEINERE RTGIDSLKVG HNSVHGYYIE VTDPNLDSVP DDYERRQTLK NSERFVTPEL REREEEIVRA ETAADDLEYD LFCEVRAAIA AEAERVQALA DRLATLDALV AFGEVAATHD YCRPSVGGDA IDVTAGRHPV VERAEASFVP NDACLTPDSF FTILTGPNMS GKSTYMRQIA LICVLAQAGS FVPAREANLP IVDRVFTRVG ASDDIAGGRS TFMIEMTELA DILQGATSDS LILLDEVGRG TSTADGLAIA RAVTEHVHDE IGAYTLFATH HHELTAVADE LPGVRNRHFE TRHDGDGVVF EHSVAPGAAA ASYGIEVAAL AGVPDSVVER SRTVLASEDE RSESSETRAG AERHAGGEDE RSESSDAENG AVAQASAPAS EPPSASADGH AVVEAATGDE SGPDADPLRE RLAQLDVATM TPIEAMNALA RLQDDIAD
|
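The relationship between the gene sequence and the protein sequence above can be reproduced with a small translation routine. This is a minimal sketch, not the method used to generate this record: the names `translate` and `gc_content` are hypothetical, the sequence is assumed to be given 5' to 3' on the coding strand (as displayed here), and the codon assignments are those of translation table 11, which match the standard code for elongation codons.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
# (first base varies slowest, third base fastest, each in T, C, A, G order).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence shown above.
print(translate("ATGGACGCGG CGCTGGGACC TCCCGAGAAG"))  # MDAALGPPEK
```

Passing the full gene sequence to `translate` should yield the listed protein sequence, and `gc_content` on the same input gives the value that the GC content field reports as a rounded percentage.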
| |