Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_0458 |
Symbol | |
ID | 8382725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 452888 |
End bp | 455485 |
Gene Length | 2598 bp |
Protein Length | 865 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644971520 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_003129378 |
Protein GI | 257051545 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.143165 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGCGG CCCTGGGACC GCCGGCGAAG ATGACCGACC GGCGCGAGGA TCTCACGCCG ATGTTGCGCC AGTACGTCGA GTTGACCGAG CGCTACGACG ACGCCCTGGT GCTCTTTCAG TCGGGTGACT TCTACAAGGG ATTTTGCGAG GCTGCCGAGG TGCTTGCACG GATCTGTGAG GTGACGCTGA CCGAACGCGA GGATTCGACC GGCACCTACG CGATGACGGG TGTGCCGATC GACAACGCCG AATCCTATAT CGAGAAGTTA CTGGACGCGG GCTACCGCGT GGCGATTGCC GACCAGGTCG AGGACCCCGA CGAGGTCAGT GGCGTCGTCG AGCGCGCGGT CACGCGGATC ATCACGCCAG GGACACTCAC CGAGGACGAA CTGCTCGGTG GTGCCGAGAA CAACTACGTC GCCGCGCTGG CCGCTGACGA TGGCCGATTT GGTGTGGCCG TCCTCGACGT TTCGACGGGG GACTTCTACG CGACCAGCAC CGACGACCGG GAGACGGTCA GGGACGAACT CGGTCGATTC TCGCCCGCTG AGGGAATCCT CGGCCCCGAC GTCCCGAACC TCTTCGATGG GGCGTGTACC GTCAGTCCCG TCGAGGGGAC GTACTTCGCC ACTGACCGGG CAGCCGAGCG CGTCGGCGAG TACTTCGGGA CGCCTGATCG GCTCCTCGCG ACCGATGCCG AGGTCCGGGC CTGTGGCGCG TTGCTGGCCT ACGCCGAGTA CGCCCGCGGG GGTGAAGCCG GTCGGCTGGA CTACCTCAAC CACCTCACAC GATACGATCC GCGGGCGTAC ATGGTACTCG ACGCGGTGGC GCTCGAAAGC CTAGAGATCT TCGAACGCCG GAGTGTCACC GGCGGCGCGG ACCTGACGCT CGTGGACGTG ATCGACGAGA CGGCTTCTGC GCTCGGCCGT CGTCGGCTGA CCGAATGGCT TCGCCGTCCG CTCATCGACC GCGACCGGAT CGAGGCGAGA CACGCGGCAG TGGATGCGCT TGTTTCGGAG CTCCAAACCC GCGAACGGCT CCACGAGCTA CTATCCGACG TCTACGACCT CGAGCGACTC ATCTCACGCG TTTCCCGCTC GCGAGCCGAC GCCCGTGACC TCCGCTCGTT GAAAGACACA CTCGACGTGA TCCCGGAGAT CAAAGCGGCG TTGGACGGCA TCGATGCCCC GCTGTTGACC GACCTTCGAG ACCGTCTCGA CGAGATGGAC GACGTCCGCG GGTTGATCGA CGACGCGATC GCGGCGGACC CACCGACCGA AATCACCGAG GGTGGGATCA TCAGTGAGGG CTACGACGAC CGACTCGACG AGTTGCGCGC GACCGAACGA GAGGGCAAGG AGTGGATCAC CGACTTGGAG GAAAGCGAAC GCGAGCGGAC CGGCATCGAC TCGTTGAAGG TCGGCCACAA CGCCGTCCAC GGCTACTACA TCGAAGTGAC CGACGCGAAC GTCGATCGGG TTCCCGAGGA CTACCAGCGG CGACAGACGC TGAAGAACGC CGAGCGTTAC TACACGCCCG AACTCAAGGA GCGGGAAGAC GAGATCCTTC GGGCGGAAGG GCAGGCCGAC GATCTGGAGT ACGAGTTGTT CGTGGAAGTG CGTGACGACG TCGCCGCCGA GTCCGAGCGC GTCCAGGCAG TCGCCGACGC TGTGGCAAAT CTCGACGTGC TGGTTGGCTT TGCCACCGTC GCGGCCGAGC GGGATTATTG CCGCCCCTCG GTCGGTGGGG ACGGAATCGA CATCGAGGGT GGTCGCCACC CGGTCGTCGA GCGCACCGAG GACGCGTTCG TTCCGAACGA CACCCATCTC GACGACGACG CCTGTCTCGC GGTGATCACT GGGCCGAACA TGAGCGGGAA GTCGACCTAC ATGCGCCAGG TCGCGCTGAT CTCGATTCTC GCCCAGGTCG GGAGCTTCGT GCCCGCCGAA TCGGCGGACC TGCGGATCGT CGACCGCGTG TTCACCCGCG TCGGCGCGAG TGACGACATC GCCGGCGGGC GCTCGACGTT CATGGTCGAG ATGAGCGAAC TCGCGACGAT CCTGGAAGGA GCGACCGCGA ACTCACTCGT CTTGCTCGAC GAGGTGGGCC GCGGGACCAG CACGACCGAC GGCCTGGCGA TCGCCCAGGC AGTGACGGAG TTCATCCACG ACGAGGTCGG CGCGACGACG CTGTTTGCGA CCCACCACCA CGAACTGACC GAGGTCGCGG CCGATCTCAA CGGCGCAGTG AACCGACACT TCCGGACCGA ACAGGCGGGC GAAGAGGTGT CGTTCCCCTA CGATATCGCC ACCGGGCCCG CCGCGGCATC CTACGGTGTC GAGGTGGCCG GCGTGGCCGG CGTGCCGGAC ACGGTCGTCG GTCGCTCGCG AGAACTGCTC GGTGACAGTA CCCCCGACGG ACGGGAACCG GGTCAAGAGC CCGATCGAAC CGCGACGGAA CGTGGAAGTG AAACGCCCGA GGAACCCGAT CGAGACGATG TCGTCGCCGA ACTCCGATCT CTTTCCGTCG CTGAGATGAC GCCCATTCAG GCGTTGAACA CGCTGGCCGA CTTACAGCGT CGGGCCGATC GAGAGTAG
|
Protein sequence | MDAALGPPAK MTDRREDLTP MLRQYVELTE RYDDALVLFQ SGDFYKGFCE AAEVLARICE VTLTEREDST GTYAMTGVPI DNAESYIEKL LDAGYRVAIA DQVEDPDEVS GVVERAVTRI ITPGTLTEDE LLGGAENNYV AALAADDGRF GVAVLDVSTG DFYATSTDDR ETVRDELGRF SPAEGILGPD VPNLFDGACT VSPVEGTYFA TDRAAERVGE YFGTPDRLLA TDAEVRACGA LLAYAEYARG GEAGRLDYLN HLTRYDPRAY MVLDAVALES LEIFERRSVT GGADLTLVDV IDETASALGR RRLTEWLRRP LIDRDRIEAR HAAVDALVSE LQTRERLHEL LSDVYDLERL ISRVSRSRAD ARDLRSLKDT LDVIPEIKAA LDGIDAPLLT DLRDRLDEMD DVRGLIDDAI AADPPTEITE GGIISEGYDD RLDELRATER EGKEWITDLE ESERERTGID SLKVGHNAVH GYYIEVTDAN VDRVPEDYQR RQTLKNAERY YTPELKERED EILRAEGQAD DLEYELFVEV RDDVAAESER VQAVADAVAN LDVLVGFATV AAERDYCRPS VGGDGIDIEG GRHPVVERTE DAFVPNDTHL DDDACLAVIT GPNMSGKSTY MRQVALISIL AQVGSFVPAE SADLRIVDRV FTRVGASDDI AGGRSTFMVE MSELATILEG ATANSLVLLD EVGRGTSTTD GLAIAQAVTE FIHDEVGATT LFATHHHELT EVAADLNGAV NRHFRTEQAG EEVSFPYDIA TGPAAASYGV EVAGVAGVPD TVVGRSRELL GDSTPDGREP GQEPDRTATE RGSETPEEPD RDDVVAELRS LSVAEMTPIQ ALNTLADLQR RADRE
|
| |