Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_0493 |
Symbol | |
ID | 8382760 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 495025 |
End bp | 497772 |
Gene Length | 2748 bp |
Protein Length | 915 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644971555 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_003129413 |
Protein GI | 257051580 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.467736 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGAAG CGACGGGGAT CGTCGGCGAG TTCTTCTCGC TGAAGGACGG GGCTGACGCC GATCTACTGG CGATGCAGGT CGGGGACTTC TACGAGTTCT TCGGTGCGGA CGCAGAGACA GTGGCCGACG AACTCGACCT CCAGGTCTCA CAGAAGTCCA GCCACGGCTC GTCGTATCCG ATGGCCGGCG TGCCCGTCAA CGAACTCACA CCGTACCTGA CGGCACTGGT CGAACGCGGA TACCACGTGG CAGTCGCGGA CCAACACGAG ACGGACGACG GCCACGCCCG CAAGGTCGAG AAGATCGTGA CGCCAGGGAC GTTGCTGTCC ACGACCGACG CAGGGGCCCG GTATCTCGCG GCGATCGTCG AGGGCGAGGC GTGGGGACTG GCGTTCGCCG ACGTGACGAC CGGCGAGTTC TTCGTGACGC AGGTCGCCGA CCGCGACGCC GTCTTCAGCG AACTCTACCG CTTCGACCCG GCCGAGGTGC TGCCCGGGCC GACTGTTCGG GCCGACGACG AAATGATCGA ACGACTCCGG GAACGGACGG ACGCGAGCGT CTCGCTGCAC GCGACCGAGG CGTTCGCGCC CGGCCGCGCC CGTCACCGAC TCCGCGAGCA GTTCGGGACC GAGACCATCG AGAGCGTTGG GATCGGGGAC GCCGAATCGG CCATCGCGGC CGGCGGGGCT GTCCTCAGCT ACGTCGAGGA GACTGGCCAG GGCGTGCTCG CGTCGATGAC TCGCCTCCAG CGCTACGGCG CGAGCGACCA CGTCGAACTC GACGCGACGA CCCAGCGCAA CCTCGAACTC ACGGAGACGA TGCGGGGGGA GCGTACCGGG TCGCTCCTGG ATACGATCGA CCACACCGTC ACGAGCGCGG GGACCCGGAC GTTGCGCGCG TGGCTCCAGC GCCCTCGACG ATCGCGGGAA ACGCTGGACC GGCGGGGGGA CAGCGTCGAA GCGCTCGCGA CTGAAGCGAT GGCCCGCGAA CGCCTCCGTG ACGTGCTCGG GGACGCCTAC GACCTCGAAC GGCTCGCCAG TAAGGCTGCC TCGGGGAGCG CCGACGCCCG CGACCTCCGG GCCGCGGTCG ACACACTGGA GTTGTTCGAG ACAGTTCGGG CCATCGTTCG CGAGACACCG ACGCTGGCCG AATCGCCCCT CTCGACGTGG CTCGACGAAC CCGATCCCGC CGCCGTCGCG ACTCTCGCGG CCGACCTCGA CGCGGCGATC GTCGATGATC CGCCGGGGAC GATCACCGAA GGCGGGATCA TTCGCGAAGG ATACGACGCG GAACTCGACG AGGTTATCGA CGAGCACGAA ACAGCACTTG AATGGATCGA GACGCTGCCC GAGCGCGAGC AGCGCGAGCA CGGCATCACC CACCTCTCGG TCGACCGGAA CAAGACGGAC GGCTACTACA TCCAGGTCGG CAAAAGCGAG ACCGGGAAAG TCCCCGACCA CTACGAGAAC GTCAAGACGC TGAAAAACTC CGAGCGCTAC ACGATCGCGG AGTTGACCGA ACGCGAACGG AAGATCTTCC GGCTCGAAGA GCGTCGCCAC GACCTCGAAC GCCAGTGTTT CGAGGAGTTG CGCGAAGCGG TCGCCGACCA CGCCGATCTC CTCCAGGGCG TCGGCCAGGC CCTCGCGGCG GTCGACGTGA TGGCGGCACT CGCCACCCAC GCCGTCCGCA ACGACTGGAC CCGCCCGACG TTGCGCGATT CGCGGGCGCT CGACGTCGAG GCCGGTCGCC ACCCGGTCGT CGAGCAGACC ACGGAGTTCG TGCCCAACGA CCTCCGGATG GACGACGACC GCCGGTTCCT GATCGTGACG GGGCCGAACA TGAGCGGGAA ATCGACGTAC ATGCGCCAGG CGGCGCTGAT CGTCCTGCTG GCACAGATCG GATCGTTCGT GCCGGCCAGG TCGGCGGAGG TTGGCCTCGT CGACGGGATA TACACTCGCG TCGGGGCCCT GGACGAACTC GCCCAGGGCC GCTCGACGTT CATGGTCGAA ATGCAGGAAC TCGCGAACAT CCTCCATTCG GCGACAGAGG ACTCGCTGGT CATCCTAGAC GAGGTCGGTC GCGGGACGGC GACCTACGAC GGCGTCTCGA TCGCGTGGGC GGCGACCGAA TATCTCTCTT CGGCCCAGTC GGCCTCGCCG TCACCGAAAA CGCTCTTTGC GACACATTAC CACGAACTCA CCACGCTTGC CGATCACATC TCGGGCGTCG AAAACGTCCA CGTCGCCGTC GACGAGCCCG CAGGCGGCAC AGACGGCGAC GATGCCGGCC CGCCCACCAC CGACGGGGCC GACGACGATG TCACGTTCCT CCGGACTGTC CGGGACGGCC CGGCCGACCG CTCCTACGGC GTCCACGTCG CCACCCTCGC AGGCGTCCCG GATCCCGTCG TCTCCCGGGC GCGGGAGGTA CTCCGGAAAC TCCGGGCCGA CGAGGCCATC GACGTCCAGA ACGGTCGAGC GAGCGAGGAG ACCCGGCAGG TCGTCTTCGA CCTTGATTCG GGGGAACTGC GGGAATCGGA CACGGACGAC AGTGGGACAG GAGACCCACC GGCGGATCAG ATGGGCGACA CCGACGAGAC GGTGGGTGGG GGAACGGGTC CGATCGTCGA CCAGTTCGGC GAGGACGCTC CGGCGGTCCT CGAAGCCCTC GAATCCCTCG ACATCGAGGA GACCCCGCCC GTCGAACTGC TGGGAGAGGT GCAGGAATGG CAGCGGCGAT TGGAATGA
|
Protein sequence | MTEATGIVGE FFSLKDGADA DLLAMQVGDF YEFFGADAET VADELDLQVS QKSSHGSSYP MAGVPVNELT PYLTALVERG YHVAVADQHE TDDGHARKVE KIVTPGTLLS TTDAGARYLA AIVEGEAWGL AFADVTTGEF FVTQVADRDA VFSELYRFDP AEVLPGPTVR ADDEMIERLR ERTDASVSLH ATEAFAPGRA RHRLREQFGT ETIESVGIGD AESAIAAGGA VLSYVEETGQ GVLASMTRLQ RYGASDHVEL DATTQRNLEL TETMRGERTG SLLDTIDHTV TSAGTRTLRA WLQRPRRSRE TLDRRGDSVE ALATEAMARE RLRDVLGDAY DLERLASKAA SGSADARDLR AAVDTLELFE TVRAIVRETP TLAESPLSTW LDEPDPAAVA TLAADLDAAI VDDPPGTITE GGIIREGYDA ELDEVIDEHE TALEWIETLP EREQREHGIT HLSVDRNKTD GYYIQVGKSE TGKVPDHYEN VKTLKNSERY TIAELTERER KIFRLEERRH DLERQCFEEL REAVADHADL LQGVGQALAA VDVMAALATH AVRNDWTRPT LRDSRALDVE AGRHPVVEQT TEFVPNDLRM DDDRRFLIVT GPNMSGKSTY MRQAALIVLL AQIGSFVPAR SAEVGLVDGI YTRVGALDEL AQGRSTFMVE MQELANILHS ATEDSLVILD EVGRGTATYD GVSIAWAATE YLSSAQSASP SPKTLFATHY HELTTLADHI SGVENVHVAV DEPAGGTDGD DAGPPTTDGA DDDVTFLRTV RDGPADRSYG VHVATLAGVP DPVVSRAREV LRKLRADEAI DVQNGRASEE TRQVVFDLDS GELRESDTDD SGTGDPPADQ MGDTDETVGG GTGPIVDQFG EDAPAVLEAL ESLDIEETPP VELLGEVQEW QRRLE
|
| |