Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_0551 |
Symbol | |
ID | 8382818 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 557406 |
End bp | 559157 |
Gene Length | 1752 bp |
Protein Length | 583 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644971613 |
Product | DNA mismatch repair protein MutS domain protein |
Protein accession | YP_003129471 |
Protein GI | 257051638 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.27881 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGTAG AGGACTACTG GGGCGTCGGG CCGAAGACCC GTGATCTCCT CGCCGAGTCG CTGGGCATCG AAACTGCCAT CGCCGCGATC GAATCCGGCG ATCTTCGGGC GTTGACCGAG GCCGGATTGA GTCGCGGCCG AGCGACACGG ATCCTTCGCC GGGCCCAGGG CGGGGCGATG GACGTGCTGG CGACCCGCGA CGCCCGATCC GTGTACAAGT CGGTGCTGGA CCTGGCGAGT GATTACGCCG TGACCCGCCA CGCCGCCGAC AGCATCCGAC TCCTCACGCC GCTCGATTCC CGGTCCGCCA TGGAATCACG CCTCGAAACC GTCATGGACG CCGTGGCCGT CTGGAACGCA CTCGACGAGT CGACTCGGGA GGGCGTCATC GAGGCGTTCG ACGCCTACGA CAGCGTCGAA GGCGGTGATC TCGCCGGCGT CCGGACCGCA CTGGCATTGC GGGAGACCGG TGTCACCGAC GGTGTGTTCG CGCCGCTCGC AGACCTCGAG GTCGAACAGC TCGACGCCGC GGCCGACGCG CTCGCCGCGC TCTCGGCGGA CGGCGTCGCG GCCGGGGCTG ACGACCGTCT CGACGATCTG AGGACGCAGC TCGGGGCGAT CGAGGATATG GCCGCCGACG CCGAGTCGGT GATCGCCACG ATCCGCGACG CTGGCGTCCG GGGTGGCGAC GAGTTCCGCG AGCGATTCGT CGACCACGTC GTCAGCGAAG CCGGCGTCGA TGTCGGGGCC GTCAGGGAGG CGATGGTCAC CGACGCCCCG GACGTGACCG ACTTCGTTTC GGAGACGCTT CGCGGCCTCG CCGCGGATCG ACGTGACGCC GTCGAGGAAC GCGAGGCGGA CGTCCGCGAG CGCCTCGAAA CGAGCCTGGA ACTCGCCCGC GAGGACGTCG ACGCCGCTGT CGACGTCGTC GACGAGATTG CCCGAGACGT CTCGCTGGCC CGGTTCGCTC GTGCGTTCGA TCTCACCGCG CCGACCTATC GTGAGGGACG GGTGCTAGCC GTCGAGAACG CTCGCAACCT CGAACTGATG GGTGGAGATG TCGCCGTCCA GCCGGTCACC TACGGGATCG GCGACCACTC GCTGTCGGTG GCCGGCGCGA ACGAACCGCC ACGGGGCGAC CGCGTCGCCG TCCTGACAGG AGCCAACAGC GGCGGGAAGA CGACGCTGCT GGAGACGCTC GCGCAGGTGC AGTTGCTCGC CCAGATGGGG CTGCCCGTGC CAGCGGATGC CGCCGAAGTG GGGGTCGTCG ACGCCGTGGT CTTCCACCGC CGACACGCGA GTTTCAACGC GGGCGTGCTC GAATCGACAC TCCGGACAGT CGTCCCGCCC CTGACTGACG AGGGACGAAC CCTGATGCTT GTCGACGAGT TCGAGGCGAT CACCGAACCC GGCAGCGCCG CCGATCTCCT TCACGGCCTG GTCACGCTGA CGGTCGACCA GCCGGCGCTT GGCGTGTTCG TCACCCACCT GGCTGACGAC CTGGAACCGC TCCCATCGGC AGCCCGAACT GACGGCATCT TCGCCGAAGG GTTGACGACG GATCTCGAAC TCGAAGTCGA CTATCAGCCC CGGTTTGGCA CGGTCGGGCG CTCGACACCG GAGTTCATCG TCTCGCGACT CGTGGCCGAC GCCGACGATC GACGCGAACG CGGCGGGTTC CAGACGCTTG CCGCGGCCGT CGGCGAGCAA GCCGTCCAGC GGACACTGTC GGACGCCGAG TGGTCCGGTT GA
|
Protein sequence | MDVEDYWGVG PKTRDLLAES LGIETAIAAI ESGDLRALTE AGLSRGRATR ILRRAQGGAM DVLATRDARS VYKSVLDLAS DYAVTRHAAD SIRLLTPLDS RSAMESRLET VMDAVAVWNA LDESTREGVI EAFDAYDSVE GGDLAGVRTA LALRETGVTD GVFAPLADLE VEQLDAAADA LAALSADGVA AGADDRLDDL RTQLGAIEDM AADAESVIAT IRDAGVRGGD EFRERFVDHV VSEAGVDVGA VREAMVTDAP DVTDFVSETL RGLAADRRDA VEEREADVRE RLETSLELAR EDVDAAVDVV DEIARDVSLA RFARAFDLTA PTYREGRVLA VENARNLELM GGDVAVQPVT YGIGDHSLSV AGANEPPRGD RVAVLTGANS GGKTTLLETL AQVQLLAQMG LPVPADAAEV GVVDAVVFHR RHASFNAGVL ESTLRTVVPP LTDEGRTLML VDEFEAITEP GSAADLLHGL VTLTVDQPAL GVFVTHLADD LEPLPSAART DGIFAEGLTT DLELEVDYQP RFGTVGRSTP EFIVSRLVAD ADDRRERGGF QTLAAAVGEQ AVQRTLSDAE WSG
|
| |