Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_1024 |
Symbol | |
ID | 8383297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 989124 |
End bp | 991118 |
Gene Length | 1995 bp |
Protein Length | 664 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644972088 |
Product | DNA mismatch repair protein MutS domain protein |
Protein accession | YP_003129940 |
Protein GI | 257052107 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.318391 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCTGA CGGCGATCCC CGGCGTCGGC GAGAAGACCG CGGCGTCGCT TGCCGAACTC GACGATCCAG CGGCGGCGAT CGAGAACGGC GACGTCGCGG CCGTCGCTCG GGCCCCCGGC ATCAGTCAGG GTCGTGCCGC CCGGATCGTC CGCGCGGCGA TCCGCGAGCG CCACGGCGAC GCGGGCGAGT TCCTGGCGAC GCCGCGCGCC CGGGAAGTGT ACCGGGACGT TCTGGAACTG CTCGAAGCAC GCACCGTCAC CGACTACGCC GCCGCCCGCC TGGAGACGCT GTACCCCAGC GCCAGCGATT CTCGGATCGC CGAAGTCCGC CAACTCAGCG AGCGGGCGAT CGAACGCGAT CCCGACGAGA CCGTCCTCGA GGCGCTCGAA GACGTCGAAC CGCTGGAGCG GCCTGGCGAC GTTCGTGTTC GGGACCGGTG TCTCGCGACG ACCGACGGCG AGACCTACGC CAGCGCCCGT GAGGCGATCC CCGAGATGAG CGTCGAAGTC GTCGAGGACA GCCGCGATCT GGCGGAACTC GCCCGCGGAT ACGCGACGGT CGTCGCACTG GACGATTCCT TCGCCGGCGT CGACGTCGAG GGTGACGTCC GCGTCGAACC CGACGCCCTC GAGGATCCGG CCTCGGTCGT GCCCGAGCGC CCGATTGCCT TTTTCGCGCA CAACCGCGAC CGCATCCTCG CGGCGATTTC CGTCCACCGG GCAGCCGACT TCGATCCGCC GTGCGACCTC GACGCGCTCG AAGCCGCGCT GGACCGACTC GACGCGGAGG GGACGCCGAC CGGTGACGAC GAACTAACGC GATTGCAGGC TGCCGTCAAC GACCTCGACG CCGCCGTGAG CGAGGCCGAA TCGGTCGCCA ACGACCGCCT CCGGGAAGCG ATCGAAGCGC AGGATGTCAC CATCGAGGGG GCGGACCTGC TCTCGCTCGT CGAGCGGGGC GCTGGCGTCG ACGAGGTGCT CTCCCGGGAA CTGGCCGACG AGTACGACGA CGCCGTCGAA GCGGCCCGCG AGCACGTGAT CGACACACTC GACCTGCGGG ACGTAGCTGA CATCACGAAG CGGGCGTTCC CGGACGAACC CACGTTTCCC GTCGAGCGCG AGGAGAGCGT CGTCTCCCGA CTCCGGGAGG AGCTCACGAC GGCCCGGGAC CGCCGGGCCG AACGGCTCAA AACCGAGCTG GCCGACGAAC TGGCGTCGAT GCGAGAGCCG GCCGAGGATC TCGTCGATAC CGCGCTCGAA CTGGACGTCG AACTCGCCAT CGCCCGCTTG GCTGCCGATT TCGACGCGAC GATGCCGGCA CTCGACGGCG ACGGGATCAC GATCGAGGGG GGTCGGTCGC CGCTGCTCGA CGTGGACTTC GTTGACGTCG AACCAGTCGA CTATGAGGTC AGCGGTGTTC GCCTCCTCTC GGGGGTGAAC AGCGGCGGGA AGACCTCGAC GCTGGACCTG CTCGCGCTGA TTGTCATCCT CGCGCACATG GGGCTACCGG TGCCCGCAGA CCGGGCCCGA GTCGGGCGGA TCGACGCGCT GCACTACCAC GCCAAGACCC AGGGCACGCT GGACGCGGGG GCCTTCGAGA GCACGCTCCG ATCGTTCGGC GAGTTGGTTA CTGACGCCGC GAACGAGGGT GAGACACTCG TGCTGGTCGA CGAGCTGGAG AGCATCACCG AACCCGGCGC GAGCGCGAAG ATCATGGCCG GGATTCTGGA GGCGCTGGCC GAACGCGACC AGACGGCCGT GTTCGTCTCC CACCTCGCCC GGGAGATCCG CGAGACGGCC GATCAGGACA TCGGTGTCGA CGGCATCCAG GCACTCGGCC TCGAAGACGG CGAGTTACAG GTCGACCGGA CGCCCCGGAA GGACACGCTG GCGCGCTCGA CGCCCGAGTT GATCGTCGAA AAACTCGCCG ACGGCGACGA TCGCGAGGAC GGCGAGGGGA ACTTCTACGG ACGATTGCTC GAGAAGTTCG AGTAG
|
Protein sequence | MDLTAIPGVG EKTAASLAEL DDPAAAIENG DVAAVARAPG ISQGRAARIV RAAIRERHGD AGEFLATPRA REVYRDVLEL LEARTVTDYA AARLETLYPS ASDSRIAEVR QLSERAIERD PDETVLEALE DVEPLERPGD VRVRDRCLAT TDGETYASAR EAIPEMSVEV VEDSRDLAEL ARGYATVVAL DDSFAGVDVE GDVRVEPDAL EDPASVVPER PIAFFAHNRD RILAAISVHR AADFDPPCDL DALEAALDRL DAEGTPTGDD ELTRLQAAVN DLDAAVSEAE SVANDRLREA IEAQDVTIEG ADLLSLVERG AGVDEVLSRE LADEYDDAVE AAREHVIDTL DLRDVADITK RAFPDEPTFP VEREESVVSR LREELTTARD RRAERLKTEL ADELASMREP AEDLVDTALE LDVELAIARL AADFDATMPA LDGDGITIEG GRSPLLDVDF VDVEPVDYEV SGVRLLSGVN SGGKTSTLDL LALIVILAHM GLPVPADRAR VGRIDALHYH AKTQGTLDAG AFESTLRSFG ELVTDAANEG ETLVLVDELE SITEPGASAK IMAGILEALA ERDQTAVFVS HLAREIRETA DQDIGVDGIQ ALGLEDGELQ VDRTPRKDTL ARSTPELIVE KLADGDDRED GEGNFYGRLL EKFE
|
| |