Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0119 |
Symbol | |
ID | 7401640 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 123768 |
End bp | 126563 |
Gene Length | 2796 bp |
Protein Length | 931 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643707183 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_002564795 |
Protein GI | 222478558 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.893903 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAACGG GGATCGTCGG GGAGTTCCTC GACCTCAAGG CCGAGACCGA CGCGGACATC CTCGCCATGC AGTGCGGCGA CTTCTACGAG TTCTTCGCGG ACGACGCCGA GCTGGTCGCC GACGAGCTGG ACCTGACCGT CTCACAGAAG TCCTCGCACG GCTCGTCGTA CCCGATGGCG GGCGTGCCGC TCTCGGAGCT GACCCCGTAC GTGAAGGCGC TCGTCGAGCG CGGCTACCGG GTCGCCGTCG CCGACCAGTA CGAGACCGAG GACGGCCACG CCCGGGAGAT TACCCGCGTC GTCACGCCTG GGACGCTCCT CGAAACCGCC GACGACGACG CGCGGTACCT CGCGGCGATC GTCCGCGAGG GCGACGACGC GGACGGCCCC TACGGGCTCG CGCTCGCCGA CGTGACCACG GGCCGGTTCC TCGTCACCGA AGTCGACGAC GAGGGCGACC TCCGCGCGGA GCTGTACCGC TTCGACCCCG CCGAGGTGCT CCCGGGACCG CGCGTCCGCA ACGACGACCG ACTGCTCGGA GCGGTCCGAG AGGACCTCTC GGGGTCGGTT TCCGTCTTCG ACGCGGAGGC GTTCGCGCCG GGACGCGCGA AACACGCGGT CCGCGAGCAG TTCGGGCGGG AGACCGCCGA CAGCGTCGGC ATCGACTCCG AACTGGCGCT GCGCGCGGCG GGAGCCGTCC TCGGCTACGT CGAGGAAACC GGCGCCGGCG TGTTGGCATC GATCACCCGT CTCACGGCCT ACGGCGACGG CGACCACGTC GCCGTCGACG CCACGACCCA ACGCAACCTC GAACTCACCG AGACGATGCG CGGCGACGCC GACGGCTCGC TGTTCGAGAC GGTCGATCAC ACCGTCACCG CCGCCGGCGG CCGCCTCCTC CGAGAGTGGA TCACCCGCCC GCGCCGGGAC CGCGAGGAAC TGAACCGCCG GCTCGACGCG GTGGAGGCGC TCGCGTCGGC AGCGCTCGCG CGCGACCGCC TGCGAGAGAC GCTCGGCGAC GCGTACGATC TCGAACGGCT CGCGGCGCGG GCGACTAGCG GGAGCGCGGG CGCGCGGGAA CTCCTTTCGG TGCGGGACTC GCTGGCGCTG GTGCCCGCGC TCGCCGACGC CGTGTCCGGG ACCGCGCTCG CGGACTCCCC GGTCGCAGCG GTGCTGGAGC GAATCGACCG CGAGCGCGCC GCGACCCTCC ACGACGAACT CGCGGACGCG CTCGCGGAGG ACCCGCCGAA GACGAAGACG CAGGGCGGGC TACTCAGGGT GGGATACGAC GGCGAGCTCG ACGAGCTGAT CGCGCGCCAC GAGAAGGCGA ACGAGTGGCT CGATAGGCTC GCAGAGCGCG AAAAGCGGCA GTACGGGTTG AGTCACGTCA CCGTCGACCG CAACAAGACG GACGGTTACT ACATCCAGGT CGGCAAATCC GCGGCCGACG GGGTTCCCGA GCACTACCGC GAGATCAAGA CGCTGAAGAA CTCGAAGCGG TTCGTCACCG ACGAACTGGA AGAGCGGGAA CGCGAGGTGC TCCGGTTGGA GGAGGCCCGG GGCGAGCTGG AGTACGAGCT GTTCGAGGAG CTCCGAGAGC GGGTCGCCGC CGACGCCGAA CTCTTACAGG ACGTGGGGCG AGCGGTCGCC GAGATCGACG CGCTCGCGTC GCTGGCGACC CACGCCGCCG GCAACGACTG GACGCGACCC GAACTCGCCG ACGAGCGCCG GCTCGACGTC GAGGCCGGGC GCCACCCGGT CGTCGAGCGG ACGACCGATT TCGTGCCGAA CGATCTCCGG CTCGACGGGG AGCGCGGCTT CCTCATCGTC ACCGGGCCGA ACATGAGCGG GAAATCGACG TATATGCGGC AGGCGGCGCT GATCCAGCTG CTCGCGCAGG CGGGGTCGTT CGTCCCCGCG CGGACGGCGA CGGTCGGGCT CGTCGACGGA ATCTACACCC GCGTCGGCGC GCTCGACGAG CTGGCACAGG GGCGCTCCAC GTTCATGGTG GAGATGCAGG AGCTGTCGAA CATCCTCCAC TCGGCGACCG CCGACTCGAT CGTCATCCTC GACGAGGTCG GCCGCGGCAC CGCCACCTAC GACGGCATCT CCATCGCGTG GGCCGCGACC GAGTATTTAC ATAACGAGGT GCGCGCGCGG ACCCTCTTCG CCACGCACTA CCACGAGCTG ACGACGCTGG CAGACCACCT CCCGCGCGTG GAGAACGTCC ACGTCGCCGT CGACAAGCGC GACGGCGAGG TGACGTTCCT CCGGACCGTT CGCGACGGCC CGACAAATCG GTCGTACGGG GTCCACGTCG CCGACCTCGC GGGCGTTCCG GCTCCAGTCG TCTCCCGCGC CGGGACGGTG CTCGACCGGC TTCGCGAGGA GAAGGCGATC GAGGCGAAGG GCGGAGCGCG GGGCGGAGGG GGCGAACGCG GAGGCTTCAC CGGCACCGCC GACGGCGACA CGAAACAGGT CGTTTTCGAC CTCTCGTCCG GGTCGTTCTC TGAAAGCGAC GACGCGGAGT CGACCGCGGC CGGCGCCCCC GGCTCCGGAG GAGGTCGAAA CGGGGCGACT CCGGGGTCGG CATCGGACGG CGCCAGTGGG TCCGCGGGGA CCGCAGAGAC CGCAGGAGCC GCAGAGACCG CTGAAAGCGC GGAGACCGAC CGGTTCGATC CCGAGACCCG CGCCGTGATC GAGGAACTGG CCGACGTCGA TGTCGCGGAG ACCGCGCCGG TGGAGTTGCT GTCTCGGGTT CAAGAGTGGC AAGAGCGGCT CGACGAGAAC CGCTGA
|
Protein sequence | MPTGIVGEFL DLKAETDADI LAMQCGDFYE FFADDAELVA DELDLTVSQK SSHGSSYPMA GVPLSELTPY VKALVERGYR VAVADQYETE DGHAREITRV VTPGTLLETA DDDARYLAAI VREGDDADGP YGLALADVTT GRFLVTEVDD EGDLRAELYR FDPAEVLPGP RVRNDDRLLG AVREDLSGSV SVFDAEAFAP GRAKHAVREQ FGRETADSVG IDSELALRAA GAVLGYVEET GAGVLASITR LTAYGDGDHV AVDATTQRNL ELTETMRGDA DGSLFETVDH TVTAAGGRLL REWITRPRRD REELNRRLDA VEALASAALA RDRLRETLGD AYDLERLAAR ATSGSAGARE LLSVRDSLAL VPALADAVSG TALADSPVAA VLERIDRERA ATLHDELADA LAEDPPKTKT QGGLLRVGYD GELDELIARH EKANEWLDRL AEREKRQYGL SHVTVDRNKT DGYYIQVGKS AADGVPEHYR EIKTLKNSKR FVTDELEERE REVLRLEEAR GELEYELFEE LRERVAADAE LLQDVGRAVA EIDALASLAT HAAGNDWTRP ELADERRLDV EAGRHPVVER TTDFVPNDLR LDGERGFLIV TGPNMSGKST YMRQAALIQL LAQAGSFVPA RTATVGLVDG IYTRVGALDE LAQGRSTFMV EMQELSNILH SATADSIVIL DEVGRGTATY DGISIAWAAT EYLHNEVRAR TLFATHYHEL TTLADHLPRV ENVHVAVDKR DGEVTFLRTV RDGPTNRSYG VHVADLAGVP APVVSRAGTV LDRLREEKAI EAKGGARGGG GERGGFTGTA DGDTKQVVFD LSSGSFSESD DAESTAAGAP GSGGGRNGAT PGSASDGASG SAGTAETAGA AETAESAETD RFDPETRAVI EELADVDVAE TAPVELLSRV QEWQERLDEN R
|
| |