Gene Information |
Locus tag | Hlac_2226 |
Symbol | |
ID | 7399934 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 2208885 |
End bp | 2211755 |
Gene Length | 2871 bp |
Protein Length | 956 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643709298 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_002566873 |
Protein GI | 222480636 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
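The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table.
start_bp, end_bp = 2208885, 2211755

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 2871, matches "Gene Length"
print(protein_length_aa(length))  # 956, matches "Protein Length"
```

A mismatch here would most likely point to a different coordinate convention (for example 0-based, half-open) or to a spliced or partial gene.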
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.137389 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.561287 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGACTG CGGACCACGA GACGGTGACC GGGCTCCCGC CGGGGATCGC CGCCGCCCGC GAGGAGCTGA CACCGATGCT CTCGCAGTAC GCCGACCTGT GTGCAGCCCA CGAGGACGCC CTCGTGTTGT TCCAGGTCGG CGACTTCTAC GAGGCGTTCT GCGAGGCGGC CGAGGCGGTC GCCGGCGTCT GCGAGGTGAC GCTGACGGAG CGCTCCGACT CCACCGGCGA CTACCCGATG GCCGGGATCC CCATCGACAA CGCCGCGCCC TACCTCGGAT CGCTCCTCGA TGCCGGCTAC CGCGTCGCCC TCGGCGATCA GGTCGAAGAC GCCGAGCAGG CGTCCGGGCT CGTCGACCGC GCCGTCACCG AGGTGATCAC TCCCGGGACC GTCGTCGAAG ACGAGCTGCT GGAGGCGGGG ACGACGAACT ACGTGGCGGC GGTGGCGGGC GGAGGCGAGG ACGGCGAGAG CGACGACGCC CCCGAGCCGG CCCCCGTCGG CCTCGCCGCC GTCGACGTCT CGACCGGCGA GTGTCTCGTC ACCGCGGGCG ACCGCGACAC GGTCGCGGAG GAGCTCGACC GGATCGCACC CGCCGAGCTG ATCGCGGGAC CGGACACCCC CGACTTCAAG CCGGCCGACG CCGAGCGCGG CTGGACGGTC CACGAGTACG ATTCCGGCGT TTTCGAGCGG CGAGCCGCGA TCGAACGACT GGAGCCGTAC CTCCCCGCGC CCGACCGCCG GTTTGACGGC GACGCCGAGC TACGTGCGGC TGGCGCCGTG CTCGCGTACG CCGAGTACAC GCAGGGCGAC GACGGCCCGC TCTCGTACGT TACCCGGATC CGGCGGTACG ACCCGCGCGA CCGGCTCCGG CTCGACGCGG CGGCCCAGCG CAGCCTCGAA CTGTTCGAGA ACCGCGGACT GGGCGCGAGC GACACCCTGT TCGACGCGCT CGACGAGACG AGGTGCGCGC TCGGCCGGCG GTGTTTAGAA CGGTGGCTCC GTCGCCCGCT CGTCGACGCC GACGCGATCC GGAGCCGCCA CGACGCGGTC GGCGAGCTGG CCGATCGCAC CCTCGTCCGT GAGGGAGTCG CCGACGCGCT CGCGGCCGCC TACGACCTCG AACGCCTCGT CGGACGGATC TCCCGCGGAC GGGCCGACGC TCGCGACCTG CGCTCGCTGC ACGCGACGCT CGCGGTCGTG CCGGATCTGA AGGCGACGCT GGCGGGGGGG AACGGTGGGG ACGATAGAGA GGGCGGAACC GACGCCGACC GCCCCCGCAC CGACCACCTC CGCGACCTCC GCGACCGCCT CGACGAGCTG ACCGAGGTCC GCGAGCTGAT CGACGACGCT ATCGCCGCAA ACCCCCCGCC GGAGATCACC GAGGGCGGCG TGATCGGCGA GGGGTTCGAC GACGACCTCG ACGCCCTCCG CGCCACCGAG CGCGAGGGGC GCGAGTGGGT CGCAGACTTG GAGGCGAGCG AGCGCGAGCG CACCGGGATC GACTCGCTGT CGGTCGGCCA CAATCAGGTC CACGGCTACT ACATCGAGGT GACCGACGCC AACCTCGACC GCGTCCCCGA CGACTACCGC CGCCGACAGA CGCTGAAGAA CAGCGAGCGC TACTACACGC CCGAACTGAA GGAGCGCGAA GAGGAGATCG TCGGCGCCGC GGAGCGCGCG GACGCCTTGG AGTACGAGCT GTTCGTCGAC GTGCGCGAGC GCGTCGGGAG CGAGACGGAA CGGATACAGG GGCTCGCGGA CGCGATCGCC GAGATCGACG CGCTCCGCTC GCTGGCGACC 
GTCGCCGTCG AGTACGACTA CGTCCGCCCG GAGATCGTCG ACGAACCCGC TTCCGATACG AACGCCGGCG TCGAGATCGA AGGGGGCCGC CACCCGGTCG TCGAGCGCGC CGAGGAGTCG TTCGTCCCGA ACGACGCGGA CCTCCCGCGC GGGTCGATCG CGGTTATCAC GGGGCCGAAC ATGAGCGGGA AGTCAACGTA CATGCGGTCG GTCGCGCTCG CGGTCGTGTT AGCGCAGACC GGCTCGTTCG TCCCCGCGCA GGCGGCCTCG CTCCCCGTCT TCGACCGGCT GTTCACCCGC GTCGGCGCCT CCGACGACAT CGCCGGCGGC CAGTCGACGT TCATGCGCGA GATGAGCGAG CTGACCGAGA TCCTCCACGA CGCCGGCCCG GACTCCCTCG TCCTCCTCGA CGAGGTGGGC CGCGGTACCG CCACGACCGA CGGGCGGGCC ATCGCCCGCG CGGCCGCCGA GTTCATCCAC GACGAGCTCG GCGCGACCGC CATCTTTGCC ACCCACTACC ACGACCTGAC CGACCTCGCG GCTGAGCGCG AGCGCGTCTT CAACCTCCAC TTCACGGCGA CCCGCGAGGA CGGCGACGTG ACGTTCCTCC ACCGGATCGT CCCCGGCGCC TCCTCTTCCT CGTACGGCGT CGAGGTCGCC GAACTCGCCG GCGTGCCGGC GCCCGTCGTC GAGCGGTCCC GTTCGCTGGT GACCGCCGAC ACTGCGGGTG GCTCCCCTGA CAGCGAAGCC GCGTCGGAGG ACGACGAGTC GGATGGGCGG GACGCACCGG CCGAGGAGGA GAAGCCGGAC AGCGTTTCGC TCCGCGAGTT CCTCGCCGAG GAGAGCGCGG GCGACGACGC CGACGACACC CCGAGCGCGA GCGACGACGT TCCAAACGCG AGCGACACGG ACGGCAATAT CGACACCCCG ACCGCAGACG ATCGGGCCGG AGTCGACGCC GAAAGCGACC TGACCGCCGA TCTCCGCGAT CTCGACCTCG CCCGGATGAC GCCAATAGAG GCGCTGAACG CCCTCCACGA CCTCCAGTCG CGAGCCGACG ATGACGGGTG A
|
Protein sequence | MPTADHETVT GLPPGIAAAR EELTPMLSQY ADLCAAHEDA LVLFQVGDFY EAFCEAAEAV AGVCEVTLTE RSDSTGDYPM AGIPIDNAAP YLGSLLDAGY RVALGDQVED AEQASGLVDR AVTEVITPGT VVEDELLEAG TTNYVAAVAG GGEDGESDDA PEPAPVGLAA VDVSTGECLV TAGDRDTVAE ELDRIAPAEL IAGPDTPDFK PADAERGWTV HEYDSGVFER RAAIERLEPY LPAPDRRFDG DAELRAAGAV LAYAEYTQGD DGPLSYVTRI RRYDPRDRLR LDAAAQRSLE LFENRGLGAS DTLFDALDET RCALGRRCLE RWLRRPLVDA DAIRSRHDAV GELADRTLVR EGVADALAAA YDLERLVGRI SRGRADARDL RSLHATLAVV PDLKATLAGG NGGDDREGGT DADRPRTDHL RDLRDRLDEL TEVRELIDDA IAANPPPEIT EGGVIGEGFD DDLDALRATE REGREWVADL EASERERTGI DSLSVGHNQV HGYYIEVTDA NLDRVPDDYR RRQTLKNSER YYTPELKERE EEIVGAAERA DALEYELFVD VRERVGSETE RIQGLADAIA EIDALRSLAT VAVEYDYVRP EIVDEPASDT NAGVEIEGGR HPVVERAEES FVPNDADLPR GSIAVITGPN MSGKSTYMRS VALAVVLAQT GSFVPAQAAS LPVFDRLFTR VGASDDIAGG QSTFMREMSE LTEILHDAGP DSLVLLDEVG RGTATTDGRA IARAAAEFIH DELGATAIFA THYHDLTDLA AERERVFNLH FTATREDGDV TFLHRIVPGA SSSSYGVEVA ELAGVPAPVV ERSRSLVTAD TAGGSPDSEA ASEDDESDGR DAPAEEEKPD SVSLREFLAE ESAGDDADDT PSASDDVPNA SDTDGNIDTP TADDRAGVDA ESDLTADLRD LDLARMTPIE ALNALHDLQS RADDDG
|
| |
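The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below shows one possible way to strip that layout and compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and the demonstration uses only the first three blocks of the gene sequence, so it does not reproduce the full-gene figure of 72%.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First 30 bases of the gene sequence, in the record's block layout.
first_blocks = "ATGCCGACTG CGGACCACGA GACGGTGACC"

print(len(clean_sequence(first_blocks)))        # 30
print(round(gc_fraction(first_blocks) * 100))   # 67 (percent GC for these 30 bases only)
```

Passing the complete gene sequence to `gc_fraction` would give the value to compare against the GC content field.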