Gene Hlac_2226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2226 
Symbol 
ID7399934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2208885 
End bp2211755 
Gene Length2871 bp 
Protein Length956 aa 
Translation table11 
GC content72% 
IMG OID643709298 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_002566873 
Protein GI222480636 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.137389 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.561287 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGACTG CGGACCACGA GACGGTGACC GGGCTCCCGC CGGGGATCGC CGCCGCCCGC 
GAGGAGCTGA CACCGATGCT CTCGCAGTAC GCCGACCTGT GTGCAGCCCA CGAGGACGCC
CTCGTGTTGT TCCAGGTCGG CGACTTCTAC GAGGCGTTCT GCGAGGCGGC CGAGGCGGTC
GCCGGCGTCT GCGAGGTGAC GCTGACGGAG CGCTCCGACT CCACCGGCGA CTACCCGATG
GCCGGGATCC CCATCGACAA CGCCGCGCCC TACCTCGGAT CGCTCCTCGA TGCCGGCTAC
CGCGTCGCCC TCGGCGATCA GGTCGAAGAC GCCGAGCAGG CGTCCGGGCT CGTCGACCGC
GCCGTCACCG AGGTGATCAC TCCCGGGACC GTCGTCGAAG ACGAGCTGCT GGAGGCGGGG
ACGACGAACT ACGTGGCGGC GGTGGCGGGC GGAGGCGAGG ACGGCGAGAG CGACGACGCC
CCCGAGCCGG CCCCCGTCGG CCTCGCCGCC GTCGACGTCT CGACCGGCGA GTGTCTCGTC
ACCGCGGGCG ACCGCGACAC GGTCGCGGAG GAGCTCGACC GGATCGCACC CGCCGAGCTG
ATCGCGGGAC CGGACACCCC CGACTTCAAG CCGGCCGACG CCGAGCGCGG CTGGACGGTC
CACGAGTACG ATTCCGGCGT TTTCGAGCGG CGAGCCGCGA TCGAACGACT GGAGCCGTAC
CTCCCCGCGC CCGACCGCCG GTTTGACGGC GACGCCGAGC TACGTGCGGC TGGCGCCGTG
CTCGCGTACG CCGAGTACAC GCAGGGCGAC GACGGCCCGC TCTCGTACGT TACCCGGATC
CGGCGGTACG ACCCGCGCGA CCGGCTCCGG CTCGACGCGG CGGCCCAGCG CAGCCTCGAA
CTGTTCGAGA ACCGCGGACT GGGCGCGAGC GACACCCTGT TCGACGCGCT CGACGAGACG
AGGTGCGCGC TCGGCCGGCG GTGTTTAGAA CGGTGGCTCC GTCGCCCGCT CGTCGACGCC
GACGCGATCC GGAGCCGCCA CGACGCGGTC GGCGAGCTGG CCGATCGCAC CCTCGTCCGT
GAGGGAGTCG CCGACGCGCT CGCGGCCGCC TACGACCTCG AACGCCTCGT CGGACGGATC
TCCCGCGGAC GGGCCGACGC TCGCGACCTG CGCTCGCTGC ACGCGACGCT CGCGGTCGTG
CCGGATCTGA AGGCGACGCT GGCGGGGGGG AACGGTGGGG ACGATAGAGA GGGCGGAACC
GACGCCGACC GCCCCCGCAC CGACCACCTC CGCGACCTCC GCGACCGCCT CGACGAGCTG
ACCGAGGTCC GCGAGCTGAT CGACGACGCT ATCGCCGCAA ACCCCCCGCC GGAGATCACC
GAGGGCGGCG TGATCGGCGA GGGGTTCGAC GACGACCTCG ACGCCCTCCG CGCCACCGAG
CGCGAGGGGC GCGAGTGGGT CGCAGACTTG GAGGCGAGCG AGCGCGAGCG CACCGGGATC
GACTCGCTGT CGGTCGGCCA CAATCAGGTC CACGGCTACT ACATCGAGGT GACCGACGCC
AACCTCGACC GCGTCCCCGA CGACTACCGC CGCCGACAGA CGCTGAAGAA CAGCGAGCGC
TACTACACGC CCGAACTGAA GGAGCGCGAA GAGGAGATCG TCGGCGCCGC GGAGCGCGCG
GACGCCTTGG AGTACGAGCT GTTCGTCGAC GTGCGCGAGC GCGTCGGGAG CGAGACGGAA
CGGATACAGG GGCTCGCGGA CGCGATCGCC GAGATCGACG CGCTCCGCTC GCTGGCGACC
GTCGCCGTCG AGTACGACTA CGTCCGCCCG GAGATCGTCG ACGAACCCGC TTCCGATACG
AACGCCGGCG TCGAGATCGA AGGGGGCCGC CACCCGGTCG TCGAGCGCGC CGAGGAGTCG
TTCGTCCCGA ACGACGCGGA CCTCCCGCGC GGGTCGATCG CGGTTATCAC GGGGCCGAAC
ATGAGCGGGA AGTCAACGTA CATGCGGTCG GTCGCGCTCG CGGTCGTGTT AGCGCAGACC
GGCTCGTTCG TCCCCGCGCA GGCGGCCTCG CTCCCCGTCT TCGACCGGCT GTTCACCCGC
GTCGGCGCCT CCGACGACAT CGCCGGCGGC CAGTCGACGT TCATGCGCGA GATGAGCGAG
CTGACCGAGA TCCTCCACGA CGCCGGCCCG GACTCCCTCG TCCTCCTCGA CGAGGTGGGC
CGCGGTACCG CCACGACCGA CGGGCGGGCC ATCGCCCGCG CGGCCGCCGA GTTCATCCAC
GACGAGCTCG GCGCGACCGC CATCTTTGCC ACCCACTACC ACGACCTGAC CGACCTCGCG
GCTGAGCGCG AGCGCGTCTT CAACCTCCAC TTCACGGCGA CCCGCGAGGA CGGCGACGTG
ACGTTCCTCC ACCGGATCGT CCCCGGCGCC TCCTCTTCCT CGTACGGCGT CGAGGTCGCC
GAACTCGCCG GCGTGCCGGC GCCCGTCGTC GAGCGGTCCC GTTCGCTGGT GACCGCCGAC
ACTGCGGGTG GCTCCCCTGA CAGCGAAGCC GCGTCGGAGG ACGACGAGTC GGATGGGCGG
GACGCACCGG CCGAGGAGGA GAAGCCGGAC AGCGTTTCGC TCCGCGAGTT CCTCGCCGAG
GAGAGCGCGG GCGACGACGC CGACGACACC CCGAGCGCGA GCGACGACGT TCCAAACGCG
AGCGACACGG ACGGCAATAT CGACACCCCG ACCGCAGACG ATCGGGCCGG AGTCGACGCC
GAAAGCGACC TGACCGCCGA TCTCCGCGAT CTCGACCTCG CCCGGATGAC GCCAATAGAG
GCGCTGAACG CCCTCCACGA CCTCCAGTCG CGAGCCGACG ATGACGGGTG A
 
Protein sequence
MPTADHETVT GLPPGIAAAR EELTPMLSQY ADLCAAHEDA LVLFQVGDFY EAFCEAAEAV 
AGVCEVTLTE RSDSTGDYPM AGIPIDNAAP YLGSLLDAGY RVALGDQVED AEQASGLVDR
AVTEVITPGT VVEDELLEAG TTNYVAAVAG GGEDGESDDA PEPAPVGLAA VDVSTGECLV
TAGDRDTVAE ELDRIAPAEL IAGPDTPDFK PADAERGWTV HEYDSGVFER RAAIERLEPY
LPAPDRRFDG DAELRAAGAV LAYAEYTQGD DGPLSYVTRI RRYDPRDRLR LDAAAQRSLE
LFENRGLGAS DTLFDALDET RCALGRRCLE RWLRRPLVDA DAIRSRHDAV GELADRTLVR
EGVADALAAA YDLERLVGRI SRGRADARDL RSLHATLAVV PDLKATLAGG NGGDDREGGT
DADRPRTDHL RDLRDRLDEL TEVRELIDDA IAANPPPEIT EGGVIGEGFD DDLDALRATE
REGREWVADL EASERERTGI DSLSVGHNQV HGYYIEVTDA NLDRVPDDYR RRQTLKNSER
YYTPELKERE EEIVGAAERA DALEYELFVD VRERVGSETE RIQGLADAIA EIDALRSLAT
VAVEYDYVRP EIVDEPASDT NAGVEIEGGR HPVVERAEES FVPNDADLPR GSIAVITGPN
MSGKSTYMRS VALAVVLAQT GSFVPAQAAS LPVFDRLFTR VGASDDIAGG QSTFMREMSE
LTEILHDAGP DSLVLLDEVG RGTATTDGRA IARAAAEFIH DELGATAIFA THYHDLTDLA
AERERVFNLH FTATREDGDV TFLHRIVPGA SSSSYGVEVA ELAGVPAPVV ERSRSLVTAD
TAGGSPDSEA ASEDDESDGR DAPAEEEKPD SVSLREFLAE ESAGDDADDT PSASDDVPNA
SDTDGNIDTP TADDRAGVDA ESDLTADLRD LDLARMTPIE ALNALHDLQS RADDDG