Gene Hlac_0119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0119 
Symbol 
ID7401640 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp123768 
End bp126563 
Gene Length2796 bp 
Protein Length931 aa 
Translation table11 
GC content71% 
IMG OID643707183 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_002564795 
Protein GI222478558 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.893903 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAACGG GGATCGTCGG GGAGTTCCTC GACCTCAAGG CCGAGACCGA CGCGGACATC 
CTCGCCATGC AGTGCGGCGA CTTCTACGAG TTCTTCGCGG ACGACGCCGA GCTGGTCGCC
GACGAGCTGG ACCTGACCGT CTCACAGAAG TCCTCGCACG GCTCGTCGTA CCCGATGGCG
GGCGTGCCGC TCTCGGAGCT GACCCCGTAC GTGAAGGCGC TCGTCGAGCG CGGCTACCGG
GTCGCCGTCG CCGACCAGTA CGAGACCGAG GACGGCCACG CCCGGGAGAT TACCCGCGTC
GTCACGCCTG GGACGCTCCT CGAAACCGCC GACGACGACG CGCGGTACCT CGCGGCGATC
GTCCGCGAGG GCGACGACGC GGACGGCCCC TACGGGCTCG CGCTCGCCGA CGTGACCACG
GGCCGGTTCC TCGTCACCGA AGTCGACGAC GAGGGCGACC TCCGCGCGGA GCTGTACCGC
TTCGACCCCG CCGAGGTGCT CCCGGGACCG CGCGTCCGCA ACGACGACCG ACTGCTCGGA
GCGGTCCGAG AGGACCTCTC GGGGTCGGTT TCCGTCTTCG ACGCGGAGGC GTTCGCGCCG
GGACGCGCGA AACACGCGGT CCGCGAGCAG TTCGGGCGGG AGACCGCCGA CAGCGTCGGC
ATCGACTCCG AACTGGCGCT GCGCGCGGCG GGAGCCGTCC TCGGCTACGT CGAGGAAACC
GGCGCCGGCG TGTTGGCATC GATCACCCGT CTCACGGCCT ACGGCGACGG CGACCACGTC
GCCGTCGACG CCACGACCCA ACGCAACCTC GAACTCACCG AGACGATGCG CGGCGACGCC
GACGGCTCGC TGTTCGAGAC GGTCGATCAC ACCGTCACCG CCGCCGGCGG CCGCCTCCTC
CGAGAGTGGA TCACCCGCCC GCGCCGGGAC CGCGAGGAAC TGAACCGCCG GCTCGACGCG
GTGGAGGCGC TCGCGTCGGC AGCGCTCGCG CGCGACCGCC TGCGAGAGAC GCTCGGCGAC
GCGTACGATC TCGAACGGCT CGCGGCGCGG GCGACTAGCG GGAGCGCGGG CGCGCGGGAA
CTCCTTTCGG TGCGGGACTC GCTGGCGCTG GTGCCCGCGC TCGCCGACGC CGTGTCCGGG
ACCGCGCTCG CGGACTCCCC GGTCGCAGCG GTGCTGGAGC GAATCGACCG CGAGCGCGCC
GCGACCCTCC ACGACGAACT CGCGGACGCG CTCGCGGAGG ACCCGCCGAA GACGAAGACG
CAGGGCGGGC TACTCAGGGT GGGATACGAC GGCGAGCTCG ACGAGCTGAT CGCGCGCCAC
GAGAAGGCGA ACGAGTGGCT CGATAGGCTC GCAGAGCGCG AAAAGCGGCA GTACGGGTTG
AGTCACGTCA CCGTCGACCG CAACAAGACG GACGGTTACT ACATCCAGGT CGGCAAATCC
GCGGCCGACG GGGTTCCCGA GCACTACCGC GAGATCAAGA CGCTGAAGAA CTCGAAGCGG
TTCGTCACCG ACGAACTGGA AGAGCGGGAA CGCGAGGTGC TCCGGTTGGA GGAGGCCCGG
GGCGAGCTGG AGTACGAGCT GTTCGAGGAG CTCCGAGAGC GGGTCGCCGC CGACGCCGAA
CTCTTACAGG ACGTGGGGCG AGCGGTCGCC GAGATCGACG CGCTCGCGTC GCTGGCGACC
CACGCCGCCG GCAACGACTG GACGCGACCC GAACTCGCCG ACGAGCGCCG GCTCGACGTC
GAGGCCGGGC GCCACCCGGT CGTCGAGCGG ACGACCGATT TCGTGCCGAA CGATCTCCGG
CTCGACGGGG AGCGCGGCTT CCTCATCGTC ACCGGGCCGA ACATGAGCGG GAAATCGACG
TATATGCGGC AGGCGGCGCT GATCCAGCTG CTCGCGCAGG CGGGGTCGTT CGTCCCCGCG
CGGACGGCGA CGGTCGGGCT CGTCGACGGA ATCTACACCC GCGTCGGCGC GCTCGACGAG
CTGGCACAGG GGCGCTCCAC GTTCATGGTG GAGATGCAGG AGCTGTCGAA CATCCTCCAC
TCGGCGACCG CCGACTCGAT CGTCATCCTC GACGAGGTCG GCCGCGGCAC CGCCACCTAC
GACGGCATCT CCATCGCGTG GGCCGCGACC GAGTATTTAC ATAACGAGGT GCGCGCGCGG
ACCCTCTTCG CCACGCACTA CCACGAGCTG ACGACGCTGG CAGACCACCT CCCGCGCGTG
GAGAACGTCC ACGTCGCCGT CGACAAGCGC GACGGCGAGG TGACGTTCCT CCGGACCGTT
CGCGACGGCC CGACAAATCG GTCGTACGGG GTCCACGTCG CCGACCTCGC GGGCGTTCCG
GCTCCAGTCG TCTCCCGCGC CGGGACGGTG CTCGACCGGC TTCGCGAGGA GAAGGCGATC
GAGGCGAAGG GCGGAGCGCG GGGCGGAGGG GGCGAACGCG GAGGCTTCAC CGGCACCGCC
GACGGCGACA CGAAACAGGT CGTTTTCGAC CTCTCGTCCG GGTCGTTCTC TGAAAGCGAC
GACGCGGAGT CGACCGCGGC CGGCGCCCCC GGCTCCGGAG GAGGTCGAAA CGGGGCGACT
CCGGGGTCGG CATCGGACGG CGCCAGTGGG TCCGCGGGGA CCGCAGAGAC CGCAGGAGCC
GCAGAGACCG CTGAAAGCGC GGAGACCGAC CGGTTCGATC CCGAGACCCG CGCCGTGATC
GAGGAACTGG CCGACGTCGA TGTCGCGGAG ACCGCGCCGG TGGAGTTGCT GTCTCGGGTT
CAAGAGTGGC AAGAGCGGCT CGACGAGAAC CGCTGA
 
Protein sequence
MPTGIVGEFL DLKAETDADI LAMQCGDFYE FFADDAELVA DELDLTVSQK SSHGSSYPMA 
GVPLSELTPY VKALVERGYR VAVADQYETE DGHAREITRV VTPGTLLETA DDDARYLAAI
VREGDDADGP YGLALADVTT GRFLVTEVDD EGDLRAELYR FDPAEVLPGP RVRNDDRLLG
AVREDLSGSV SVFDAEAFAP GRAKHAVREQ FGRETADSVG IDSELALRAA GAVLGYVEET
GAGVLASITR LTAYGDGDHV AVDATTQRNL ELTETMRGDA DGSLFETVDH TVTAAGGRLL
REWITRPRRD REELNRRLDA VEALASAALA RDRLRETLGD AYDLERLAAR ATSGSAGARE
LLSVRDSLAL VPALADAVSG TALADSPVAA VLERIDRERA ATLHDELADA LAEDPPKTKT
QGGLLRVGYD GELDELIARH EKANEWLDRL AEREKRQYGL SHVTVDRNKT DGYYIQVGKS
AADGVPEHYR EIKTLKNSKR FVTDELEERE REVLRLEEAR GELEYELFEE LRERVAADAE
LLQDVGRAVA EIDALASLAT HAAGNDWTRP ELADERRLDV EAGRHPVVER TTDFVPNDLR
LDGERGFLIV TGPNMSGKST YMRQAALIQL LAQAGSFVPA RTATVGLVDG IYTRVGALDE
LAQGRSTFMV EMQELSNILH SATADSIVIL DEVGRGTATY DGISIAWAAT EYLHNEVRAR
TLFATHYHEL TTLADHLPRV ENVHVAVDKR DGEVTFLRTV RDGPTNRSYG VHVADLAGVP
APVVSRAGTV LDRLREEKAI EAKGGARGGG GERGGFTGTA DGDTKQVVFD LSSGSFSESD
DAESTAAGAP GSGGGRNGAT PGSASDGASG SAGTAETAGA AETAESAETD RFDPETRAVI
EELADVDVAE TAPVELLSRV QEWQERLDEN R