Gene Hmuk_0699 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0699 
Symbol 
ID8410213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp668511 
End bp670187 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content68% 
IMG OID645019034 
Productthermosome 
Protein accessionYP_003176537 
Protein GI257386764 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02339] thermosome, various subunits, archaeal 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.758666 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.97393 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGGTC AGCCCATGAT CGTGCTGGGA GAGGACTCCC AGCGAACCTC CGGAAAGGAC 
GCACAGTCGA TGAACATCAC GGCCGCACAG GCCGTCGCGG AGGCCGTACG GACGACACTC
GGCCCGAAGG GCATGGACAA GATGCTCGTC GACGACTCCG GCGGCGTCGT CGTCACCAAC
GACGGTGTCA CCATCCTCGA CGAGATGGAC ATCGAGCACC CCGCGGCCAA CATGATCGTC
GAAGTCGCCC AGACCCAGGA AGACGAGGTC GGCGACGGCA CGACGACCGC GGTCGTCATC
TCCGGCGAAC TCCTCTCGGA AGCCGAGGAC CTCATCGACC AGGACATCCA CGCCTCCATC
CTGGCACAGG GGTACCGCCA GGCCGCCGAG AAGGCAAAGG AGATCCTCGA AGAGCAGGCC
ATCGAGGTCG GCCCCGAGGA CACCGAGATG CTCAAGAAGG TCGCCGCGAC GGCGATGACC
GGCAAGGGCG CGGAATCCTC CAAGGACGTC CTCGCCGAAC TCGTCGTTCG CGCCGCACAG
TCCGTCGCCG ACGACGGCGA GGTCGACACC GACAACATCC AGCTCGAGGT CGTCGTCGGT
GGCTCCACCG ACGAGTCAGA GCTCGTCGAG GGCGTCATCA TCGACAAGGA GCGCGTCCAC
GACAACATGC CCTACGCCGT CGAGGACGCC AACATCGCGC TGCTCGACAC CGCGATCGAG
GTCCCCGAGA CCGAACTCGA CACCGAAGTC AACGTCACTG ACCCGGACCA GCTCCAGCAG
TTCCTCGACC AGGAAGAGGA GCAGCTCAAG GAGATGGTCG ATGACCTCAA AGCGGCCGGT
GCCGATGTCG TCGTGACCCA GAAGGGCATC GACGACATGG CCCAGCACTA CCTCGCACAG
GAGGGCATCC TCGCCGTCCG CCGTGCGAAG AAGTCCACCA TCAAGGCCCT CTCGCGCTCG
ACCGGCGCTC GCATCGTCTC GAACATCGCC GACGTGACCG AGGACGACCT CGGCTTCGCC
GGCTCGGTCG CCCAGAAGGA CGTTGCCGGC GACGAGCGCA TCTTCGTCGA GGACGTCGAC
GAGGCCAAGT CCGTCACGAT GATCCTCCGC GGTGGCACCG AACACGTCGC CGACGAGGTC
GAGCGCGCCA TCGAGGACTC GCTCGGCGTC GTCGCCGCCA CGCTCGAGGA CGGCAAGGTC
CTGCCCGGCG GCGGTGCCCC CGAGACACAG CTCGCACTCG GTCTGCGCGA CCACGCCGAC
TCCGTCGGTG GCCGCGAGCA GCTGGCCGTC GAGGCCTTCG CCGACGCCAT CGATGTCGTC
CCGCGCACCC TCGCCGAGAA CGCGGGTCTC GACCCGATCG ACTCGCTGGT CGACCTGCGC
AGCAAGCACG ACGGCGGCGA CAACACCGCC GGTCTCGACG CCTACACCGG CGAAGTCGTC
GACATGACCG AGGACGGCGT CGTCGAGCCG CTCCGCGTCA AGACCCAGGC CATCGAGTCC
GCCACCGAGG CGGCCGTGAT GATCCTGCGC ATCGACGACG TGATCGCTGC CGGCGACCTC
AAGGGTGGCG GCAGCGACGA CGACGAGGAC GACGCACCCG GCGGCCCCGG CGGCGCGCCC
GGCGGAATGG GCGGCGGCAT GGGCGGCATG GGCGGCGGCA TGGGCGGCAT GATGTGA
 
Protein sequence
MQGQPMIVLG EDSQRTSGKD AQSMNITAAQ AVAEAVRTTL GPKGMDKMLV DDSGGVVVTN 
DGVTILDEMD IEHPAANMIV EVAQTQEDEV GDGTTTAVVI SGELLSEAED LIDQDIHASI
LAQGYRQAAE KAKEILEEQA IEVGPEDTEM LKKVAATAMT GKGAESSKDV LAELVVRAAQ
SVADDGEVDT DNIQLEVVVG GSTDESELVE GVIIDKERVH DNMPYAVEDA NIALLDTAIE
VPETELDTEV NVTDPDQLQQ FLDQEEEQLK EMVDDLKAAG ADVVVTQKGI DDMAQHYLAQ
EGILAVRRAK KSTIKALSRS TGARIVSNIA DVTEDDLGFA GSVAQKDVAG DERIFVEDVD
EAKSVTMILR GGTEHVADEV ERAIEDSLGV VAATLEDGKV LPGGGAPETQ LALGLRDHAD
SVGGREQLAV EAFADAIDVV PRTLAENAGL DPIDSLVDLR SKHDGGDNTA GLDAYTGEVV
DMTEDGVVEP LRVKTQAIES ATEAAVMILR IDDVIAAGDL KGGGSDDDED DAPGGPGGAP
GGMGGGMGGM GGGMGGMM