Gene Hmuk_3203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_3203 
Symbol 
ID8412756 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp3093434 
End bp3095020 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content68% 
IMG OID645021548 
Productchaperonin Cpn60/TCP-1 
Protein accessionYP_003179013 
Protein GI257389240 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02339] thermosome, various subunits, archaeal 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.203796 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACAGT CCGGCCAGCG ACGCATGGGT GGCCAGCCTC TCTTCATTCT CGACGAGGAC 
GCCCAGCGTA CGCAAGGAGA CGACGCACAG AGTTCGAACA TCGCCGCCGG GAAGGCAGTC
ACCGAGTCCG TACGGACGAC ACTCGGCCCC CGTGGCATGG ACAAGATGCT CGTCTCGGAC
TCGGGCGATG TCGTCATCAC CAACGACGGT GCGACGATTC TCGACGAGAT GGACATCGAG
CATCCCGCGG CCCAGATGAT CGTCGAAGTC GCCGAGACCC AGGAAGAGGA GGTCGGTGAC
GGTACCACGA CGGCTTCCGT CCTGGCCGGT CAGCTCCTGA CACAGGCCGA GGACCTGCTC
GAAGACGACG TCCACCCGAC GACGATCGTC GAAGGCTACC ACGAGGCGGC ACAGCTCGCC
CAGGAAGCCA TCGACGCCGA AGTGCTGGAC GTCGACATCG ACGACGAGAC GCTGATCAGC
GTCGCCGAGT CCTCCATGAC GGGGAAAGGC ACCGGCGACG TGGAGGCCGG CGCACTCGCC
GAGACCGTCG TCGCCGCCGT CCGGCAGGCG ACCGACGGCT CCGGTTCGAC GCGCGACCAG
ATCGCGATTC GGAGCCAGGC CGGCGCGTCC TCCTCTGCGA CGGAGCTCGT CGAGGGGATC
ATCATCGAGG AGGAGCCGGT CCACGGCAAC ATGCCGAGCT CGGTCGCCGA CGCGACCATC
GCCGTGATCG ACGCCGACCT CGAAGTCCGA GAGAGCAACA TCGACGCAGA GTACAACGTC
TCCAGCGTCG ACCAGCTCAA CGCCGCCATC GAGGCCGAAG AGGAAGAGCT GTCGAGCTAC
GCCGAGGCCA TCGCCGCGGT CGGCGCAGAC GTCGTGTTCG TCTCCGGTGA CGTCGACGAC
CGCGTCGGCG CACAGCTCTC CAAGGAGGGC ATCGTCGCCT TCGACGGCGT CGACAGCGAC
GAACTCCAGG ACGTGACCCA CACCACCGGC GCGAGCCGCG TGGGCTCCGT CGACACGCTG
GAGGCAGACG ACCTCGGCGA GGCCGAGGAC GTGAGCGTCC AGAAGTACGG CGACGAGGAA
CTGGCCTTCG TGCAGGGCGG TGCCAGCTCC GAGACGGTGA CGATCTTCGC CCGCGGCGGG
ACCGACCACG TCACGGACGA ACTCGAACGC GCGCTCAACG ACGCGCTCGA CGTGGTCGTC
GCCGCACTCG ACAAGGGCGG CGTCGTCCCC GGTGCCGGTG CGACCGAGAT CGCCATCGCG
GACCACATCC GGAGCGAGGC CGCCTCCATC GAGGGCCGCA AGCAACTCGC CGTCGAGTCC
TTCGCCGACG CCGTCGACGT GATCCCGCGC ACGCTCGCGG AGAACACGGG CATGGACCCG
ATCGACGCGC TGGTCGACCT GCGCGCCGAA CACGAGAGTG AGGGCATCGC GGGCATCATC
AGCGAGGGCC AGACCGGCGT CATCGACGAC CCGATCGACT ACGGCGTCCT CGACCCCGCC
GCGGTCAAGC GCGAAGCCGT CGAGAGCGCC ACCGAGGCAG CGACGATGAT CGCTCGGATC
GACGACGTCA TCTCCTCGGA CGCGTAA
 
Protein sequence
MSQSGQRRMG GQPLFILDED AQRTQGDDAQ SSNIAAGKAV TESVRTTLGP RGMDKMLVSD 
SGDVVITNDG ATILDEMDIE HPAAQMIVEV AETQEEEVGD GTTTASVLAG QLLTQAEDLL
EDDVHPTTIV EGYHEAAQLA QEAIDAEVLD VDIDDETLIS VAESSMTGKG TGDVEAGALA
ETVVAAVRQA TDGSGSTRDQ IAIRSQAGAS SSATELVEGI IIEEEPVHGN MPSSVADATI
AVIDADLEVR ESNIDAEYNV SSVDQLNAAI EAEEEELSSY AEAIAAVGAD VVFVSGDVDD
RVGAQLSKEG IVAFDGVDSD ELQDVTHTTG ASRVGSVDTL EADDLGEAED VSVQKYGDEE
LAFVQGGASS ETVTIFARGG TDHVTDELER ALNDALDVVV AALDKGGVVP GAGATEIAIA
DHIRSEAASI EGRKQLAVES FADAVDVIPR TLAENTGMDP IDALVDLRAE HESEGIAGII
SEGQTGVIDD PIDYGVLDPA AVKREAVESA TEAATMIARI DDVISSDA