Gene Hmuk_0971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0971 
Symbol 
ID8410488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp930698 
End bp932356 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content67% 
IMG OID645019307 
Productthermosome 
Protein accessionYP_003176807 
Protein GI257387034 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02339] thermosome, various subunits, archaeal 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.497136 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0946075 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGGTC AGCCGATGAT CATTCTGGGA GAGGACTCCC AGCGGATGAA GGACAAGGAC 
GCACAGAGCC ACAACATCTC GGCGGCCCGC GCGGTCGCGG AGTCGGTCCG ATCGACACTC
GGCCCGAAGG GGATGGACAA GATGCTCGTC TCCTCGCTCG GTGACGTGAC CGTCACGAAC
GACGGCGTCA CTATCCTCAC GGAGATGGAC ATCGACAACC CGACCGCCGA GATGATCGTG
GAGGTCGCCG AGGCCCAGGA AGACGAGGCC GGCGACGGCA CGACGACCGC CGTCTCCATC
GCGGGCGAAC TGCTGAAAAA CGCCGAGGAG CTCCTCGAAC AGGACATTCA CCCGACGGCG
ATCATCAAGG GCTTCGACCT CGCGTCGACG GAAGCCAAGA ACCAGATCGG CGAGATCGCC
ACGTCCGTCG ATCCCGACGA CGAGGAACTG CTCAAGAAGC TCGCCGAGAC CTCGATGACG
GGGAAGGGCG CGGAGCTCAA CAAGGAGCTG CTCGCCCAGC TCATCGTCGA CGCGGTCAAC
GCCGTCACCG TCGAGGCCGC GGACGGCTCG GTCATCGCCG ACCTGGAGTT CCTCAACATC
GAGACCCAGA CCGGTCGCGC GGTCTCGGAC TCGGAGCTCA TCGAGGGCGC GGTCGTCGAC
AAGGACCCGG TCCACGAGGA GATGCCGACC ACCGTCGACG ACGCCGACGT GCTCCTGCTC
GACACGCCGA TCGAGCTCGA CGAGACCGAA GTCGACGCAC AGCTCTCCGT CGACGACCCG
AGCCAGCTCC AGAACTTCCT CGACAAGGAA GAACAGCAGC TCGAAGAGAT GGTCGACGCC
ATCGCCGCGA CCGGCGCGAA CGTCGTCTTC TGCCAGAAGG GCATCGACGA CATGGCCCAG
CACTACCTCG CCAAGGAGGG CATCCTCGCC GTCCGACGGG CCAAGAAGTC CGACATCGAG
TTCCTCCGAG AGGTGCTCGG CGCGAACATC GTCTCCGACG TACACAACGC CACCGCCGAC
GACCTCGGCC ACGGCTCCGT CCGCCGGGAC ACCGAGGAAG AGCTCTTCTA CGTCGAAGGC
GCGGGCGAGG ACGCACACGG CGTCACGCTC CTCCTGCGTG CCTCCACCGA CCACGTCGTC
GACGAACTCG AACGCGGCGT CCAGGACGCG CTCGACGTGG TCGCCTCGAC CGTCGCGGAC
GGCCAGATCC TCGCCGGTGG CGGCGCACCC GAGGTCGAAC TCGCCAGCCG CCTGCGAGAC
TACGCGGACG GCGTCGAAGG CCGCGAACAG CTGGCCGTCG AGGCCTTCGC CGACGCGCTG
GAACTCATCC CGCGCACGCT CGCCGAGAAC GCGGGACTGG ACTCCATCGA CTCGCTGGTC
GACCTGCGCG CCGCCCACGA GGGCGGCGAC GTACAGGCCG GCCTCGACGT GTACAGCGGT
GACGTGGTCA ACACGCTCGA CGAGGGCGTC GTCGAGCCGG CCCACGCGAA GCGACAGGCG
ATCTCGTCGG CTGCCGAGGC GGCGAACCTC GTGCTCAAAA TCGACGACAT CATCGCTGCT
GGTGACCTCT CGACCAGTGG CGGCGACGAG GAAGGCGGTC CCGGCGGCGC GCCCGGCGGC
ATGGGCGGTA TGGGCGGCGG CATGGGCGGC ATGATGTAG
 
Protein sequence
MQGQPMIILG EDSQRMKDKD AQSHNISAAR AVAESVRSTL GPKGMDKMLV SSLGDVTVTN 
DGVTILTEMD IDNPTAEMIV EVAEAQEDEA GDGTTTAVSI AGELLKNAEE LLEQDIHPTA
IIKGFDLAST EAKNQIGEIA TSVDPDDEEL LKKLAETSMT GKGAELNKEL LAQLIVDAVN
AVTVEAADGS VIADLEFLNI ETQTGRAVSD SELIEGAVVD KDPVHEEMPT TVDDADVLLL
DTPIELDETE VDAQLSVDDP SQLQNFLDKE EQQLEEMVDA IAATGANVVF CQKGIDDMAQ
HYLAKEGILA VRRAKKSDIE FLREVLGANI VSDVHNATAD DLGHGSVRRD TEEELFYVEG
AGEDAHGVTL LLRASTDHVV DELERGVQDA LDVVASTVAD GQILAGGGAP EVELASRLRD
YADGVEGREQ LAVEAFADAL ELIPRTLAEN AGLDSIDSLV DLRAAHEGGD VQAGLDVYSG
DVVNTLDEGV VEPAHAKRQA ISSAAEAANL VLKIDDIIAA GDLSTSGGDE EGGPGGAPGG
MGGMGGGMGG MM