Gene Hmuk_3287 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_3287 
Symbol 
ID8409365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013201 
Strand
Start bp82935 
End bp85757 
Gene Length2823 bp 
Protein Length940 aa 
Translation table11 
GC content71% 
IMG OID645018221 
Productserine/threonine protein kinase 
Protein accessionYP_003175742 
Protein GI257372968 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.092483 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGACA CCGACGAGGA CGAAGTCCAA CTCGACGAGG CCCGGGCGTT CCTCGCGACG 
CTGCCCGACG ACCCGGCCGC CTACACCGAG GCCGACCGGT GGCGCATCGA GGAGTTGCTG
CTGTCTGACA ACCACGGCGT CGTCCACGTC ACGGCGCAAC TGGTCAGAGA GGCGACGGCA
CGGGACGCAG AGGCGATGGC ACCGCTGGTA CAACCGCTGA TCGACGCGGT CGTCCACACC
GACGGGCTCC GCCGTCCGGA ACTGGTGGCC GCGATCGCGG AGTTCGATCG CACACGAGTC
GCGGCGGCCG ATCCCCAGTT CGACCGGACT CCGGTCGCGC GGGCCGACCC CGTCGGCCAG
TACCACCGAA CGCTGGCCGA CACCGAAAAC GAGGAGCTCG TCCGCTCGAA ACGGGCGGCA
GCACTCGCGG CGATCTACGA CCTCTTTCCC CAGGAAGTCC GGGACACGGT CCCGACGCTG
CTCGACTGTC TGGAACCGGC CGAGGACGAT GTCGGCCTGC TCTGTGAGGC CGCCGGGAAG
GCCCTGGGGA CGATCGCGGG CAAAGACGAT CAGGTGCGCG CACGGGTCAT AGAACGGTTC
GAGCGCGAAC TCGACCGCGG GTCGCTCCCG AACAAAGGCG TGGTGTACGG CGTGGGGAAC
CTCGTGACTT GCCGCTCGGA CGCGGGGTTG ACGCTCGTCG AGCCGCTGGT CGAGGCGACG
AACCGGGCCG GCGTCGACAG TGCCGCCCAC GAGGCGGAGC GCTGGCACGT CGTCAGCGCG
CTCGAGACGA TCGCAGAGGA GCGCCCGGCT CTTCTCGATC ACCACACAGA CCGGCTCACG
CTGCTCCTGA CCGACGACGA GACGATGGTC CGGAAGCAGG CGATCGGCCT GTTCGAGACG
CTCGGCGAGA CGGAGCCGGC CGTCCTCGAC CGCGTCGCCG ACGACCTCGT TGCCGCCGCC
GACGATCCCG ACCCGAAGAT ACGACGCGCG GTCGTGAGCG CTCTCGCGCC GCTGGAGTCG
CGACTGGCCG AGGCGGCCGA ACCTGCGGCC CGCGCGCACC TCGCGGCTCT GGAGTCGGCG
GCGGATTTCC GGAAGAGCAC GGTGCTGGGC GAGCTGGGCC GGCTGGCCGA GCACTGTCCG
GACCTGGTCG TCGGCGGGCT GGTCGAGATC GTCGCCCACC TCGATCACCA GTCGTGGGGC
GTCAGGTCGA GTGCAGCGTC CGCGCTCGGC AGGCTGGGTA GCGAACGGCC CGACGCCGTT
CGCCCGCTGG TCGAGCGGGA CGCGTCGGGA CTGATCGCTT CGACCGGGTC GACGCCCCTC
CTCGGTCTCC TCGACGACGA GCAGGACTCG GTCCGTGCGG ACGCTGTCCA CGCCCTCGGA
CGGATCGGCG CGGCCGATCG GGAACTGGCC GACGTCGTGT TTCCAGCCGT TCTCGGGGCC
GTCGACGACG GTGACTCACA CGTCGAGCAA CGGGCGCTCC GGGCTTTCGG ACGGATCGGC
GCGGCCCATC CCGACGCGGT GACCGACGAT CACGTCTCGA CGCTCGTCAC CAGCGTGGGG
CACGTCAGCG AGACGGTCAG AGCGGCGGCG GCCGAGGCGA TCGGCGCGCT CGCGGCGAGC
GGTCTGGATC TGCGTGTCGC TCGCGATCCG CTCGTCGACC TGCTCGACGA CTACTATCCC
CGTCCCCGCG TCGAAGCCTG CGAGTCGCTC GCGGCACTCG GCGCGGAGCC GGCGGCTGCA
CGGATCGCGC CACTCCGCAC ACACTACGAC GACGACGTTC GGGAGGCCGC CCAGGCGGCA
CTCGCGGCGC TCGACTACGA CCCCGACGGT TCGCCCGAGT CGGACTCGAA GTCGTCGGCT
GACGCGGGCT CCGGTTCGGA GCGCCCGGCG GGAGCACGGT CCGAGGCGAC CGGTCGCCGG
GTCCGCGACG ACCGGTCTGA GACCACCAGC GAGGCCGCTT CGACGGCGAT ACCGACCGGC
CAGCCGACAC AGGCCCCGTC GCTGTCGCTG GACTACGGCG ACGTGGAGAC CGGTCGAGTC
GTCGGCCGTG GCGGCAACGC AGTCGTCCGG GAGGGGACCG TCGAGATCGA CGGGGAGTCG
CGGACGGTCG CCGTCAAAGA GCCCGTCACC GAGGGGACAC AGACGGCAGA CGACATCGCC
GCCTTCGAGC GCGAGGCGCG ACAGTGGGAC CGACTCTCGG ACCGTGCTCA CGTCGTCGGC
GTCGTCGACT GGGACGTCGA CCCCCTCCCC TGGATCGCCC TGGAGTACAT GGACGGGGGC
TCGCTGGCGG CCAGAGTCGG CGAGTGTGAC TACCGCCAGA CGCTGTGGAT CTGCGAGCGC
GTCGCGAACG CGGTCCGACA CGACCAGACC GGGATGGCAC ACCTCGATCT CAAACCCGCG
AACATCCTCT TCGAGTCGAC GCCCGACGGC GTCTGGGACG TGCCGAAGGT CGCCGACTGG
GGAATCGCCC GACTGCTGAA CGAGCACGGT GCGACTGCCG GCGGGATGAC CGCGACGTAC
GCGGCACCGG AACAGTTCGA CCCCGACACC TACGGGACGC CCGACAGCAG GACGGACATC
TACCAGCTCG GGGTCGTCCT CTACGAGCTG CTCGTGGGCG AGCCGCCCTT TACCGGCTCC
GACGTGGACG TGATGCAGAG CGTTCTGTCC GACGAGCCCG CACTGCCGGG CGAGCGAGTC
GACGGTCTCC CGGACGGGAT CGACGACACC GTCGAGACGG CCCTCGCGAC GGACCCCGAG
CGCCGCTTCG AGACGATCGA ACGGATGCAG TCGCGGCTGG GAGCCCACCT CGACGAGCTG
TAG
 
Protein sequence
MSDTDEDEVQ LDEARAFLAT LPDDPAAYTE ADRWRIEELL LSDNHGVVHV TAQLVREATA 
RDAEAMAPLV QPLIDAVVHT DGLRRPELVA AIAEFDRTRV AAADPQFDRT PVARADPVGQ
YHRTLADTEN EELVRSKRAA ALAAIYDLFP QEVRDTVPTL LDCLEPAEDD VGLLCEAAGK
ALGTIAGKDD QVRARVIERF ERELDRGSLP NKGVVYGVGN LVTCRSDAGL TLVEPLVEAT
NRAGVDSAAH EAERWHVVSA LETIAEERPA LLDHHTDRLT LLLTDDETMV RKQAIGLFET
LGETEPAVLD RVADDLVAAA DDPDPKIRRA VVSALAPLES RLAEAAEPAA RAHLAALESA
ADFRKSTVLG ELGRLAEHCP DLVVGGLVEI VAHLDHQSWG VRSSAASALG RLGSERPDAV
RPLVERDASG LIASTGSTPL LGLLDDEQDS VRADAVHALG RIGAADRELA DVVFPAVLGA
VDDGDSHVEQ RALRAFGRIG AAHPDAVTDD HVSTLVTSVG HVSETVRAAA AEAIGALAAS
GLDLRVARDP LVDLLDDYYP RPRVEACESL AALGAEPAAA RIAPLRTHYD DDVREAAQAA
LAALDYDPDG SPESDSKSSA DAGSGSERPA GARSEATGRR VRDDRSETTS EAASTAIPTG
QPTQAPSLSL DYGDVETGRV VGRGGNAVVR EGTVEIDGES RTVAVKEPVT EGTQTADDIA
AFEREARQWD RLSDRAHVVG VVDWDVDPLP WIALEYMDGG SLAARVGECD YRQTLWICER
VANAVRHDQT GMAHLDLKPA NILFESTPDG VWDVPKVADW GIARLLNEHG ATAGGMTATY
AAPEQFDPDT YGTPDSRTDI YQLGVVLYEL LVGEPPFTGS DVDVMQSVLS DEPALPGERV
DGLPDGIDDT VETALATDPE RRFETIERMQ SRLGAHLDEL