Gene Hmuk_0501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0501 
Symbol 
ID8410001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp469494 
End bp472331 
Gene Length2838 bp 
Protein Length945 aa 
Translation table11 
GC content70% 
IMG OID645018825 
Producthistidine kinase 
Protein accessionYP_003176342 
Protein GI257386569 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0931305 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCAT TCACTCGTCG GGCAGTCGTC GCCGGAACGG TCGTCGCGGT GGTCCTCGTC 
TGGAGCGGAC TCGTCCCGCT ATCGGCCGGT GCAGCCGGCG TCTCAGGCCT CGACTGTCAG
CCGGGGAGCG AGGGAGCCGT CTTCGCGGCC GACAGCGGCC TCGAAGCGGT GTACGACGGC
GAGACGCTGG ACGGCAATCC GTTCGTCGAC GACACCACGC TCGCGTTCCC GAACGTCACG
GTCAGTGCGA CCGACACCGC GTCGCTTCGC ATCGTCGCGG CGACCGACGA CGGCGTCTGC
CTGCGCTCTA TCGAACCCAC CAGTGCACCG GTCCGGGTGA CGCCGGACGC CGGTGAGACC
GTCGTCGTTC GCGATTCGCT GGTGAACCTG AGCTACGGCT CGTTTCGGTA CGCACGCTCG
GCCGGCGGCG TCGATCTGGC CTACAACGCC AGCGCGCCCG CCGCGATCAC GGTCGAGGAC
GGCGATCTCT CGGCCGGACG CACCGTCGAG GCCGTCGACG CCGACAGCGG GACGCAACTG
ACGACCGGAA CCGTCAGTGC CGACGACACG GTCGACCTCC AGTTGCCGGC GGGACACCGG
AACGTGGATC TGCGGTACGC GTCGACACAG ACGGCGACGG CCACGTCGAC CGTCCAGCCG
GCCGCGGCGA CAGCGACGGA CACCGCGACA CCGACCGCGA CGGACACTGC GACACAGACC
GCGACGAACG CCACGACACG GACACAGACA GCGACGGAGA CCGCGACACC GACCGCGCGG
GAGACGCCAC CGGACAGCGA CTCGGCCACG GGCGGTTCGT CGGCTGGCGA CGACACTGCG
ACGCCGACTC CGACGCCGGA CAGAACCCAG ACTGCGACTC CGACCGCGAC GCCCGCGAAC
GCCACGGAGT TCACCGTCCC GGAGTGGACC GGCGAGCTGG TGGCCTACGA GCCGACGCCA
CAGACCCAGC ACGTCGGCGG CCTGCTCCCG TTGACGGTGT CGCTGTGGGG ACTCGCCCTG
TGGCTCGTGG TCGCCCGTCG GGGCCCCGAG ACGCGCTTGC TCGCGCTCGT CCTCGTCGTC
GCCTCCGTGC GAGCGACGAG CGATCTGACA CAGATCGTGC TCGACGGCTT CGTGGGCGTC
GAAGCCCCGC TAGCGACGCT CAACCTCCTG CTGGAGTTCG CGACGGCGGT GCTGTTTGCC
GGCTTCGCCG TCCAGTACGC CGACATCGGC GAGCGCAGGA CCCGACACGC GAAACGGGCC
CTCGGTGTGC TCGGGGCAGT CGGAGCCACC GCCGTCCTGA CGAACCCCAT CCACGGACAG
GTCTTTACCG ACGCGGCCGT CGCGGCCGGC CCCTTCACCT ACGTGACGGC CAGCGTCGGT
CCGGTCGGCT GGCTCCTGTT CGCGCTCGCG ACCGGGCTGG TCGCTGCCGG GGGCGTGCTC
GTCGCTCGAA CCTTCGTCGT CGGCTCGCCG CGGGGAGCCT GGCGACCGGT TGCCGTCATC
GGGACCGGAC TCGCCGTCGC GATCGGCATC GCGGCTCTCG ACGTGCTGGA GCTGGGGCCG
GTGACCGGCT ACGACTACAG CGCGACCGGC GTCAACTACT TCCTCCTCGC GACGACCGTC
TCGCTGCTCG GCTACGGCTT CCAGCGGCTC AAGCCCAGCG GGCAGCGCTC GATCGTCGCC
GACCTCGACG ACGCGATCGT CATCCTCGAC GACGCCTGGC GAGTCGTCGA GTGGAACGGG
GCCGCCGAAG AGATCGTCCC GGAGCTGTCG ACCGGCCGCT CGTTCGACGC CGTCTTCTCC
GAGCCGCTGG CACGCCCGAC CGTCGACCAG ACGGTCACCC GCGAGATGAG CCTGGAGGTC
GATCGATGGG TGACCGACAG CGGAACCGAC GCCGAGCCGC CGACCGACAG TGACGACCGG
ACGGACGGAG ACTCGAACGG GACGGACGGT CAGGCGGCGG GATCGAGTGA ACGCAGCGAG
ACGACAGAAC CCGGAACGAA CGGGACAGAG CTGGACGGGG CCGCCGAAGG CCACGGCGAG
CCCCTCGACA CCGAGCGACG CCACTTCATC GTCAACGCGC GGGCAGTGAC CACCGAGACC
AGCGACGTGA TCGGCTACAC GGTCCGGTTC GCCGACGTGA CGGCACTGAA GCGCCACATG
TCCCAGCTCG AGCGTCGCAA CGAGCAGCTC GATCAGTTCG CGGGCGCTGT CACGCAGGAC
CTCCGCGGCC CGCTGGGCGA GGCACGCGAG GAGACGGAGC GCGTGCGAGC AGTGCTGGAG
GACGCCGACG AGCCGGAGGC GGTCGACCGG CGGGCGCTCA CGACCGCGCT CGGTTCGATC
GACGCCGCGC TGAACCGGAT GGCCCGGCTC GTCGAAGACA TCCTCGGACT GGCTCGCGAC
CGAGACTTAC AGACCGATCC GGAACCGATC CCGTTCGACG CGATCGTCGA GTCGGTCTGG
GACCGCTTCG ATCCGAAGGA AGCCACCCTC TCGGTCGAGG CGACCGGCGA GATCAGCGCC
GACCGCGAGC ACCTCGATCG GCTCCTCGCC GTGCTGGTCC GGAACGCGAT CCAGCACGGC
GGGGAGGGAG TCACCGTCCG CGTCGGCCTC GACGACGACG GGTTCTACGT GGCCGACGAC
GGCCCGGGCA TCGATCCGTC GGTCCGAGAC CGCGCGTTCG AAGCGGGCGT GACGACCCGC
GACGCTGCGG CCGGTCTCGG GCTCACGATG GCGCGACAGC GGGCCGCCGC CCACGGGTGG
GAGATCGCAC TCGACGACGG CGCGACCGGA ACGCGTGTCG TCGTCAGCGG TTGTGAAACG
GAGGGACCCG ACGAATGA
 
Protein sequence
MKPFTRRAVV AGTVVAVVLV WSGLVPLSAG AAGVSGLDCQ PGSEGAVFAA DSGLEAVYDG 
ETLDGNPFVD DTTLAFPNVT VSATDTASLR IVAATDDGVC LRSIEPTSAP VRVTPDAGET
VVVRDSLVNL SYGSFRYARS AGGVDLAYNA SAPAAITVED GDLSAGRTVE AVDADSGTQL
TTGTVSADDT VDLQLPAGHR NVDLRYASTQ TATATSTVQP AAATATDTAT PTATDTATQT
ATNATTRTQT ATETATPTAR ETPPDSDSAT GGSSAGDDTA TPTPTPDRTQ TATPTATPAN
ATEFTVPEWT GELVAYEPTP QTQHVGGLLP LTVSLWGLAL WLVVARRGPE TRLLALVLVV
ASVRATSDLT QIVLDGFVGV EAPLATLNLL LEFATAVLFA GFAVQYADIG ERRTRHAKRA
LGVLGAVGAT AVLTNPIHGQ VFTDAAVAAG PFTYVTASVG PVGWLLFALA TGLVAAGGVL
VARTFVVGSP RGAWRPVAVI GTGLAVAIGI AALDVLELGP VTGYDYSATG VNYFLLATTV
SLLGYGFQRL KPSGQRSIVA DLDDAIVILD DAWRVVEWNG AAEEIVPELS TGRSFDAVFS
EPLARPTVDQ TVTREMSLEV DRWVTDSGTD AEPPTDSDDR TDGDSNGTDG QAAGSSERSE
TTEPGTNGTE LDGAAEGHGE PLDTERRHFI VNARAVTTET SDVIGYTVRF ADVTALKRHM
SQLERRNEQL DQFAGAVTQD LRGPLGEARE ETERVRAVLE DADEPEAVDR RALTTALGSI
DAALNRMARL VEDILGLARD RDLQTDPEPI PFDAIVESVW DRFDPKEATL SVEATGEISA
DREHLDRLLA VLVRNAIQHG GEGVTVRVGL DDDGFYVADD GPGIDPSVRD RAFEAGVTTR
DAAAGLGLTM ARQRAAAHGW EIALDDGATG TRVVVSGCET EGPDE