Gene Hmuk_1898 details


Gene Information

Locus tag: Hmuk_1898
Symbol:
ID: 8411425
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 1807077
End bp: 1810205
Gene Length: 3129 bp
Protein Length: 1042 aa
Translation table: 11
GC content: 68%
IMG OID: 645020228
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_003177718
Protein GI: 257387945
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
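
The start, end, gene length, and protein length fields above are mutually consistent. A minimal sketch of that check follows. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(1807077, 1810205)
print(length)                  # 3129
print(protein_length(length))  # 1042
```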


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.755924
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGCTC TTCGCGTCCT CCACGTCAGA CCCGCCGACT GGTCCGGTCC CGAGCTCGGT 
GACGACTTTG CGGTCACCGT CGTCCACGGG ACCGACGGTG CTGACTTCGA GACGGCGAAT
CTGGACTGTG CCGTCGTCGA GGCGGCCCTC GGCGACGACG ATGGGATCGA CGCGCTGAGG
GCACTGCGCG AGCGGGCTCC ATCGCTGCCG GTCGTCCTCT GTACTGCGGT CGCCGACGGG
ACCGTTGGGG CTGCCGCCAC GCGCCACGGC GTGACCGAGT ACGTACCCCG AGACGGCGCG
GTTCGCGTCG CCGATCGGGT GCGAGCCGTT GTCGCCGACC ACACGTCCGA CGATGCGTCC
GCGCCCGAGT CGACGGGGAG GTCGGCGGCC GCCAGCGCCG GCATCTCGGA ACGCGAGCAG
CGCCGGCACG CGCTGACGGC GACCAACGAG GCACTGGAGT CACTCGCCAC GCTTGCGTCG
CGAAACGACC TCGAGCAGAC GGAACGGATC CGCCAAGCCT TGGAGATCGG CCGCCAGCGT
CTCGGACTCC CGCTCGGATA CTTCACCCGG ATCGACGGCT CGACCCAGCG GATCGAGGTG
ACGGCCGGTC GCTCTGATCT GGTCAGTGCG GGAATGTCGG CTCCCCTGTC GAAGACGTAC
TGTCGGCGGA CGATCCAGCA GGACGGCCTG CTGACGATCG CAGACGCCGA GCAGGAAGGG
TGGAGCGACG ACTCGGCCTT CGAGAGCTTC GGCGTCTCCT GTTATCTCGG CGGGACCGTC
GTCGTCGACG GCGAGACCTA CGGTACGCTG TGTTTCGGCG CGACGGATCC CAGAGACCGG
TCGTTTACCG ACGCCGAGAA GCGCCTCGTC AACCTGCTCG TCGAGTGGGT CGGCAACGAG
ATCGAGCGCA ACCAGCGAGA GACGGATCTC CGGCGATACG CCGACATCAT CGAAGCCGTC
GACGACGGCG TTTACGCGCT CGACGACGAG GGCCGCTTTA CCCTGGTCAA CGAGGCGATG
ACGGATCTCA CGGGCTACGA TCGCGAGACG CTGCTGGGTT CGCACACGGC CCGCATCAAG
GACGACGGGA CCGTCGACCG CGCCCAGTCG ATCCTCGTGG AGATGCTCCG CGGGGTCCGA
TCCGACGAGG AGACGTTCGA GCTGTCGATC CAGCGCGCAG ACGGGAGCGC CTTTCCCGCA
CAGGATCACA TGACGATTCT GCGCGACGAC GGCTCGTTCG CCGGGACTGC CGGCGTGATC
CGCGACGTGA CCGCACAGCG AGCCCGCGAC GAGGCACTGG AGGGGCTATT AGAGACGACC
CGATCGCTGA TGGCCGCCCA GACACCGACG GAGGTCGCAG AGATCGTCGC CGGTGCCGCC
AGCGAGACGC TGGGGTTCGA ACTCAATCTG GTTCGACTGT ACGATTCCGA CGACGACACG
CTCGTCCCGA TCGCCAGAGC CGGCGACAGC GCCGACGAGA TCGATCGGCC GATCCGGGAC
AGCGACGAGG GGTATCCCGG CGAGGCCTTC TCCACCGGGG AGACGCTCGT CGTCGACGAC
CTGAACGACC GGGCCGGCTA CGACAGCGGG CCAGCGGCGT CGGCGATCTA CCTCCCACTC
GGCGAGTACG GGGTCTGTAC GGTCGCCAGC ACCGAGCCAC GCGCCTTCGA CGACAGCGAC
CGATCGATCG CCGAGATCCT CGCCTCGAAC GCCGCCGCGG CGTTCGAGCG CGTCGATCGC
GAACAGGAGC TGTTGCGCTA CGAGACGGCC GTCGAGAACG TCAACGACAT GCTCTACGTC
CTCGACGAGG AAGGCCGATT CCAGCTGGTC ACACAGCCGC TGGCGACGTA CCTCGGCTTC
GACCGGTCCG AACTTCTCGG GGCGCGCCCG GAGATCGTCC TCGACGACGC GACCATCGAC
CGGTTCGAAG CCGAGATCGG CTCGCTCCGC CGTGGAGACG CCGACCACGC CGACGTACAG
ACCGAGCTGT CGACGGCAGA CGGCAACGAT CGACCGGTCG AGATCGAGAT CTCGCTCATC
TCCGGCGAGG GGGCGTTCCA GGGAACCGTC GGCGTCGTTC GCGACCGGAC GGAACTCCAG
CAGACCCGCG AACGGCTCCA TCAGGAACAG ACTCGCTTTT CCTACCTCTT CGACGCGCTG
CCGGATCCGG TCGTGGAGAC GGAGCTGGTC GACGGCGAGC CGGTCGTTCG ATCGGCCAAC
CCCGCGTTCG CCGAGACGTT CGATCTCGGC GACTCGTCGC TGTCGGGTCG GACGATCGAT
TCCCTGCTGC GGGGGCCGGA CGACTCGGAG TCGACCGATC CCTCGCTGTC TGCGGTCGCC
GACGACGAGT CCGTCCAGGG CGAACTCAGG CGCTGGACGG CCGACGGGTT CCGGGACTTT
CTCTTCCGGG CGGTCCCCTA TCAGCGGGGC GACGGCCGGC AGTTCGCCTT CGGTATCTAC
ACCGACATCA CAGAGCAGCG CGAGCGCCAG CGGCGACTCG AAGTCCTCAA TCGCGTCTTG
CGCCACAACC TCCGCAACGA CATGACCGTC ATTCTGGGGA CGGCCGAAGA ACTGGTCGGC
CGGGCCGACG ACGAGGAGAA CCGAACGCTC CTCAGGCGAC TCCTTCGAAA GGCCGAGGGC
GTCGTCTCGC TGTCGGACCG GGCCCGCGAG ATCGAGGCAG CCGTCCGCCG GGACCCCGCG
ACGACGGACT CCGTCTCGGT ACCGGAGATC GTCGAGAGCG TCGTCGACGA ACTCGCGACC
GAGCACCCCG AGGCGACGCT CCGGACCGAC TGCGATCCCG TTCCGCCCAT TGCGGACCCG
CGACTCCGAA CGGCGCTGTA CGAGACCGTC GAGAACGCCC TGGAACACAA CGAGGTACCG
ACCGTCTCCG TCGACGTGAC GGCCGACGCC GAGAGCGTCC GTATCCGGAT CGAGGACGAC
GGCAGCGGCA TTCCGGCGGA CGAACTCGCC GTCGTCACCG GGGACGCAGA GATCACGCAA
CTGACGCACG GCACTGGCCT GGGCCTGTGG CTCATCACGT GGCTGATCGA GTCCTACGGC
GGCACCGTGA CCTTCGAGAA CACCGCCGGG ACGACCGTGA CGCTCCGCGT GCCCCACACG
GACCGCTGA
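
The gene sequence is printed in space-separated blocks of ten bases, so the layout has to be stripped before any computation. The sketch below shows one way to do that and to compute a GC percentage of the kind reported in the GC content field. The helper names are hypothetical, and how the database itself rounds the value is not stated in this record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in the sequence (0.0 for empty input)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input in the same block layout as the record
print(gc_content("ATGC GCGC"))  # 75.0
```

Pasting the full gene sequence into `gc_content` gives a value that can be compared against the GC content field.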
 
Protein sequence
MSALRVLHVR PADWSGPELG DDFAVTVVHG TDGADFETAN LDCAVVEAAL GDDDGIDALR 
ALRERAPSLP VVLCTAVADG TVGAAATRHG VTEYVPRDGA VRVADRVRAV VADHTSDDAS
APESTGRSAA ASAGISEREQ RRHALTATNE ALESLATLAS RNDLEQTERI RQALEIGRQR
LGLPLGYFTR IDGSTQRIEV TAGRSDLVSA GMSAPLSKTY CRRTIQQDGL LTIADAEQEG
WSDDSAFESF GVSCYLGGTV VVDGETYGTL CFGATDPRDR SFTDAEKRLV NLLVEWVGNE
IERNQRETDL RRYADIIEAV DDGVYALDDE GRFTLVNEAM TDLTGYDRET LLGSHTARIK
DDGTVDRAQS ILVEMLRGVR SDEETFELSI QRADGSAFPA QDHMTILRDD GSFAGTAGVI
RDVTAQRARD EALEGLLETT RSLMAAQTPT EVAEIVAGAA SETLGFELNL VRLYDSDDDT
LVPIARAGDS ADEIDRPIRD SDEGYPGEAF STGETLVVDD LNDRAGYDSG PAASAIYLPL
GEYGVCTVAS TEPRAFDDSD RSIAEILASN AAAAFERVDR EQELLRYETA VENVNDMLYV
LDEEGRFQLV TQPLATYLGF DRSELLGARP EIVLDDATID RFEAEIGSLR RGDADHADVQ
TELSTADGND RPVEIEISLI SGEGAFQGTV GVVRDRTELQ QTRERLHQEQ TRFSYLFDAL
PDPVVETELV DGEPVVRSAN PAFAETFDLG DSSLSGRTID SLLRGPDDSE STDPSLSAVA
DDESVQGELR RWTADGFRDF LFRAVPYQRG DGRQFAFGIY TDITEQRERQ RRLEVLNRVL
RHNLRNDMTV ILGTAEELVG RADDEENRTL LRRLLRKAEG VVSLSDRARE IEAAVRRDPA
TTDSVSVPEI VESVVDELAT EHPEATLRTD CDPVPPIADP RLRTALYETV ENALEHNEVP
TVSVDVTADA ESVRIRIEDD GSGIPADELA VVTGDAEITQ LTHGTGLGLW LITWLIESYG
GTVTFENTAG TTVTLRVPHT DR
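
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand rather than relying on a library. It uses the amino-acid assignments of the standard genetic code, which translation table 11 shares; table 11 differs in which codons may serve as alternative starts, and that is ignored here because this gene begins with ATG. The function and constant names are illustrative.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG ('*' = stop)
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(raw_dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(raw_dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence
print(translate("ATGTCCGCTC TTCGCGTCCT CCACGTCAGA"))  # MSALRVLHVR
```

The output matches the first block of the protein sequence, and translating the final nine bases (GACCGCTGA) gives DR followed by the stop codon, matching the last two residues.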