Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_1898 |
Symbol | |
ID | 8411425 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 1807077 |
End bp | 1810205 |
Gene Length | 3129 bp |
Protein Length | 1042 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645020228 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_003177718 |
Protein GI | 257387945 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.755924 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCTC TTCGCGTCCT CCACGTCAGA CCCGCCGACT GGTCCGGTCC CGAGCTCGGT GACGACTTTG CGGTCACCGT CGTCCACGGG ACCGACGGTG CTGACTTCGA GACGGCGAAT CTGGACTGTG CCGTCGTCGA GGCGGCCCTC GGCGACGACG ATGGGATCGA CGCGCTGAGG GCACTGCGCG AGCGGGCTCC ATCGCTGCCG GTCGTCCTCT GTACTGCGGT CGCCGACGGG ACCGTTGGGG CTGCCGCCAC GCGCCACGGC GTGACCGAGT ACGTACCCCG AGACGGCGCG GTTCGCGTCG CCGATCGGGT GCGAGCCGTT GTCGCCGACC ACACGTCCGA CGATGCGTCC GCGCCCGAGT CGACGGGGAG GTCGGCGGCC GCCAGCGCCG GCATCTCGGA ACGCGAGCAG CGCCGGCACG CGCTGACGGC GACCAACGAG GCACTGGAGT CACTCGCCAC GCTTGCGTCG CGAAACGACC TCGAGCAGAC GGAACGGATC CGCCAAGCCT TGGAGATCGG CCGCCAGCGT CTCGGACTCC CGCTCGGATA CTTCACCCGG ATCGACGGCT CGACCCAGCG GATCGAGGTG ACGGCCGGTC GCTCTGATCT GGTCAGTGCG GGAATGTCGG CTCCCCTGTC GAAGACGTAC TGTCGGCGGA CGATCCAGCA GGACGGCCTG CTGACGATCG CAGACGCCGA GCAGGAAGGG TGGAGCGACG ACTCGGCCTT CGAGAGCTTC GGCGTCTCCT GTTATCTCGG CGGGACCGTC GTCGTCGACG GCGAGACCTA CGGTACGCTG TGTTTCGGCG CGACGGATCC CAGAGACCGG TCGTTTACCG ACGCCGAGAA GCGCCTCGTC AACCTGCTCG TCGAGTGGGT CGGCAACGAG ATCGAGCGCA ACCAGCGAGA GACGGATCTC CGGCGATACG CCGACATCAT CGAAGCCGTC GACGACGGCG TTTACGCGCT CGACGACGAG GGCCGCTTTA CCCTGGTCAA CGAGGCGATG ACGGATCTCA CGGGCTACGA TCGCGAGACG CTGCTGGGTT CGCACACGGC CCGCATCAAG GACGACGGGA CCGTCGACCG CGCCCAGTCG ATCCTCGTGG AGATGCTCCG CGGGGTCCGA TCCGACGAGG AGACGTTCGA GCTGTCGATC CAGCGCGCAG ACGGGAGCGC CTTTCCCGCA CAGGATCACA TGACGATTCT GCGCGACGAC GGCTCGTTCG CCGGGACTGC CGGCGTGATC CGCGACGTGA CCGCACAGCG AGCCCGCGAC GAGGCACTGG AGGGGCTATT AGAGACGACC CGATCGCTGA TGGCCGCCCA GACACCGACG GAGGTCGCAG AGATCGTCGC CGGTGCCGCC AGCGAGACGC TGGGGTTCGA ACTCAATCTG GTTCGACTGT ACGATTCCGA CGACGACACG CTCGTCCCGA TCGCCAGAGC CGGCGACAGC GCCGACGAGA TCGATCGGCC GATCCGGGAC AGCGACGAGG GGTATCCCGG CGAGGCCTTC TCCACCGGGG AGACGCTCGT CGTCGACGAC CTGAACGACC GGGCCGGCTA CGACAGCGGG CCAGCGGCGT CGGCGATCTA CCTCCCACTC GGCGAGTACG GGGTCTGTAC GGTCGCCAGC ACCGAGCCAC GCGCCTTCGA CGACAGCGAC CGATCGATCG CCGAGATCCT CGCCTCGAAC GCCGCCGCGG CGTTCGAGCG CGTCGATCGC GAACAGGAGC TGTTGCGCTA CGAGACGGCC GTCGAGAACG TCAACGACAT GCTCTACGTC CTCGACGAGG AAGGCCGATT CCAGCTGGTC ACACAGCCGC TGGCGACGTA CCTCGGCTTC GACCGGTCCG AACTTCTCGG GGCGCGCCCG GAGATCGTCC TCGACGACGC GACCATCGAC CGGTTCGAAG CCGAGATCGG CTCGCTCCGC CGTGGAGACG CCGACCACGC CGACGTACAG ACCGAGCTGT CGACGGCAGA CGGCAACGAT CGACCGGTCG AGATCGAGAT CTCGCTCATC TCCGGCGAGG GGGCGTTCCA GGGAACCGTC GGCGTCGTTC GCGACCGGAC GGAACTCCAG CAGACCCGCG AACGGCTCCA TCAGGAACAG ACTCGCTTTT CCTACCTCTT CGACGCGCTG CCGGATCCGG TCGTGGAGAC GGAGCTGGTC GACGGCGAGC CGGTCGTTCG ATCGGCCAAC CCCGCGTTCG CCGAGACGTT CGATCTCGGC GACTCGTCGC TGTCGGGTCG GACGATCGAT TCCCTGCTGC GGGGGCCGGA CGACTCGGAG TCGACCGATC CCTCGCTGTC TGCGGTCGCC GACGACGAGT CCGTCCAGGG CGAACTCAGG CGCTGGACGG CCGACGGGTT CCGGGACTTT CTCTTCCGGG CGGTCCCCTA TCAGCGGGGC GACGGCCGGC AGTTCGCCTT CGGTATCTAC ACCGACATCA CAGAGCAGCG CGAGCGCCAG CGGCGACTCG AAGTCCTCAA TCGCGTCTTG CGCCACAACC TCCGCAACGA CATGACCGTC ATTCTGGGGA CGGCCGAAGA ACTGGTCGGC CGGGCCGACG ACGAGGAGAA CCGAACGCTC CTCAGGCGAC TCCTTCGAAA GGCCGAGGGC GTCGTCTCGC TGTCGGACCG GGCCCGCGAG ATCGAGGCAG CCGTCCGCCG GGACCCCGCG ACGACGGACT CCGTCTCGGT ACCGGAGATC GTCGAGAGCG TCGTCGACGA ACTCGCGACC GAGCACCCCG AGGCGACGCT CCGGACCGAC TGCGATCCCG TTCCGCCCAT TGCGGACCCG CGACTCCGAA CGGCGCTGTA CGAGACCGTC GAGAACGCCC TGGAACACAA CGAGGTACCG ACCGTCTCCG TCGACGTGAC GGCCGACGCC GAGAGCGTCC GTATCCGGAT CGAGGACGAC GGCAGCGGCA TTCCGGCGGA CGAACTCGCC GTCGTCACCG GGGACGCAGA GATCACGCAA CTGACGCACG GCACTGGCCT GGGCCTGTGG CTCATCACGT GGCTGATCGA GTCCTACGGC GGCACCGTGA CCTTCGAGAA CACCGCCGGG ACGACCGTGA CGCTCCGCGT GCCCCACACG GACCGCTGA
|
Protein sequence | MSALRVLHVR PADWSGPELG DDFAVTVVHG TDGADFETAN LDCAVVEAAL GDDDGIDALR ALRERAPSLP VVLCTAVADG TVGAAATRHG VTEYVPRDGA VRVADRVRAV VADHTSDDAS APESTGRSAA ASAGISEREQ RRHALTATNE ALESLATLAS RNDLEQTERI RQALEIGRQR LGLPLGYFTR IDGSTQRIEV TAGRSDLVSA GMSAPLSKTY CRRTIQQDGL LTIADAEQEG WSDDSAFESF GVSCYLGGTV VVDGETYGTL CFGATDPRDR SFTDAEKRLV NLLVEWVGNE IERNQRETDL RRYADIIEAV DDGVYALDDE GRFTLVNEAM TDLTGYDRET LLGSHTARIK DDGTVDRAQS ILVEMLRGVR SDEETFELSI QRADGSAFPA QDHMTILRDD GSFAGTAGVI RDVTAQRARD EALEGLLETT RSLMAAQTPT EVAEIVAGAA SETLGFELNL VRLYDSDDDT LVPIARAGDS ADEIDRPIRD SDEGYPGEAF STGETLVVDD LNDRAGYDSG PAASAIYLPL GEYGVCTVAS TEPRAFDDSD RSIAEILASN AAAAFERVDR EQELLRYETA VENVNDMLYV LDEEGRFQLV TQPLATYLGF DRSELLGARP EIVLDDATID RFEAEIGSLR RGDADHADVQ TELSTADGND RPVEIEISLI SGEGAFQGTV GVVRDRTELQ QTRERLHQEQ TRFSYLFDAL PDPVVETELV DGEPVVRSAN PAFAETFDLG DSSLSGRTID SLLRGPDDSE STDPSLSAVA DDESVQGELR RWTADGFRDF LFRAVPYQRG DGRQFAFGIY TDITEQRERQ RRLEVLNRVL RHNLRNDMTV ILGTAEELVG RADDEENRTL LRRLLRKAEG VVSLSDRARE IEAAVRRDPA TTDSVSVPEI VESVVDELAT EHPEATLRTD CDPVPPIADP RLRTALYETV ENALEHNEVP TVSVDVTADA ESVRIRIEDD GSGIPADELA VVTGDAEITQ LTHGTGLGLW LITWLIESYG GTVTFENTAG TTVTLRVPHT DR
|
| |