Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_0501 |
Symbol | |
ID | 8410001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 469494 |
End bp | 472331 |
Gene Length | 2838 bp |
Protein Length | 945 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645018825 |
Product | histidine kinase |
Protein accession | YP_003176342 |
Protein GI | 257386569 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0931305 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCAT TCACTCGTCG GGCAGTCGTC GCCGGAACGG TCGTCGCGGT GGTCCTCGTC TGGAGCGGAC TCGTCCCGCT ATCGGCCGGT GCAGCCGGCG TCTCAGGCCT CGACTGTCAG CCGGGGAGCG AGGGAGCCGT CTTCGCGGCC GACAGCGGCC TCGAAGCGGT GTACGACGGC GAGACGCTGG ACGGCAATCC GTTCGTCGAC GACACCACGC TCGCGTTCCC GAACGTCACG GTCAGTGCGA CCGACACCGC GTCGCTTCGC ATCGTCGCGG CGACCGACGA CGGCGTCTGC CTGCGCTCTA TCGAACCCAC CAGTGCACCG GTCCGGGTGA CGCCGGACGC CGGTGAGACC GTCGTCGTTC GCGATTCGCT GGTGAACCTG AGCTACGGCT CGTTTCGGTA CGCACGCTCG GCCGGCGGCG TCGATCTGGC CTACAACGCC AGCGCGCCCG CCGCGATCAC GGTCGAGGAC GGCGATCTCT CGGCCGGACG CACCGTCGAG GCCGTCGACG CCGACAGCGG GACGCAACTG ACGACCGGAA CCGTCAGTGC CGACGACACG GTCGACCTCC AGTTGCCGGC GGGACACCGG AACGTGGATC TGCGGTACGC GTCGACACAG ACGGCGACGG CCACGTCGAC CGTCCAGCCG GCCGCGGCGA CAGCGACGGA CACCGCGACA CCGACCGCGA CGGACACTGC GACACAGACC GCGACGAACG CCACGACACG GACACAGACA GCGACGGAGA CCGCGACACC GACCGCGCGG GAGACGCCAC CGGACAGCGA CTCGGCCACG GGCGGTTCGT CGGCTGGCGA CGACACTGCG ACGCCGACTC CGACGCCGGA CAGAACCCAG ACTGCGACTC CGACCGCGAC GCCCGCGAAC GCCACGGAGT TCACCGTCCC GGAGTGGACC GGCGAGCTGG TGGCCTACGA GCCGACGCCA CAGACCCAGC ACGTCGGCGG CCTGCTCCCG TTGACGGTGT CGCTGTGGGG ACTCGCCCTG TGGCTCGTGG TCGCCCGTCG GGGCCCCGAG ACGCGCTTGC TCGCGCTCGT CCTCGTCGTC GCCTCCGTGC GAGCGACGAG CGATCTGACA CAGATCGTGC TCGACGGCTT CGTGGGCGTC GAAGCCCCGC TAGCGACGCT CAACCTCCTG CTGGAGTTCG CGACGGCGGT GCTGTTTGCC GGCTTCGCCG TCCAGTACGC CGACATCGGC GAGCGCAGGA CCCGACACGC GAAACGGGCC CTCGGTGTGC TCGGGGCAGT CGGAGCCACC GCCGTCCTGA CGAACCCCAT CCACGGACAG GTCTTTACCG ACGCGGCCGT CGCGGCCGGC CCCTTCACCT ACGTGACGGC CAGCGTCGGT CCGGTCGGCT GGCTCCTGTT CGCGCTCGCG ACCGGGCTGG TCGCTGCCGG GGGCGTGCTC GTCGCTCGAA CCTTCGTCGT CGGCTCGCCG CGGGGAGCCT GGCGACCGGT TGCCGTCATC GGGACCGGAC TCGCCGTCGC GATCGGCATC GCGGCTCTCG ACGTGCTGGA GCTGGGGCCG GTGACCGGCT ACGACTACAG CGCGACCGGC GTCAACTACT TCCTCCTCGC GACGACCGTC TCGCTGCTCG GCTACGGCTT CCAGCGGCTC AAGCCCAGCG GGCAGCGCTC GATCGTCGCC GACCTCGACG ACGCGATCGT CATCCTCGAC GACGCCTGGC GAGTCGTCGA GTGGAACGGG GCCGCCGAAG AGATCGTCCC GGAGCTGTCG ACCGGCCGCT CGTTCGACGC CGTCTTCTCC GAGCCGCTGG CACGCCCGAC CGTCGACCAG ACGGTCACCC GCGAGATGAG CCTGGAGGTC GATCGATGGG TGACCGACAG CGGAACCGAC GCCGAGCCGC CGACCGACAG TGACGACCGG ACGGACGGAG ACTCGAACGG GACGGACGGT CAGGCGGCGG GATCGAGTGA ACGCAGCGAG ACGACAGAAC CCGGAACGAA CGGGACAGAG CTGGACGGGG CCGCCGAAGG CCACGGCGAG CCCCTCGACA CCGAGCGACG CCACTTCATC GTCAACGCGC GGGCAGTGAC CACCGAGACC AGCGACGTGA TCGGCTACAC GGTCCGGTTC GCCGACGTGA CGGCACTGAA GCGCCACATG TCCCAGCTCG AGCGTCGCAA CGAGCAGCTC GATCAGTTCG CGGGCGCTGT CACGCAGGAC CTCCGCGGCC CGCTGGGCGA GGCACGCGAG GAGACGGAGC GCGTGCGAGC AGTGCTGGAG GACGCCGACG AGCCGGAGGC GGTCGACCGG CGGGCGCTCA CGACCGCGCT CGGTTCGATC GACGCCGCGC TGAACCGGAT GGCCCGGCTC GTCGAAGACA TCCTCGGACT GGCTCGCGAC CGAGACTTAC AGACCGATCC GGAACCGATC CCGTTCGACG CGATCGTCGA GTCGGTCTGG GACCGCTTCG ATCCGAAGGA AGCCACCCTC TCGGTCGAGG CGACCGGCGA GATCAGCGCC GACCGCGAGC ACCTCGATCG GCTCCTCGCC GTGCTGGTCC GGAACGCGAT CCAGCACGGC GGGGAGGGAG TCACCGTCCG CGTCGGCCTC GACGACGACG GGTTCTACGT GGCCGACGAC GGCCCGGGCA TCGATCCGTC GGTCCGAGAC CGCGCGTTCG AAGCGGGCGT GACGACCCGC GACGCTGCGG CCGGTCTCGG GCTCACGATG GCGCGACAGC GGGCCGCCGC CCACGGGTGG GAGATCGCAC TCGACGACGG CGCGACCGGA ACGCGTGTCG TCGTCAGCGG TTGTGAAACG GAGGGACCCG ACGAATGA
|
Protein sequence | MKPFTRRAVV AGTVVAVVLV WSGLVPLSAG AAGVSGLDCQ PGSEGAVFAA DSGLEAVYDG ETLDGNPFVD DTTLAFPNVT VSATDTASLR IVAATDDGVC LRSIEPTSAP VRVTPDAGET VVVRDSLVNL SYGSFRYARS AGGVDLAYNA SAPAAITVED GDLSAGRTVE AVDADSGTQL TTGTVSADDT VDLQLPAGHR NVDLRYASTQ TATATSTVQP AAATATDTAT PTATDTATQT ATNATTRTQT ATETATPTAR ETPPDSDSAT GGSSAGDDTA TPTPTPDRTQ TATPTATPAN ATEFTVPEWT GELVAYEPTP QTQHVGGLLP LTVSLWGLAL WLVVARRGPE TRLLALVLVV ASVRATSDLT QIVLDGFVGV EAPLATLNLL LEFATAVLFA GFAVQYADIG ERRTRHAKRA LGVLGAVGAT AVLTNPIHGQ VFTDAAVAAG PFTYVTASVG PVGWLLFALA TGLVAAGGVL VARTFVVGSP RGAWRPVAVI GTGLAVAIGI AALDVLELGP VTGYDYSATG VNYFLLATTV SLLGYGFQRL KPSGQRSIVA DLDDAIVILD DAWRVVEWNG AAEEIVPELS TGRSFDAVFS EPLARPTVDQ TVTREMSLEV DRWVTDSGTD AEPPTDSDDR TDGDSNGTDG QAAGSSERSE TTEPGTNGTE LDGAAEGHGE PLDTERRHFI VNARAVTTET SDVIGYTVRF ADVTALKRHM SQLERRNEQL DQFAGAVTQD LRGPLGEARE ETERVRAVLE DADEPEAVDR RALTTALGSI DAALNRMARL VEDILGLARD RDLQTDPEPI PFDAIVESVW DRFDPKEATL SVEATGEISA DREHLDRLLA VLVRNAIQHG GEGVTVRVGL DDDGFYVADD GPGIDPSVRD RAFEAGVTTR DAAAGLGLTM ARQRAAAHGW EIALDDGATG TRVVVSGCET EGPDE
|
| |