Gene Hmuk_2120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2120 
Symbol 
ID8411658 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp2029330 
End bp2030484 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content66% 
IMG OID645020461 
Producthistidine kinase 
Protein accessionYP_003177940 
Protein GI257388167 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.238176 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.722587 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATTCG AAAACCAGAC GGACTTCGCC AAGCAGGTCG CCGACCTCAA CAAGTACGGA 
CAGGCGCTGA ACCGATGTGA GAGCGTCGAC GAGGTCGTTT CGATGACGCT GGAAGCGATG
TCACTGCTGT TCGACGCCGC CGACAACACG TTCGTGGAAG TTCGAAACGA CGACCTGCAG
GTCGTCCACA GCACGAATCC CGCGTTGTCG GTCGGCGAAG CGCCGACGAG CGTGGCACGG
CGGGCCTACG AGTCCAGAAC GACCGAGGTG GCCAGCGGTG CGGACGCTCG TGCCGCCACC
GACACGGAGA CGACCGCGGC ACTGGCCGTC CCGGCGACGA TCGTCGACGA GGTGACGGCG
GTGCTCGTGA TGCGCTCGAC GAGCCGGTCC GAGTTCGACG ACACCGTCGT GCGCCCGATG
GAGATTCTGG CGTCTCACGC CGCGACGGCG ATCAGCAACA TCCGGTCGCG GGAGCGACTC
GAACGGGCCA GACAGGACCT GGAGACGAAA AAGGAGATGG TCGAACTGTA CGACCGCCTG
TTGCGCCACG ACCTGGGCAA CGACCTGCAG GTGATCACCG GGTTCTCCGA GGTCCTCGCC
GACGAAGCCG ACGGCGAGAC CGCTGCCTAC GCCGAGCGGA TCAACGAGGC CGCACACAGC
TCTGCCGACC TGATCCAGCG GGTCGGGAAC CTCGTCTCGA CGCTGGAAGA AGAAGAGGAA
CCGGAACCGA GAGGCCTCGC GCCGATACTC GAACGGACCG TCAGTGAGGC CGAGACCGGC
TACGGCGAGC TGACCGTCGA GTTCGACAAG GCGGCATTCG AGGAGACGGT GTACGCCGGG
GACCTGCTCG AATCGGTGTT CACGAACATC CTCACGAACG CCGTCGTCCA CAACGAGGGA
GAAGTCACGG TCCGGACGAG CGTCGAGACG GGCGTCGACG ACGTGGTTGT CTGCTTCGCC
GACGACGGAG CGGGCATCGA CCCGTCGGTC CGCGACGAGC TGTTCGAGAT GGGCGAGAAA
GGCCCCGACA GCAGCGGCAG CGGGTTCGGC CTCGGCTTCG TCCGCGCCCT GACCGAGTCG
TACGGCGGCG ACGTGACCGT CACCGAGAGC GATGCCGGCG GCGCGGAGTT CCGCGTTCGG
CTCCAGCGTG GCTGA
 
Protein sequence
MSFENQTDFA KQVADLNKYG QALNRCESVD EVVSMTLEAM SLLFDAADNT FVEVRNDDLQ 
VVHSTNPALS VGEAPTSVAR RAYESRTTEV ASGADARAAT DTETTAALAV PATIVDEVTA
VLVMRSTSRS EFDDTVVRPM EILASHAATA ISNIRSRERL ERARQDLETK KEMVELYDRL
LRHDLGNDLQ VITGFSEVLA DEADGETAAY AERINEAAHS SADLIQRVGN LVSTLEEEEE
PEPRGLAPIL ERTVSEAETG YGELTVEFDK AAFEETVYAG DLLESVFTNI LTNAVVHNEG
EVTVRTSVET GVDDVVVCFA DDGAGIDPSV RDELFEMGEK GPDSSGSGFG LGFVRALTES
YGGDVTVTES DAGGAEFRVR LQRG