Gene Hmuk_2699 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2699 
Symbol 
ID8412250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp2585538 
End bp2586878 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content66% 
IMG OID645021045 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_003178512 
Protein GI257388739 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.112792 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.0892885 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCCG AGACGCTGGC CGAGGACCAC GGCTTCGGTA CGCACGCGGC CAGCGACCCG 
AAGGCTGCTC TCGACTACTT CTCTGACAAC GACGTGGACT GTATCGTGAG CGACTACGAG
ATGCCCGGAA TGGACGGCCT GGAGTTCCTC GATGCGGTCC GGGAACGCGA GTCCGACATC
CCCTTTATCC TCCTCACCGG GCGCGGCGAC GAGGAGGTCG CCAGCGAGGC CATCGCCGCG
GGTGTCACCG ACTACCTGCT GAAGCTCGAG GTCGTCGAAG ACAAGCAGTA CGGCCGACTC
GCGAACAGAA TCCGAAACGT CGTCTCGAAG CGCCGCACCC AGGCCAAGTA CGAGCTGCTG
GTCGACACCT CGCCGGACCT GATCGCACAG GTCGCCGAGG ACGGGACGAT CCTCGCGGCC
AACCCCGCGA TGGCCGAGTT CACCGACATC AGCCGAGCGG AACTCGTCGG CCAGACGCTC
GCTGCGGTCC TCGGGACGAT CGGTGAAGAG CGGACGGCAG TCGGCCAGTC CGTCATCGAG
GACGGCGATC CCACCCACGC AGAAGACCAG ATCGACGGCC GGTCGTTTCA CAACATCTAC
GTCCCGATCG ACGTACACAG CCACCGGCCG TCGTTCCAGC TGGTCTCTCG GGACATCACG
GAGCGCGTCG AACGCGAACG AGAGCTCAAG CGTCAGAACG AGCGCCTCGA AGAGTTCGCC
AGCGTCGTCA GCCACGACAT GCGAAATCCG CTCAACGTGG CCCAGTCCGC ACTCCAGTTG
TTAGAGAAAG ACAGGGGAGA CGACGCCGAG CTGCGGGCGA GACTGACGCG CTCGCTCGAC
CGGATGGAGT CGCTGATCGA CGACGTGTTG ACACTCGCTC GCGAGGGCGA GACCGTCGAC
GACCCCAGCG TCGTCGACCT TGGGACGATC GCCCGCGAGA GCTGGATGGT GACCGACAGC
GGCGAGGCGG AGCTGGTGAT CGACTCGTCG GCCCGGATTC AGGCCGACGA CGGCCGTCTC
GGGAGCATCT TCTCGAACCT CTACGGGAAC GCCGTCGACC ACGGCCGCCC CTCGTCGGAC
GCCGAGAACG CCGAGCCGGT GACCGTCACC GTCAGCGTGA CCGACGACGG GTTCGTCGTC
GCCGACGACG GCTGTGGCAT GGACCTCTCG ACGGACGGCG ACCCCTTCGA GATGGGGTAC
TCAAGCGACA CCGAGGGGAC CGGCATGGGA CTGACCATCG TCGAGCGGGT GGCCGAGGCC
CACGGCTGGT CGGTCGACCT CGGCGAGAGC GAGGACGGCG GTCTGGCCGT CGAGATCAGC
GGCGTCACCT TCGTCGAGTG A
 
Protein sequence
MTAETLAEDH GFGTHAASDP KAALDYFSDN DVDCIVSDYE MPGMDGLEFL DAVRERESDI 
PFILLTGRGD EEVASEAIAA GVTDYLLKLE VVEDKQYGRL ANRIRNVVSK RRTQAKYELL
VDTSPDLIAQ VAEDGTILAA NPAMAEFTDI SRAELVGQTL AAVLGTIGEE RTAVGQSVIE
DGDPTHAEDQ IDGRSFHNIY VPIDVHSHRP SFQLVSRDIT ERVERERELK RQNERLEEFA
SVVSHDMRNP LNVAQSALQL LEKDRGDDAE LRARLTRSLD RMESLIDDVL TLAREGETVD
DPSVVDLGTI ARESWMVTDS GEAELVIDSS ARIQADDGRL GSIFSNLYGN AVDHGRPSSD
AENAEPVTVT VSVTDDGFVV ADDGCGMDLS TDGDPFEMGY SSDTEGTGMG LTIVERVAEA
HGWSVDLGES EDGGLAVEIS GVTFVE