Gene Hlac_2158 details


Gene Information

Locus tag: Hlac_2158
Symbol:
ID: 7401091
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2146071
End bp: 2147246
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 71%
IMG OID: 643709228
Product: histidine kinase
Protein accession: YP_002566805
Protein GI: 222480568
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
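
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are inclusive 1-based coordinates (the record does not state the convention) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2146071
END_BP = 2147246


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return abs(end - start) + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1176 391
```

Under these assumptions the computed values agree with the listed gene length (1176 bp) and protein length (391 aa).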


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.249399
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.778679
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGGAC GGTACGACAC GCGGGACACG AGGGAGGGCC GACTGACGGA ACTGCTCGTG 
ACGCTCGGGG TCACCGAGCC GGACCACGTC TCCGTGGAGC CGGAGGGCGG GATCGCCCGC
GTGGCCGCCG CGCTGTTCGT CTCGGTTTCG GGATTCGCCC TGCTGATCCC GAACGTGACC
CCGCTGGTCT CGGGAGGCGA ACCCCCGATC GGGATCGCCC TCTCGGTGCT GGGGACCGTG
CTTTCGGTCG CGCTGGTGCT CGTCGGCGGG CTCTTATACA GGAGCACGTT CACAACGCGC
AACGCGGTCC GCATCGCCGT CTGGAGCCTC TTCGGGATCG TCGTGCTCGG CGGGATCATG
CTCGGGATCA TCCGGTATCA GGCGCAGCTC GGGACGCCGA TGGTCGCGCC CACCTTCACG
GTCGCGAAGG TGCTCTCGAT CGGCGCAGTC GCACACGTGA TCATCGGGGT CTACGACGCC
CGACGGGTCC GCGCCCAGCA GCTCGCCAAT GAGCAGCGCC GAGTCTCCGT GCTGAACCGG
ATGATCAGGC ACAACCTCCG AAACGAGACG ACGGTGCTCG GCGGGCACGC GTCGCTGATC
GCCGAGCGCG TCGACGATCC GGAACTCCGC GACTCGGCTG AGGCGGTGGC GAAGAGTGCC
GCAGTGATCG GCGGGCTCGC TAAGGATGCG AACCGGCTAC AGGCTGCCTT CGAACGTCGC
CCCGACGCGC GTGGGGCGGT CCCGGTCGAG CCGCTGATCG AGGACGCCGC CGAGGCCGCC
CGCGAGGCCG GCGCCGAGAC CGTCGCGGTC GATGTCGAGC CCTGCGCGGC GCTGGCCGAC
GACCGGCTCG CGACCGCGGT CGAGGAGCTC GCGACGAACG CGCCCGAACA CGGCGCGACC
GCGGTCGAGC TATCGGCTCG CGCGGTCGAC GGCGGCGTCG AAATCGCCGT CACCGACGAC
GGTCCCGGGA TCCACGAGTC CGAGAGCCGC GTGATCACCG GTCGCGACCC GGAGACGCAA
CTGGAGCACG CGAGCGGGCT CGGGCTTTGG ATCGTCCGGG CGACCGCCGA CGCGTTCGAC
GGGTGGCTGT CGATCGAGAC GAGGCGCGCG GGCGACGAGG GCGCGAGCGA CGACGAGACC
GGCACGACGG TGACGATCGG GGTTCCGGCA CCGTAG
 
Protein sequence
MTGRYDTRDT REGRLTELLV TLGVTEPDHV SVEPEGGIAR VAAALFVSVS GFALLIPNVT 
PLVSGGEPPI GIALSVLGTV LSVALVLVGG LLYRSTFTTR NAVRIAVWSL FGIVVLGGIM
LGIIRYQAQL GTPMVAPTFT VAKVLSIGAV AHVIIGVYDA RRVRAQQLAN EQRRVSVLNR
MIRHNLRNET TVLGGHASLI AERVDDPELR DSAEAVAKSA AVIGGLAKDA NRLQAAFERR
PDARGAVPVE PLIEDAAEAA REAGAETVAV DVEPCAALAD DRLATAVEEL ATNAPEHGAT
AVELSARAVD GGVEIAVTDD GPGIHESESR VITGRDPETQ LEHASGLGLW IVRATADAFD
GWLSIETRRA GDEGASDDET GTTVTIGVPA P
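
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one way to strip the block formatting, compute a GC fraction, and translate a coding sequence. It assumes the amino acid assignments of NCBI translation table 11, which are the same as the standard code; the table's alternative start codons are not handled, which does not matter here because this gene begins with ATG. All names (`clean`, `gc_fraction`, `translate`) are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (table 11 assignments match table 1).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
first_30 = "ATGACTGGAC GGTACGACAC GCGGGACACG"
print(translate(first_30))  # MTGRYDTRDT
```

The translated fragment matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein sequence fields to be checked in the same way.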