Gene Hlac_1000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1000 
Symbol 
ID7401895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp993873 
End bp995621 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content69% 
IMG OID643708066 
Producthistidine kinase 
Protein accessionYP_002565667 
Protein GI222479430 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.798946 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCGGCC TGAGTCCACT CTCCGTCGCG CTACTCACCG CGGTCGTGAC CGGCACCTCC 
GCGGCGATCC TCGCGTGGCG GGAGCGCCCG GACGCGGGTG CGACGTGGCT CGCGCTGCTG
CTCCTCGGGC AGGTCTGGTG GACGACGTTT CTCGTCTTCG AGTTGGAGGC GTCGACGCTG
GCCGCGAAGG CGCTCTGGTA CGACATTCAG TGGGTCGGGG TGGTGCTGGT TCCGGTAGGG
TGGCTGTTGT TCGCACTGGA GTACACGGGC CGGGACCGGT ACGTGAGACC CGGAGTCGTG
GCGGCGGCCT GCGTCGCGCC GGCGATAACC GTTCTCGTGG TCGCGACGGG TGACCCGGCC
GGACTCATCG TCGCCAACCG TGAGGTCGCC GACACCGCAG TCGGGTCGTT TCTCCGCGTC
GACCCCGGGC CGTGGTACTA CGTCATCGCC GGCTACACCT ACCTTCTGGG GCTGATTGGA
TCGGTCCCGA TCCTCCAGCT CGTCCGTGAC GACGCCCGCC CGTTCCGAGG ACAGAGCGCG
GCGCTGCTCG TGGGCACGGC GGCACCGTGG GTCAGCAGCC TCCTCCACGT TACTGGCGCC
ATCCCGGTTC CCGGACTCGA TCCCACGCCG CTCGCATTCG CCGTCTCCGG GGTCGCGTAC
CTCCTCGCGC TCTCGCGGTT CCGCCTGCTC ACGCTCACGC CCGCGCCGCG ACGGCGGGCG
CGCCAGCTGG TGTTCGAGCA GCTTCACGAT CCCGTGTTCG TCGTCGGTAC CGAGGGCCAC
GTACTCGACT TGAACCGCAG TGCTGCCGAC GTGTTCGAAG TCGACCGGCG GACGGCGGTT
GGAGAGGCGG CGAGTACGGT CATCCCCCGC TACGACTCGC TCAACGGCGA TCGCGGCGAC
ATCGGCCCGC TCTCGATCGT GGGGCGGAAC GGTCGGCAGC CCTACGAGAT CACGGTGCGA
GGCGTGAGCG ACGACCACGG GCGAGCGGTC GGCCGCCTTA TCGTCTTCCA CGACGTGGGC
GAGTACCTCG GCCAGCAGCA GCGGCTGAAC GTGCTCAACA GGGTGTTCCG GCACGACGTT
CGGACCGAGA CGAACCTGAT CCACGGGTAC GCCGATCAGC TGATGACGAA CCCGAGCGAC
GAACGGGCGC TGTCGATCGT CAAGAAGAGC GCCTCGCGTA TCCTCGATCT CAGCGAGCGC
ACCCGGACCG CCAGCGAGCT GTTCGACCCG GTCTCGGAGC CGGAGCCGCC GGTCCCGCTG
TCCGAGGTGG TCGACGAGGC GGTCGCGGAC CTCCGCGCAG AATCTCCCGA CGCACGGGTG
TCGGTCGACG GCGATCTACC GGATGTAAGC GTGCCGGCGA CCCTCCGGGT CGTTGCGTCG
AACCTCTGTT CGAACGCGGT CGAGCACAAC GACGCGGCCG CGCCGTCCGT GTGGCTCGAG
GCGGCCGTCG AGAACGGCTG GGTCGAACTC TCTGTCGCCG ACGACGGGCC GGGGATCGAC
CCGGCGGAGT ACGAGGTCCT GGCGCACGGC ACGGAGACGC CGCTGGAGCA CGGCAGCGGA
ATCGGCCTGT GGATCGTGAA GTGGGGGATC GACCAGGTAG GCGGGAGCGT CTCCTTCGCG
GAGCGGGAGC CCCACGGGAC GATCGTCACG GTCTCCGTTC CCACGGGCGA CGCGGTACCC
ATCAATGGGG CCGGCTCGGA GCACGCCGAG TCCGCGGAAT CGACGGACTC GGCGGGCACA
GGTCAGTAG
 
Protein sequence
MIGLSPLSVA LLTAVVTGTS AAILAWRERP DAGATWLALL LLGQVWWTTF LVFELEASTL 
AAKALWYDIQ WVGVVLVPVG WLLFALEYTG RDRYVRPGVV AAACVAPAIT VLVVATGDPA
GLIVANREVA DTAVGSFLRV DPGPWYYVIA GYTYLLGLIG SVPILQLVRD DARPFRGQSA
ALLVGTAAPW VSSLLHVTGA IPVPGLDPTP LAFAVSGVAY LLALSRFRLL TLTPAPRRRA
RQLVFEQLHD PVFVVGTEGH VLDLNRSAAD VFEVDRRTAV GEAASTVIPR YDSLNGDRGD
IGPLSIVGRN GRQPYEITVR GVSDDHGRAV GRLIVFHDVG EYLGQQQRLN VLNRVFRHDV
RTETNLIHGY ADQLMTNPSD ERALSIVKKS ASRILDLSER TRTASELFDP VSEPEPPVPL
SEVVDEAVAD LRAESPDARV SVDGDLPDVS VPATLRVVAS NLCSNAVEHN DAAAPSVWLE
AAVENGWVEL SVADDGPGID PAEYEVLAHG TETPLEHGSG IGLWIVKWGI DQVGGSVSFA
EREPHGTIVT VSVPTGDAVP INGAGSEHAE SAESTDSAGT GQ