Gene Hlac_2051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2051 
Symbol 
ID7402070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2041216 
End bp2043462 
Gene Length2247 bp 
Protein Length748 aa 
Translation table11 
GC content68% 
IMG OID643709122 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_002566699 
Protein GI222480462 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.167205 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCCGG TATCGATGGT GTGGCAACCG ACGCCGTACA CGGTTCCGCT GCTCGTGGCA 
GCGCTCGCGT CGTTCTCTTT CGCCGTGTAC GCTGTCCGGA ACCGGTCGCG GGGAGAGATA
CCGCTCGTAC GGAGCTTCGT TGGTGTCACG GTCAGTAGCG GGGTCTGGTC GCTGGCGTAC
GCGGCGCAGC TGTCGGCGAC GACGCTCGAA GCGACCCTGC TGTGGAACCG CCTCGTCTGG
GTCGGAGTCG CAGCGCTCAC CGTCGCCTGG CCCGTGTTCG TGTTCGTCTA CGTCGACTGG
ACGGCGTGGA TTCGACCGCG CCGCATCGCC CTGCTGTGTC TCGTCCCCGC GACCGCCGTC
GGCGGCATCT TCGTCGTCGG CGCCGACCCG ATTTTTTACA CGTCTCCGTC GCTGTCCGAT
ACGAACGGGT TCCTCGTGAT GGAGTACCTG CCGACGCCCG CGCTCTTGGG GTTCATGACG
TACGCTTACG CCGTGAACCT GTTTACCTTC GCCGTGCTCG GCTACGCCGC GCTCGCCCGC
GACGGCGTGT TCCGTCGGCA GGCCGCGCTG CTGCTCGCCG CCGGTGTCGC ACCGATGACC
GTCGGCGTCA TCGGGATCTG GGGGCTGATC GGCCCCCGGT TCGTCGACCT CACGCCGGTC
ACGTTCGCGG CCACCTCCGG TCTGCTCGGC TGGGTCGTAT TCCGCTACCG GCTCCTCGAC
CTCTCGCCCA TCGCTCGCGA CGCGGTGTTC GCAAACCTTT CCGATGGTAT CGTAGTCGTC
GACGACGCCG GTCGCGTCGT CGACCTGAAC CGGCCGGCTC GTCGACTGTT CCCGTCCGCG
GCGATCGGCT GCGGCGTCGA CGAGGCGTTC GAGCGCGCCC CGGCGGTCAC GGATCTCGTG
TCGCGCGACG GGACTGTCGA CGGGGCCGGG TCGCGTGGCA CCCCCGGAGC GGACCCGGAC
GACGATCTCC GGGTGACCGT CGACGGTGGG TCAGATCCCC GCTTTCTGAC GGTCGCCGCC
CACTCGATCG CCGGGGGCGG GTCCGGGGCC GGGGTCGAAT CCGAGTTCGA GAGCCCGCCC
GAATTCGGGT CCACCTCCGC GGCGGGCGGA ACCGTCCTCC TGTTTCGCGA TGTCACCGAG
CGCGAGACGC TCCAGCGCCG TTACCGGGCG CTCATCGAGA AGTCACCGAA CATCATCGCC
GTCTGCGGAA CCGACGGCCT GCTCCGGTAC GTGAGCCCGT CGATAGAGCG TCTGCTTGGC
CACTCGCCTG CGGAGATCGA GGGGCGGCCC GTGATCGACT TGGTCCACCC CGACGACCGT
CGGGAGGCGC AACGCGCGTT CGAGTGCGCG TTCGAGACCG GCGAGCCGCA GGCGATCGAC
CACCGGATAG CCCACGCAGA CGGAAACTGG CGCCAGTTCG ACACGAGGGT CGAGCGCCTG
TTCGAGGACA CCGAAGAGGT GGTGATAACC GCCACCGATG TGACCGAGAT CCGACGGTCC
GAACAGCGGC TACAGGTCCT CAATCGGGTT CTCAGACACG ATCTGAAGAA CGACACGAAC
GTTATCGGCG GATACGCGAA CCTCCTCCGA AACCATGTCG ACGAGGAGGG CGACGACTAC
CTCGACATCA TCGATCGGAA GGTCGAGACG CTGACACACC TCAGCGACCA AGCCCGCGAG
ATCGACGTCG CGCTCCACAG GGACGGCGCG CGGGCGGAGA TCGACCTGTC GGAACTGGTC
ACGCGACTCT GCGAGTCGCT CGAGTCGTCG TTCCCGCGGG CGACGGTGAC GGTGTCGACG
CCCGGCGCGG CCGTGGTGTG TGCCGACGAG CTGTTGGAGT CCGCGGTCCG GAACGTGTTG
GAGAACGCGG TCGTCCACAA CGACGGGGAC CAACCGATCG TCGAGGTGGC CGTCGATCCG
GACGGCGAGG GGTACCGCAT CGCCGTCGCG GACAACGGCC CCGGGATCCC CGCGGTCGAG
CGGAGCGTGT TCTCGGAGGC CCGCGAGACG GCGCTCGAAC ACGCGAGCGG GCTGGGGCTC
TGGCTCGTCC ACTGGATCGT CACCGAGTCC GGCGGCGAAC TGGAGATCCA CACGCGAGAG
CCGACGGGAA CGCTGATCAC GATGTGGCTT CCGACGGCAA ACGGAGAGGC GGCGAAGGCT
AGGGAGGCGG TGGCCAATGG CGAGGCAGCG GCCGACGCCG AGGCGGCGTC GGCGAGCAGC
GATCCGAGCC CCGACCCCGC GCCGTGA
 
Protein sequence
MHPVSMVWQP TPYTVPLLVA ALASFSFAVY AVRNRSRGEI PLVRSFVGVT VSSGVWSLAY 
AAQLSATTLE ATLLWNRLVW VGVAALTVAW PVFVFVYVDW TAWIRPRRIA LLCLVPATAV
GGIFVVGADP IFYTSPSLSD TNGFLVMEYL PTPALLGFMT YAYAVNLFTF AVLGYAALAR
DGVFRRQAAL LLAAGVAPMT VGVIGIWGLI GPRFVDLTPV TFAATSGLLG WVVFRYRLLD
LSPIARDAVF ANLSDGIVVV DDAGRVVDLN RPARRLFPSA AIGCGVDEAF ERAPAVTDLV
SRDGTVDGAG SRGTPGADPD DDLRVTVDGG SDPRFLTVAA HSIAGGGSGA GVESEFESPP
EFGSTSAAGG TVLLFRDVTE RETLQRRYRA LIEKSPNIIA VCGTDGLLRY VSPSIERLLG
HSPAEIEGRP VIDLVHPDDR REAQRAFECA FETGEPQAID HRIAHADGNW RQFDTRVERL
FEDTEEVVIT ATDVTEIRRS EQRLQVLNRV LRHDLKNDTN VIGGYANLLR NHVDEEGDDY
LDIIDRKVET LTHLSDQARE IDVALHRDGA RAEIDLSELV TRLCESLESS FPRATVTVST
PGAAVVCADE LLESAVRNVL ENAVVHNDGD QPIVEVAVDP DGEGYRIAVA DNGPGIPAVE
RSVFSEARET ALEHASGLGL WLVHWIVTES GGELEIHTRE PTGTLITMWL PTANGEAAKA
REAVANGEAA ADAEAASASS DPSPDPAP