Gene Hlac_0920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0920 
Symbol 
ID7401292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp917584 
End bp919551 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content54% 
IMG OID643707986 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_002565588 
Protein GI222479351 
COG category[T] Signal transduction mechanisms 
COG ID[COG2205] Osmosensitive K+ channel histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.301008 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000022653 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGACCGATT ACCTACAGAA ACGAGGCTCA GAACAGTACG ACCGGCTTGC CACACGCATC 
GGTCACGCAG TTGCCCAGTA CCGAACGGAA CACGAACTGC GAGAACGAGT GAAAGAACTC
ACTGCAATTC AAACGATCAG CGATCTACTC ACCGACAGTG ACGGCCAGCT GGCAGGGCAA
CTCCAGCAAG TCGTTACGTA CCTTTCACAG TCTCTCCAGT TCACAGAAGC GGCAGTTGCT
TCGCTCTCTA TCGATGAGAC CGAGTTCACC TCTCCGGAGT ACGAACCACC AGTTCACCAA
CTCTCAGTCC AAGACGTGAC CACCGCCGGT AATGAACTCA CACTTATAAT TGGCTATACG
ATCGACTCAG TGTCAGAAAC CGATGGTGAT GTGTTTCTCC CCGAAGAACG AGAGCTAATC
ACCACGGTTC TACAGCTCGT CACGGCCTAC CTTGATCGGC GACACGTCCT CTCAGATCTC
CAAGAGGCAG AGCGTCGGCT CAATCTCATT CTCGACAATA CGACCGCAGT GATGTATCTG
AAAGATACTG AGGGTCGATA TGTGTTTGTG AATGCCGAAT ACGAGCGGTT GTTCAATGTG
GACTCCGAGG AGCTCATCGG ACAGTACGAT GCGGATATCC ACCCGCCGGA CATGGCCGAA
ACCGTCCAGG CGAACGATCG TCGTGTCCTC GAAACAGGCG AACCGATTGA GACTGAAGAA
CAAATCACGG TGGATGGTGA CGAGCGAACG TATCTCTCAT TGAAAGTCCC TGTGCTGGAT
GCTGCTGGTG AAGTGGAGGG CGTGTTCGGC GTATCCACAG AAATCACCGA ACGTAAAGAA
CGAGAACGGC AACTCGAAGC ACTCAATCGA GAGATACCAC GGTTATTGTC CGCTGAGACG
ATCGAGGAAG TAGCCGAACT CGGGGTTGTC GCCGCTCATG AGATCTTGGA TCTACAGGCG
AATGCGATTC ACCTGTACGA CGCGGAGAAC GAGGAACTGG CACCTGTTGC ATACACTGAT
GGTGTTCTGG AGCTCATCGG TGACCCCCCG ACGTTTCGTG ACGGACAGAG TATCGCGTGG
CGGGTGTTTG CGGATGGAAA CGCGACAGCG ATTGATGATA TCCAGACTGA CACAGATATC
TACAACCCGG AGTCACCGAT TCGCAGCGAG TTGTACCTCC CGCTGGGTGA GTATGGTATC
TTGATGGCCG GGTCACCAAC GCCCTCGAAG TTCGATTCAC AAGACGTAAC CGTTGGTGAA
CTACTTACCG CCCATCTTAC CACTGCACTC TCCCAGGTCA CCACCGAACA GGAGTTACGT
GAGCGAGAAG CGGAGTTAGA AGCCCAAAAC GAACGGCTCG AACAGTTTGC CAGTATGGTC
TCACACGATT TACGCAATCC GTTATCAGTT GCGTCCGGAA ACTTGGAATT ATATCGGGAA
ACCGGTGAGG AATCCCGTTT AGAAACGATT GATACGGCTC TCACCCGCAT TCAGGAGCTC
ATTACCGATC TTACCTCACT CGCACGTTAT GGCATCCCCG ACGAAAACCA CGAACTAGTG
TCGGTGTCTG AGGTGGCCCG CGACGCATGG GAGTTGATAG ACACGCGGTC AGCAACCCTG
TCAACGGAAC CGTGCACAGT AACTGGCGAC GCAAGCCAGA TCACGACACT CTTCGAAAAT
CTGTTCCGGA ATGCGGTCGG GCACGGTGGC CCAGACGTGA CTGTCCGGGT CGGACCGCTT
GAAAACGGGT TCTACGTTGA AGATACCGGT GACGGTATTC CCCCTGACGA ACGTGACACC
GTGTTTGATC ACGGTTATAC GACAGGCTAC AGCGGAAGCG GTATCGGACT CACAATTGTC
TCACGAATTG CCCAAGCTCA CAGCTGGGAC GTTACCCTTA CAGACAGCAC GGAAGGCGGC
GCACGGTTCG AGTTCCGAGC GACATGTTCA GATGATCCGG ATAACTAA
 
Protein sequence
MTDYLQKRGS EQYDRLATRI GHAVAQYRTE HELRERVKEL TAIQTISDLL TDSDGQLAGQ 
LQQVVTYLSQ SLQFTEAAVA SLSIDETEFT SPEYEPPVHQ LSVQDVTTAG NELTLIIGYT
IDSVSETDGD VFLPEERELI TTVLQLVTAY LDRRHVLSDL QEAERRLNLI LDNTTAVMYL
KDTEGRYVFV NAEYERLFNV DSEELIGQYD ADIHPPDMAE TVQANDRRVL ETGEPIETEE
QITVDGDERT YLSLKVPVLD AAGEVEGVFG VSTEITERKE RERQLEALNR EIPRLLSAET
IEEVAELGVV AAHEILDLQA NAIHLYDAEN EELAPVAYTD GVLELIGDPP TFRDGQSIAW
RVFADGNATA IDDIQTDTDI YNPESPIRSE LYLPLGEYGI LMAGSPTPSK FDSQDVTVGE
LLTAHLTTAL SQVTTEQELR EREAELEAQN ERLEQFASMV SHDLRNPLSV ASGNLELYRE
TGEESRLETI DTALTRIQEL ITDLTSLARY GIPDENHELV SVSEVARDAW ELIDTRSATL
STEPCTVTGD ASQITTLFEN LFRNAVGHGG PDVTVRVGPL ENGFYVEDTG DGIPPDERDT
VFDHGYTTGY SGSGIGLTIV SRIAQAHSWD VTLTDSTEGG ARFEFRATCS DDPDN