Gene Hlac_2596 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2596 
Symbol 
ID7399822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2574736 
End bp2575812 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content71% 
IMG OID643709669 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_002567238 
Protein GI222481001 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.834028 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGCG ACGAGGGATC CGGATCGGGA TACGCCGAGT CGATCGTCGC CGCGCTCGGC 
GACGGCGCGT ACGTCCTCGA CGCGGACCGG AACCGCGTGT TCGTCAACGA CCGGCTCCGG
GAGGTGACGG GGTTCTCCGA CGAGGTGCTC CACGGGAAAC ACCCGGAGCG TATCGTCGCC
GAGGGGTACT GGGACGAAGC GACCGGCGAG CGCTTCCGCG TCGCCACGGA GCGCGTGCTC
GCGGGCGAGT CCGACGACGA GCGCGTTCAG TTGACGACGA CGCTCAGCGA CGGGTCGACG
GTCACGACCG AGACGCGGCT CACGCCGCTG ACCGAAGAGG CCGCCGGCGG TTCGGAAGGT
CCCACCGAGG TGGTCGGCCT CGTCGGTGTG ATCCGCGACG TGACAGAGCG CGTCGAGCGA
GAAAACGAAC TCGCACGGCT CAACGAGCGA CTCGAACGCC TTTCCGGGTT CCTCTCGCAC
GACCTCAAGA ACCCCCTCGC CGTCGCGCGC GGGTATCTGG ATGTGGCCCG TGATACCGGG
GACCTCGACC GTCTCGACCC CGCAGATGAC GCGCTCGACC GGATCGAGAC GCTGATCGAC
GAGGCGCTCG TGATGGCCAG AGAGCCCGCG GTGATCGAGG TCGATCTCGC TCCGATCGAC
CTCGACGCGC TCGCGACCGA CTGCTGGGAG TCCGGTGACT TCGGCGATCC GCCGGCTGAC
GCGACGCTCG TCGTCGAGGA GGTCGGTCCG ATCGCGGCGG ACCGCGACCT GCTCCGGCGG
GCGATCGGGA ACCTGCTCGG GAACGCGTTC GACCACGCCG GCGACGCGCC CGCGGTTCGG
GTCGGCGTCG ACGACCGCGG GATCTACGTC GCTGACGACG GGCCGGGACT CGCCGCCGAC
GAGCGCAGCG AGCACGGCGA TGTGACCGAG TTCGGCGTCT CGAACGACGG CGGGACGGGA
ATCGGGCTCG CCATCGTCGA GCGCGTCGCG GCCGCGCACG GCTGGACGCT GGAGATCGGC
GAGTCGGCCG ACGGCGGCTT CGAGGCGCGG CTCGTCGGCG CGGAACCGAC GGGATAA
 
Protein sequence
MNRDEGSGSG YAESIVAALG DGAYVLDADR NRVFVNDRLR EVTGFSDEVL HGKHPERIVA 
EGYWDEATGE RFRVATERVL AGESDDERVQ LTTTLSDGST VTTETRLTPL TEEAAGGSEG
PTEVVGLVGV IRDVTERVER ENELARLNER LERLSGFLSH DLKNPLAVAR GYLDVARDTG
DLDRLDPADD ALDRIETLID EALVMAREPA VIEVDLAPID LDALATDCWE SGDFGDPPAD
ATLVVEEVGP IAADRDLLRR AIGNLLGNAF DHAGDAPAVR VGVDDRGIYV ADDGPGLAAD
ERSEHGDVTE FGVSNDGGTG IGLAIVERVA AAHGWTLEIG ESADGGFEAR LVGAEPTG