Gene Hlac_1800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1800 
Symbol 
ID7399673 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1814739 
End bp1816505 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content66% 
IMG OID643708866 
Producthistidine kinase 
Protein accessionYP_002566449 
Protein GI222480212 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.89662 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATACCG TCTTTCTCGC GCACGTTTCC GTCTTCGCGC TCTCGGCGCT CGCATGCTTC 
GCGGCCGTCC CTCGTGCTAT GGACGTCGGA CACCCCGAAA CCCGCGAAGG GCTGGTTGGC
CTCCTCCTCA CGGTCGCCGT CTGGTCGAGT GGGTACGTCG GGTACCTCGT CGCGCCGGGC
GAGTCGCTCA AACTGGCGTT TTACATCGTC GGATTCGTCT TCGCGCTGCT CGCGGTCGGC
GCGTGGCTCT ACTTCTGTGC CGCTTACACC GGCCGCTCCC CGCGTCACGC ACCCTACAGG
TGGCTTGTCG TCGGCGTGTT CGCGGCGATA AGTCTGCTGA AGGTCACCAA CCCGCTCCAC
GAGCTCTACT TCACGACGGC GTGGGTCACC GAGCCGTTTC CGCACCTCGC GATCCGGCAC
GAGCTGCTCT ACTGGGCGGT CCTCGGGCTC TCGTACGCCA TCATCGCGGT CGGCTTCTTC
ATGTTGATCG AACAGTTCTA CTACGCCGGC GCCGACAGTC GACCGCTCGT CGCGCTGATC
TCCATCACGG CGATTCCGAC GGTCGCGACC GTCTTCGGCG GAGAGCTGTC GTGGCTCCTC
CCGTTGATGT ACGAGCCGCC CGGTGTGTCG CTTTTCGCCG TCGGTACGCT GTTCGTCTAC
CTCCGGCGAT TCGAGGCGAT CCGGTTCGCG GCCGAGAGCA ACGACGCGAC GATTTTCCTC
GATAAGTCCA GGCGAATCCG CGATTACAAC CGGGCCGCGC GCCTGCTGTT CCCGTCGCTC
CACGGAGCGA TCGGCGAGCC CCTCGACGCC GTGCTCCCGC GACTCGCCGT CGGCAGTGAT
GCAGACGGAG TTATCGCGGT GGAGGGTGTG ACGCTGGACG AGGACGACGC GTCGATCCAG
ACGCAGGAGC CGACCCGGAG CCCCCGGTTC TTCCACATCT CCCGAAACGA ATTCACCACC
GGCGGCGTGA CATTTGGCGA GCTGCTCACG ATCGATGACG TCACCGATCG GGAGCGGTAC
CGGATCCAGC TAGAGGAGCG GACCGAACAG CTGGAGGCGC TGAACCGGGT GGTCCGCCAC
GACATCCGCA ACGACATGGC CGTGATCCAC GGCTGGAGCG AGACCCTCCG CGACCACGTC
GACAAGGAGG GACAGGATGC GCTCGACCGC GTGCTCCGGA AGTCCACGCA CGTCATCGAG
CTCACCGAGA CGGCCCGTGA CTTCGTCGAT TCGCTCACCG GAGACGCTGT GCTCGAAGCG
AAGCCGACCG ATCTGGGGGA CGTGTTGACG ACCGAGATAC AGGCCGCGCG GGACTCCTTC
CCGAATGCGA CGTTCCGCGT GCCCGCGGAC CCGCCCCGGG ATACCGTGCT CGCGAACGAG
ATGCTCGCGT CGGTGTTCCG GAACCTCCTG AACAACGCTG TCCAGCACAA CGACACCGAC
GCCCCGGAGG TGACGGTCAC CTGCGAGGAG ACCGACGATC GGATCCGCGT CCGGGTCGCC
GATAACGGGC CCGGCGTTCC CGACGCGGAG AAAGACGAGA TATTCGGCAA AGGCGAGAAG
GGTCTCGACA GCGCCGGCTC CGGGATCGGA CTCTACCTCG TGTCCGTCCT GACCCGGCAG
TTCGGAGGCG ACGTGTGGGT CGAGGACAAC GAGCCGCGCG GCGCCGTCTT CGTCGTCGAG
CTCTTGAGAG CCGAGGGCAA CGCGGCCGAC GACCAGCCAG CGGAGTCCGT CGAATCGACG
GGTTCGGCCG CGCCGCCGGA TCGATGA
 
Protein sequence
MDTVFLAHVS VFALSALACF AAVPRAMDVG HPETREGLVG LLLTVAVWSS GYVGYLVAPG 
ESLKLAFYIV GFVFALLAVG AWLYFCAAYT GRSPRHAPYR WLVVGVFAAI SLLKVTNPLH
ELYFTTAWVT EPFPHLAIRH ELLYWAVLGL SYAIIAVGFF MLIEQFYYAG ADSRPLVALI
SITAIPTVAT VFGGELSWLL PLMYEPPGVS LFAVGTLFVY LRRFEAIRFA AESNDATIFL
DKSRRIRDYN RAARLLFPSL HGAIGEPLDA VLPRLAVGSD ADGVIAVEGV TLDEDDASIQ
TQEPTRSPRF FHISRNEFTT GGVTFGELLT IDDVTDRERY RIQLEERTEQ LEALNRVVRH
DIRNDMAVIH GWSETLRDHV DKEGQDALDR VLRKSTHVIE LTETARDFVD SLTGDAVLEA
KPTDLGDVLT TEIQAARDSF PNATFRVPAD PPRDTVLANE MLASVFRNLL NNAVQHNDTD
APEVTVTCEE TDDRIRVRVA DNGPGVPDAE KDEIFGKGEK GLDSAGSGIG LYLVSVLTRQ
FGGDVWVEDN EPRGAVFVVE LLRAEGNAAD DQPAESVEST GSAAPPDR