Gene Hlac_1231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1231 
Symbol 
ID7399499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1241385 
End bp1242515 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content66% 
IMG OID643708295 
Producthistidine kinase 
Protein accessionYP_002565893 
Protein GI222479656 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.000541094 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGTGACG ACGCTCTCAG CGCGATGGAA GCCAGAGAGC GGCTCTACGA GGTGATGGAC 
CGAGATGTCC CTTTTGAGGA GAAGGCGACG CTGGCGCTCT CCATCGGGGA GGCGTACCTG
GGCGTGGAGA ACGGACATCT GACGCGGATC GACGTCGAGA GCGACTACTG GAAGGCGATC
GCCAGTACGG ACTCGGTGGA CGGACGGTTC CCGGTCGGGC TCCAGTTAGA CCTCCAGAAC
ACGTACTGCC GGCGAACGAT CGACGGCGAG AGCCCGGTTC GACTGTACGA CGCACCGAAC
CAGGGGTGGG ACGACGACCC CGCGTTCGAG CGGCACGGAC TCCACTGTTA CCACGGCAGT
ACGATCACCA TCGACGACGG GGTCTGCGGA ACGCTGTGTT TCGTCTCCAC GGAGCCGCGT
CCAGAGCCGT TCGCCGACGA GGAGACGCTG TTCGCTGAGC TTATCGCGCG GCTGTTGGAA
ACGGAGCTCC AAGCGGAACG GACGGAGGCA AAGATCGACC GGCTCGATCA GTTCGCCAGC
GTCGTCTCCC ACGACCTCAG AAGCCCGTTG AACGTCGCAC AGGGCCGCGT CGATCTCGAA
CGATCGACCC GCGACAGCGA CCACTTGGGG ATCGCCGCGA GATCGTTGGA CCGGATGGAG
GACCTGATCG CCGACGTTCT CACGGTGGCC CGACAGGGAC AGGAGATCGG GGACACGGAA
CTCGTCTCGC TCGACGCGAT CGTCACAGAG TGCTGGGACG CAGTGCAAAC CGACGGCGCG
AACGTGACCG TTACCGACGA TCTGTGGTTT AAGGCGGACC GAGGGCGCGT CCGCCACCTC
TTCGAGAACC TGTTCCGGAA CGGTGTCGAG CACGCGGGCC CGGACGTGTC GATCCGCGTC
GGACCGCTCG ACAACGGAGA CGGATTCTAC GTCGAGGACG ACGGCCCCGG AATTCCGGCG
GCCGATCGGG AGCAGGTCTT CGAGTCGGGG TACACGACGG GCACGGACGG GCTCGGACTC
GGCCTCTCGA TCGTCGGGGG CGTCGTCGAC GCTCACGGGT GGACGATCGC GGTCGGAGCG
GGCGCCGACG GCGGCGCGCG ATTCGAGGTT TCCGGCGTGG TCGTTCCGTA G
 
Protein sequence
MGDDALSAME ARERLYEVMD RDVPFEEKAT LALSIGEAYL GVENGHLTRI DVESDYWKAI 
ASTDSVDGRF PVGLQLDLQN TYCRRTIDGE SPVRLYDAPN QGWDDDPAFE RHGLHCYHGS
TITIDDGVCG TLCFVSTEPR PEPFADEETL FAELIARLLE TELQAERTEA KIDRLDQFAS
VVSHDLRSPL NVAQGRVDLE RSTRDSDHLG IAARSLDRME DLIADVLTVA RQGQEIGDTE
LVSLDAIVTE CWDAVQTDGA NVTVTDDLWF KADRGRVRHL FENLFRNGVE HAGPDVSIRV
GPLDNGDGFY VEDDGPGIPA ADREQVFESG YTTGTDGLGL GLSIVGGVVD AHGWTIAVGA
GADGGARFEV SGVVVP