Gene Hlac_0469 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0469 
Symbol 
ID7400349 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp486394 
End bp487530 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content68% 
IMG OID643707533 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_002565141 
Protein GI222478904 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.235953 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.329223 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGAAG GGTCACCTCG GGACAGGGAG TTACGACGGT ACGAGACGAT TCTCGATTCG 
CTCGATGACG CCGTCTACGC CGTCCGTCCC GACGGAACCA TCGCGTACGT CAACGATCGG
TATGCCGAGA TGAAGGGCGT GAGTCGCGAG GCGCTGCTGG GGACGGATAT CTACGACTGG
GTCACGGAGG AAACCGCGGA GAAGGCCACA CAGGTCCGCA ACGAGATGGC CGCGGGCGAT
CGCGACGTCG GCATGGTCGA GTACGAGTTC CTCACGGCAG ACGGGGAGCG ATTCCCAGCG
GAGATGCGCT TCAACAGGGT GACCGGCGAG GAAGGCGAGG AGCTGAACCG CGTCGGCGTC
ATCCGAGACG TCCGCGAGCG GAAGCGGCGC GAGGAGGCGC TCCGCCGCAA GAACGAACGG
CTCGAGGAGT TCGCGAGCAT CGTCTCGCAC GACCTCCGGA ACCCCCTCAA CGTCGCGCAG
GGGCGGCTGG ACCTCGCCCG CGAGGAGTAC GACTCCGAGC ACCTGGAGGT CGTCGCCAAC
GCCCACGAGC GAATGGCGGC GCTCATCGAC GACCTCCTGA CGCTCGCCCG TGACGGCGAG
GGCGTCGAGG AGACGGAGCG GGTTCCCCTC CGCGAACTCG CGGAGGTGTG CTGGGAGAGC
GTCGAGACCG CAGCGGCCTC GCTCCGGATC GAGACCGACC GCGCGATCCG CGCGGACCGG
AGCCGACTCA GACAGCTCGT CGAGAACCTC ATGCGGAACA GCGTGGAACA CGGTCGTTCG
AGAGACGGCG ATACCGTCTC TCGCGATGAC GAAAACTCCG AGACGGAGTT TTCGAACCAT
TCCACGAGCA GCCGGGCAGA GCCCGGCGAC GCGGTCGTGC ACGCCGGCGA GGACGTCACG
GTGACGGTCG GCGACGTCGA GGGGGGCTTC TACGTCGCCG ACGACGGCCG GGGGATCCCC
GAGAGCGACC GCGAGACGGT GTTCGAGACG GGGTACACCA CGAGCGACGA CGGGACCGGG
TTCGGCCTCG AAATCGTCGA GGCCGTCGCG ACGGCTCACG GCTGGGACGT GCGCGTCACC
GACGCCGCGG GCGGCGGCGC CCGGTTCGAG TTCACCGGGG TCGACGTGCT CGACTGA
 
Protein sequence
MTEGSPRDRE LRRYETILDS LDDAVYAVRP DGTIAYVNDR YAEMKGVSRE ALLGTDIYDW 
VTEETAEKAT QVRNEMAAGD RDVGMVEYEF LTADGERFPA EMRFNRVTGE EGEELNRVGV
IRDVRERKRR EEALRRKNER LEEFASIVSH DLRNPLNVAQ GRLDLAREEY DSEHLEVVAN
AHERMAALID DLLTLARDGE GVEETERVPL RELAEVCWES VETAAASLRI ETDRAIRADR
SRLRQLVENL MRNSVEHGRS RDGDTVSRDD ENSETEFSNH STSSRAEPGD AVVHAGEDVT
VTVGDVEGGF YVADDGRGIP ESDRETVFET GYTTSDDGTG FGLEIVEAVA TAHGWDVRVT
DAAGGGARFE FTGVDVLD