Gene Hlac_2684 details

Gene Information

Locus tag: Hlac_2684
Symbol:
ID: 7400891
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2672899
End bp: 2674449
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 71%
IMG OID: 643709758
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_002567325
Protein GI: 222481088
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
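
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; both assumptions agree with the values in this record. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
start_bp = 2672899
end_bp = 2674449

def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1

def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1

gene_length = span_length(start_bp, end_bp)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # prints: 1551 516
```

Both results match the reported Gene Length (1551 bp) and Protein Length (516 aa).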


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.989107
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCCTCC GCCCGACCGA CCGCCCGGCG ATCGCCTGCC GCGATGTCCG GGCCGATACC 
GACGACACCC GACAGTCGAT CGACGGATCG GCCGAGTCGC CCGCTTCAAG CGGTTCGCCC
GCTTCGCCAG TCCCGGGTGC CGACCGGTTC GTCGCGGCTG TCGCGGTCGA CGGCGAGGTG
CTGTTCGCCG GTCCCTCGGT GCCCGCGGTG CTCGGGATCG AGCGGGCGAC CCTCGTCGGT
GATGACCTCT TGGATTACGT TCACCCGAAC GACCGCGAGC CCGTCTCCGA CGCGCTCTCG
GCGGTCGCGA CCAACCGCGT CGTCACCCAC CGGCTCCGCC ACGCCAACGG CGGGTTCGTC
TGGGTCGAGT CGGTCGTCGA CGAGGAGTTA GCCCCCGAGT TCGGTGGGCG CGTCGTCACG
GTGCGTCGCG TCGACGCCGA GCAGACCTTC CCGGAGCGGT TCCGGGAGTT CCTGGAGTAC
GGCACGGATC TGGTCACCGT CGTCGACGCG GACGGGCGGG TCCGGTACGA GAGCCCGGCC
GTCGAGGAGG TTCTCGGCTA CGAGCAGGGG TCGACCGTGG GGCGCTCCCC GCTCGGCTAC
GTCCACCCCG ACGACCGCGA GCGCGTGACC GAGCGGTTCT ACCGCGCGCT CAACGATCCC
GACGCGACCC CCACGATGGA GTACCGCTAC CGTACCGCCG ACGGCAACTG GGTCTGGCTG
GAGTCTCGAA GCCGGTCGCT ACCCGACGAC GTCGCGGTCG GACGCCTGCT CATCAACTCG
CGGGACGTGA GCGAGCGGAA GGCGCGCGAG CGCCGGCTCA CCGACCGCAA CGAGCGGCTC
GACCGCTTCG CCAGCATCGT CTCGCACGAC CTCCGGAACC CGCTGTCGGT GATCCGAGGA
TCGATGGAGA TGGCGGAGCT AAACGGCGAC ACAGAGCCCT TGGAGCGCGG CGAGCGCGCC
GTCGACCGGA TCGACCAGCT GGTCTCGGAG CTGTTGACGC TCGCCCGGCA GGGCTCCGGG
ATCGACGAGC CGACCGAGTT CGCGCTCGGT GGCGTCGCTC GCGAGGCGTG GGACACCGCC
GGGAGCGCGG ACGCGACCCT CGTCCTCGGC GCGGATGCCC GAGTGTGCGG CGACCGCGGC
CGGCTGCGAC AGGTGTTCGA GAACCTGTTC CGGAACGCGA CGGAACACGC CGCGCCGGAC
GGCACAGACG CGATTCGATC GACCGACAGC GGCGAAGACG CCCCACTCAC CGTCCTCGTG
ACCGCGACCG GCGGGGGATT TCTCGTCGCC GACGACGGAC CGGGGATCGA TCCGGCGCAC
CGCGAGGAGG TCTTCGACCC TGGCTTCACG ACCCGCGAGG ACGGGACAGG CTACGGGCTC
GACATCGTCC GCGAGGTCGT CGAGTCGCAC GGGTGGACGA TCGGAGTCCG GAGAGACGGC
ACCGATCCGG CGTGCCCGGA CGACGTGACG GTCCCCGACG GGGCGTGCTT CGTGGTCGGA
GGCCCCGACT CCGACGCGGC CGACGCGGAC GAACCGTGGA TCGACGGGTG A
 
Protein sequence
MTLRPTDRPA IACRDVRADT DDTRQSIDGS AESPASSGSP ASPVPGADRF VAAVAVDGEV 
LFAGPSVPAV LGIERATLVG DDLLDYVHPN DREPVSDALS AVATNRVVTH RLRHANGGFV
WVESVVDEEL APEFGGRVVT VRRVDAEQTF PERFREFLEY GTDLVTVVDA DGRVRYESPA
VEEVLGYEQG STVGRSPLGY VHPDDRERVT ERFYRALNDP DATPTMEYRY RTADGNWVWL
ESRSRSLPDD VAVGRLLINS RDVSERKARE RRLTDRNERL DRFASIVSHD LRNPLSVIRG
SMEMAELNGD TEPLERGERA VDRIDQLVSE LLTLARQGSG IDEPTEFALG GVAREAWDTA
GSADATLVLG ADARVCGDRG RLRQVFENLF RNATEHAAPD GTDAIRSTDS GEDAPLTVLV
TATGGGFLVA DDGPGIDPAH REEVFDPGFT TREDGTGYGL DIVREVVESH GWTIGVRRDG
TDPACPDDVT VPDGACFVVG GPDSDAADAD EPWIDG
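
The sequences above are printed in space-separated groups of ten residues, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup together with a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(block.split()).upper()

def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)

# First line of the gene sequence from this record, used as a small example.
first_line = "GTGACCCTCC GCCCGACCGA CCGCCCGGCG ATCGCCTGCC GCGATGTCCG GGCCGATACC"
dna = clean_sequence(first_line)
print(len(dna), round(gc_fraction(dna), 2))
```

Applying `clean_sequence` to the full gene sequence block should give a string whose length can be compared with the reported Gene Length, and `gc_fraction` on that string can be compared with the reported GC content.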