Gene Hlac_0109 details

Gene Information

Locus tag: Hlac_0109
Symbol:
ID: 7401629
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 114841
End bp: 115845
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 67%
IMG OID: 643707172
Product: histone deacetylase superfamily
Protein accession: YP_002564785
Protein GI: 222478548
COG category: [B] Chromatin structure and dynamics
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein
TIGRFAM ID:
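
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. Neither assumption is stated in the record, though both agree with the reported values. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(114841, 115845)
print(length)                     # 1005
print(protein_length_aa(length))  # 334
```

Both results match the Gene Length and Protein Length fields of the record.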


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.39401
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCTTCG GCTACAGCGA TCGGTGTCTC GAACACGACA CCGGCGAACG GCATCCGGAA 
AACCCAGATC GACTGCGCGC GATCCGTCGC GGCCTGGCGA AGCGACACGG CGTCGAGTAC
GCAGAGGCGG ACCCCGCGAC GCGCGAGGAG GTTGTCGCGG TCCACGACGC GGAGTACGTC
GACGAACTGG AGGCGTTCGT CGCCGACGGC GGCGGGAGCT GGGACCCCGA CACCGTCGCG
AGCGAGGGGA CGTGGGACGC TGCCCTCGCC TCGGCCGGCC TCGCACAGTG GGCGGCTCGA
TCCGCGCTCA ACGGCGCCGA CGGTCGAGAC ACCCCGTTTG CGCTCGGACG GCCGCCGGGC
CACCACGCGG TGCCCGATGA CGCCATGGGT TTTTGCTTTT TCAACAACGC CGCCGTCGCG
GCCCAGACCG TTCTCGACGA CGGGGCCGCA GACCGGGTCG CAGTCTTCGA CTGGGACGTA
CATCACGGAA ACGGGACCCA AGACGTATTC TACGACCGCG GTGACGTGCT CTACGCATCG
ATTCACGAGG ACGGACTCTA TCCGGATACC GGAGCGCTCG ACGAGACCGG CCACGACGAA
GGGGCGGGAA CAACGGTGAA CCTCCCGCTT TCGGCCGGGG CGGGCGACGC CGACTACCTC
TACGCCATCG ACGAGGTGGT CGCCCCGGCG ATCAAACGGT TCGACCCCGA TCTCGTGATC
GTCTCGGCCG GGTTCGATGC TCACCGACAC GACCCCATCT CGCGGATGCG CGTCTCCTCG
GAGGGGTACG CGCTGATGAC CGACCGAATC CGGACGGTCA CCGACAACAT CGAAGCTGCG
AACTCCTACG TCCTTGAAGG AGGCTACGGT CTCGACACGC TGGCCGAAGG CGTCTCGATG
GTCCACGAGA CGTTCGACGG GCGCACGCCT GTCGGCGATG ACGACGACCC CGACGAGAAG
ACGGAGTCGT TGGTGACCGA GTTGCGGGAG CTGCTCGACT TATAA
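
The gene sequence is displayed in groups of 10 bases, 60 per line, so the whitespace has to be removed before any analysis. The sketch below shows one way to do that and to compute a GC percentage, assuming GC content is defined as the share of G and C bases over the full sequence length. The record does not say how its 67% figure was derived, and the helper names are hypothetical.

```python
def clean_sequence(block: str) -> str:
    """Strip the display spacing and line breaks from a sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First display line of the gene sequence from this record.
first_line = "ATGCGCTTCG GCTACAGCGA TCGGTGTCTC GAACACGACA CCGGCGAACG GCATCCGGAA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq)))
```

Passing the whole gene sequence block through `clean_sequence` and then `gc_percent` would give a value to compare with the GC content field.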
 
Protein sequence
MRFGYSDRCL EHDTGERHPE NPDRLRAIRR GLAKRHGVEY AEADPATREE VVAVHDAEYV 
DELEAFVADG GGSWDPDTVA SEGTWDAALA SAGLAQWAAR SALNGADGRD TPFALGRPPG
HHAVPDDAMG FCFFNNAAVA AQTVLDDGAA DRVAVFDWDV HHGNGTQDVF YDRGDVLYAS
IHEDGLYPDT GALDETGHDE GAGTTVNLPL SAGAGDADYL YAIDEVVAPA IKRFDPDLVI
VSAGFDAHRH DPISRMRVSS EGYALMTDRI RTVTDNIEAA NSYVLEGGYG LDTLAEGVSM
VHETFDGRTP VGDDDDPDEK TESLVTELRE LLDL
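
The protein sequence can be checked against the gene sequence by translating it. The record lists translation table 11, which assigns the same amino acids to sense codons as the standard genetic code and differs in the start codons it allows. The sketch below is a simplified illustration that ignores alternative start codons (this gene begins with ATG) and stops at the first stop codon. `translate` is a hypothetical helper, not part of any tool behind this page.

```python
from itertools import product

# NCBI codon ordering: bases in T, C, A, G order, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored; any character other than A, C, G, T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 bases of the gene sequence in this record.
print(translate("ATGCGCTTCG GCTACAGCGA TCGGTGTCTC"))  # MRFGYSDRCL
print(translate("CTGCTCGACT TATAA"))                  # LLDL
```

The two fragments reproduce the first ten and last four residues of the protein sequence shown above.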