Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0952 |
Symbol | |
ID | 4709400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 1027168 |
End bp | 1028091 |
Gene Length | 924 bp |
Protein Length | 307 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639855421 |
Product | histone deacetylase superfamily protein |
Protein accession | YP_001002530 |
Protein GI | 121997743 |
COG category | [B] Chromatin structure and dynamics [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0823921 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACGACCT GGCTGATCAC CCACCCCCAG TGCCTCGAGC ACGACGCCGG CACCGGCCAC CCGGAGAGCG CCGCGCGCCT GCAGGCCATC CTGCAGGCGC TGCAGCACGC CACCTTCGAG TACGTCCTGC GCGAGGAGGC ACCGCTGGCG ACGGTCGAGC AGCTAGAGCT GGCCCACGAC CCGACCTACG TACGCTCGCT GCTCGATAGC GTCCCCGCCG AAGGGGCCCG GCACATCGAC CCGGACACCC GGCTCTGCCC GGCCACCGGC GAGGCCGCCC GGCGGGCCGC CGGGGCCGTC TGCCACGGGG TCGACGGCGT GCTCTCGGGC AAGGCGCAGC GGGTTTTCTG CGCGGTGCGC CCACCCGGCC ACCACGCCGA GCCGGACCGC GCCATGGGCT TCTGCTTCTT CAACAACATC GCCGTCGGCG CCGGCCACGC GGTGGCGCAC TACGGCCTGC AGCGGGTGGC GATGATCGAT TTCGACGTCC ACCACGGCAA CGGCACCGAG GCGATCAGCC GCGGCCGCCC GGGGTTCTAC TACTTCTCGA CCCACCAGCA CCCGCTGTTC CCCGGCACCG GCACTCCGGG CAGCGACGCC CCGGCGAATA TCGTCAACGC CACCCTGGCC GACGGCGACG GCTCCGAGGC CTTCCGTGAG GCCTTCACGG GGACCATCCT CCCCGCGCTG GAGGATCTGC AGCCGGAGCT GATTCTGATC TCGGCGGGCT TCGATGCCCA CCGCTCCGAT CCGTTGGCCA CCCTGCAACT GGACGAGACC GACTTCGCCT GGGCCACCCG AGAGCTGGTG GACGTGGCCA AGCGCCACTG CCAGGGGCGG GTGGTCTCGG TCCTCGAGGG CGGCTACAAC ACCTCCGTGG TCGGCCGCTG CGCCGCGGCC CACCTCGAGG CGCTGATGAT GTAG
|
Protein sequence | MTTWLITHPQ CLEHDAGTGH PESAARLQAI LQALQHATFE YVLREEAPLA TVEQLELAHD PTYVRSLLDS VPAEGARHID PDTRLCPATG EAARRAAGAV CHGVDGVLSG KAQRVFCAVR PPGHHAEPDR AMGFCFFNNI AVGAGHAVAH YGLQRVAMID FDVHHGNGTE AISRGRPGFY YFSTHQHPLF PGTGTPGSDA PANIVNATLA DGDGSEAFRE AFTGTILPAL EDLQPELILI SAGFDAHRSD PLATLQLDET DFAWATRELV DVAKRHCQGR VVSVLEGGYN TSVVGRCAAA HLEALMM
|
| |