Gene Information |
Locus tag | Haur_0154 |
Symbol | |
ID | 5732063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 184887 |
End bp | 185924 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277278 |
Product | histone deacetylase superfamily protein |
Protein accession | YP_001542934 |
Protein GI | 159896687 |
COG category | [B] Chromatin structure and dynamics [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein |
TIGRFAM ID | |
|
|
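The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp, has_stop_codon=True):
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_bp // 3
    # The terminal stop codon does not code for an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(184887, 185924)
print(length)                     # 1038
print(protein_length_aa(length))  # 345
```

Under these assumptions the values reproduce the reported Gene Length (1038 bp) and Protein Length (345 aa).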
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000391208 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCTCG CTCTCATTTC ACACGATCAT TATCTCGACC ACGATCAGCC TGAGCACCCC GAAAATGCCA ATCGGCTACG GGCGATTCAT GCCATGCTGG CAGCTGATTA TGAATTGCAA CAGCACTTAA CCCCGCTTGC ACCACGCCAT GCAACTGCTG CTGAAATCGA GGCGGTGCAT GTGCCAAGCC ATTTGCCAAC CCTACAACGG ATGGCCCAAT TTGGCGATTG GGCTGATGCT GAAACCTATA TTTTGCCAGA TTCGGTCGAG ATTGCCCAAC TTGCTGCTGG CGGCGCGATT GTCGCGACTG ATGCAGTGCT CAGTGGTCGG CATGCCAATA GTTTTGCTTT GGTTCGCCCA CCAGGCCATC ATGCGACCGC CGACCAAGCT ATGGGATTTT GCTTGTTCAA TAATGCGGCG ATTGCGGCGG CCTTTGCTCA ACGTGAATAT GGCCTCAAAC GAGTGGCAAT TTTGGATTGG GACGTGCATC ATGGCAACGG AACCCAAGAT ATTTTCTACC AAAACCCCGA TGTGCTCTAC ATCTCGACCC ATGGCTGGCC GCTTTGGCCC AACTCAGGTC ATTGGAAGGA GATGGGCGCG AACGCTGGAT TGGGCACGAC CCTTAACTTG CCATTACGCC CATTAACTGG CGATATGGGT TTTCATCTGG TGTTTGAACA AGCGATTGCC CCAGCTATTC GGCGCTTCAA GCCTGAGTTA TTGATCATCT CGGCGGGCTA TGATGCGCAT ATTTACGACC CATTGGGGAA TTTGGCACTC TCCACTGGCG GTTATGCCCA GCTATCGTCG ATCGTGTATA ATTTAGCCGC CGAGTGCTGC GATGGACGCT TGGTGGGCTT GCTTGAGGGT GGTTACAACC TCGAAGCCCT CGCTCAAAGC CTCACTGCAA CGCTGCAAAC ATGGGTTTCT GGTCAGCCTG CCCCGATTTT CAATCAAGAA GTCAGTCATA CCCCAGAACC CGATGTTACC TGGCTGATCG AGCATCTCCG CCGCGAACAT CCGCTTTTGA AGGGATGA
|
Protein sequence | MSLALISHDH YLDHDQPEHP ENANRLRAIH AMLAADYELQ QHLTPLAPRH ATAAEIEAVH VPSHLPTLQR MAQFGDWADA ETYILPDSVE IAQLAAGGAI VATDAVLSGR HANSFALVRP PGHHATADQA MGFCLFNNAA IAAAFAQREY GLKRVAILDW DVHHGNGTQD IFYQNPDVLY ISTHGWPLWP NSGHWKEMGA NAGLGTTLNL PLRPLTGDMG FHLVFEQAIA PAIRRFKPEL LIISAGYDAH IYDPLGNLAL STGGYAQLSS IVYNLAAECC DGRLVGLLEG GYNLEALAQS LTATLQTWVS GQPAPIFNQE VSHTPEPDVT WLIEHLRREH PLLKG
|
| |
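The gene and protein sequences above can be related by translating the gene sequence codon by codon. The sketch below is illustrative only: it assumes the space-separated 10-character blocks are purely a display convention, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the tables differ in which start codons are permitted). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the display spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence listed above.
print(translate("ATGAGCCTCG CTCTCATTTC ACACGATCAT"))  # MSLALISHDH
print(translate("CCGCTTTTGA AGGGATGA"))               # PLLKG
```

The two outputs match the first and last blocks of the listed protein sequence, and the final codon TGA is a stop codon, consistent with the gene being one codon longer than the protein.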