Gene Information |
Locus tag | Msed_0533 |
Symbol | |
ID | 5103693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 490082 |
End bp | 491128 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640506437 |
Product | histone deacetylase superfamily protein |
Protein accession | YP_001190632 |
Protein GI | 146303316 |
COG category | [B] Chromatin structure and dynamics; [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein |
TIGRFAM ID | |
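
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(490082, 491128)
print(length_bp)                  # 1047
print(protein_length(length_bp))  # 348
```

Both values agree with the Gene Length and Protein Length fields of this record.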
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.072372 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACAGAA CTGTTTTCGT GTGGGATGAT GCTTACCTGG ATTACTCCTT TCCAGGGGAT CATCCCTTTA AGTCGTTTAG GGAGAGCAGA GCTAAGAAGT ACATGGAGGA GAGGGGGTTC TTTCATCACA TGGAAATGAG GAGGCCGGAT CCAGACGATG AGAGTATTCT CTTGAATGTT CACTCTCGGG ATTACGTGGA CTTCGTGAAG ATGAAGAGTC AGGAGGGGGA GGGATTACTG GATTATGGGG ATACTCCAGC CTTCAAAGGC GTCTTCGAGA GCGCACTGAG AAGGGTAATG GGTAGCGTTA CGGGGATTAG GTTGCTGGCA CAGGGCTACG ACCACGCAGT GAACTTGGGA GGGGGTTTAC ATCACGCGCA ATGGGGATCA GCATCTGGGT TCTGCGTCTT CAATGACGTT GCCATTGCGG CAAAGGAGGG AGAGAAGTTC TTCAGAAGGA TCGCAATTGT TGATGTGGAC GGACATCACG GCGACGGAAC CCAAGCGTTA CTTTACGATG ACCCCAATGT TCTGAAGATA TCCCTGCATA TGTATCATAG GGGGTTTTTC CCCGGTACGG GAGAGATAAA CGAAATTGGA ACGGGGAAGG GAAAGGGGTA TACGGTTAAC GTTCCTCTGC CTCCTGGAAC CGCAGATGAT GCCTATATTT ACGCCTTTGA TAACGTGGTT ATACCCCTTC TGGATAGGTT TCAGCCAGAG GCCATAATCA TCCAGGAGGG AGGAGACTCC CATTTCGACG ATCCTCTCGT GGAGCTTAAG TTGAGCACCA GGGGTTACCT TGCATTGGTC AAGAGGATCC ATGATCTAGC CCACAGGGGC ACTGGAAAGA TCCTGTTACT AGGGGGAGGA GGTTATAACT ACGATGCGAC TGCGAGAGTG TGGACAGTTT CCGTAGCTGA GTTACTCGGA CTTGATGATC AGGAGGTTGA GTCTCTGCAC GACTGCTGTC TTACCTCATC AAGTGCCTAC ATAATGGAGA GAGTTAAGCA GGTTGTAGAA GAAGTGAAAA AAGTTCACGG AATCTAG
|
Protein sequence | MHRTVFVWDD AYLDYSFPGD HPFKSFRESR AKKYMEERGF FHHMEMRRPD PDDESILLNV HSRDYVDFVK MKSQEGEGLL DYGDTPAFKG VFESALRRVM GSVTGIRLLA QGYDHAVNLG GGLHHAQWGS ASGFCVFNDV AIAAKEGEKF FRRIAIVDVD GHHGDGTQAL LYDDPNVLKI SLHMYHRGFF PGTGEINEIG TGKGKGYTVN VPLPPGTADD AYIYAFDNVV IPLLDRFQPE AIIIQEGGDS HFDDPLVELK LSTRGYLALV KRIHDLAHRG TGKILLLGGG GYNYDATARV WTVSVAELLG LDDQEVESLH DCCLTSSSAY IMERVKQVVE EVKKVHGI
|
| |
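
The gene and protein sequences above can be checked against each other programmatically. The following is a minimal sketch in plain Python, not the database's own pipeline. It strips the spacing between the 10-character blocks, computes a GC fraction, and translates codons using the standard amino-acid assignments, which translation table 11 shares with table 1 (the two tables differ in their permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first three blocks of the gene sequence are used as input.

```python
BASES = "TCAG"
# Amino acids in NCBI genetic-code order: first base T, C, A, G,
# then second base, then third base. "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three blocks of the gene sequence listed in this record.
prefix = "ATGCACAGAA CTGTTTTCGT GTGGGATGAT"
print(translate(prefix))  # MHRTVFVWDD
print(gc_fraction(prefix))
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to the same functions gives a quick consistency check against the listed protein sequence and GC content.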