Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1556 |
Symbol | |
ID | 5104001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1513108 |
End bp | 1514133 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640507442 |
Product | histone deacetylase superfamily protein |
Protein accession | YP_001191635 |
Protein GI | 146304319 |
COG category | [B] Chromatin structure and dynamics [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0460927 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACGTATG GGGACTCATT AGGGATAGTC TGGGATGAAA GGTTTAGGGA GATCTCCTTC TCCCACCCCA TGATAAGGGA CGTTTCCAAG AGCAGAATCG TGAGGTTCAG GGAGCTGGTC TCAACTCTCA ATGTCTACTT CATATCTCCT AAACCTGCCA CGTACGAGGA TCTGCTCGAG GTTCACGATG AGAGCCTTCT AAGAAAGATC AAGGAAGTTA GTTCCCTCCC TTACATTGGC TTCCTAGATT CGGGGGACAC GGTTCATTAC CCTGGCATGT TTGACGATAT TCTTCTAGTT GCGGGTTCGA CTCTCACGGC AATCTTCATG TCAAGGTTCT TTGACTCAAT TTACATTCCG CTGGGGGGAT TTCATCATGC CACTAGATCA AGATCCATGG GTTTTTGCCC CATAAACGAC GTTAACTTAG CGATATCGAG ACTCATGAAA ATGGGAGAAA GGGTCGCACT GGTTGACGTA GACGCCCACC ACGCGAATGG AGTGGAGGAA ATGTTCTATG ACAAACCAGT GCTCAAGATC AATATTTTCG CTTATGATGG CAAATTCTTC CCGGGAACCG GTGATTACAG GAGAAGGGGG AAAGGAGAAG GGACGGGTTA TAACTTCAAT GTCGGGTTAC CTCTGGGCTC TGCTGACGAC TCGTTTCAGG AGGCATTAAG GTTGCTCGAT GTGGTTGAAG ACTTCAAACC ATCAGTCCTA CTCGTGGTTG CCGGGGTAGA TGGGCATAAG GATGACGGAC TCAAATCCCT CAATCTGACC ACGAACTCCT TCAATTTACT GGGCCTAAAG GTGAGCAGGT TAGCTAGGAG GGTTAACGCA AGGATTATTT CCTTTGGAGG AGGAGGCTAC GGTCCAGGTT CAGCACCCTC TATGTTTTCC TTTGTGAAGG GACTCATGGG AAACAGGTCA GAGGATGAAA TGCCCACACA AGATGAGGAG AAAAAGGCTT ACGTGAAAAA ATTAGTGGAT CTACTCCTTG AACGCCTTCC CAGTAGCCTT GAGTAG
|
Protein sequence | MTYGDSLGIV WDERFREISF SHPMIRDVSK SRIVRFRELV STLNVYFISP KPATYEDLLE VHDESLLRKI KEVSSLPYIG FLDSGDTVHY PGMFDDILLV AGSTLTAIFM SRFFDSIYIP LGGFHHATRS RSMGFCPIND VNLAISRLMK MGERVALVDV DAHHANGVEE MFYDKPVLKI NIFAYDGKFF PGTGDYRRRG KGEGTGYNFN VGLPLGSADD SFQEALRLLD VVEDFKPSVL LVVAGVDGHK DDGLKSLNLT TNSFNLLGLK VSRLARRVNA RIISFGGGGY GPGSAPSMFS FVKGLMGNRS EDEMPTQDEE KKAYVKKLVD LLLERLPSSL E
|
| |