Gene Information |
Locus tag | Tneu_1746 |
Symbol | |
ID | 6165389 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 1537236 |
End bp | 1538246 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641668909 |
Product | histone deacetylase superfamily protein |
Protein accession | YP_001795110 |
Protein GI | 171186191 |
COG category | [B] Chromatin structure and dynamics [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein |
TIGRFAM ID | |
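The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Values taken from the record: Start bp 1537236, End bp 1538246.
    length = gene_length(1537236, 1538246)
    print(length)                  # gene length in bp
    print(protein_length(length))  # protein length in aa
```

Under these assumptions the results agree with the Gene Length (1011 bp) and Protein Length (336 aa) fields of this record.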
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.115562 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTGGTGT ACTTCTCAGA GGTGTTCAAA GGCCACGTGC CTCCCTACCG GCACCCCGAG GCGCCGGACC GGCTGGACTT CCTCATAGAG GGCGCGCGCG AGGCCGGGGC CGACATAAAA GAGCCGAGGA TGAGGGAAGA CGTCTGGCAA CTCGTCGAGT CGGTACACGA CAGGAGCTAC GTAGAGCTGG TGAGGCGCTT GTGTAGAAAA GGCGATGTGC AGATAGACGG GGACACCTAC GTGTCCGCCG GGACATGCGA CGCGGCGGCG CTTGCGGTCT CTGCCGTGGT AGACGCCGTT GATAGAAAGG AGACGGCCTT GGTCGCGGCG AGACCCCCCG GCCACCACGC CGGCTTTGCG GGCAGAGCGC TTTCGGCGCC TAGCCAAGGC TTCTGCATAT TCAACACCGC GGCCATCGGC GCTCTCTATG TGGGCGAGGG CGCCGCCGTG GTGGACATAG ACGTCCACCA CGGAAACGGC ACACAGGAGA TACTATACGA CAGAGACCTG CTCTACATCT CCACACACCA GCACCCGCTA ACCCTCTACC CAGGCACAGG CTATCCGGAG GAGGTGGGCG TGGGGAGGGG GGAGGGCTAC AACGTGAACG TGCCGTTGCC CCCCCGCACC GGGGACGACC TCTACGCCAA GGCGGTAGAC GAGGTGGTGG TTCCCGTCCT CAAGCAGTAC GGCCCCCGCC TAATAATTAT CTCCCTGGGA TGGGATGCAC ACAGGGAGGA CCCCCTCGCC GACATGAACC TCACCCTCAA GAGCTACCTA TACGTCTTCG ACGCCGTCTT ACGCCTCCAA AAGCCGACCA TCTTCCTCCT GGAGGGGGGC TACAACCGCG GCGTTATAAA GAGGGGCACC AAGGCCCTCG TAAGACTCGT TGACGCGGGC GAGTTCGCCC CAGGCGAAAG CCAGACATCC ACCGACGGCC ACACCGCGAA GAGGTACGAG GAGGTCATGA GGGAGGTGAA GAGCCACGTG GGGAGGTACT GGAGGTTATA G
|
Protein sequence | MLVYFSEVFK GHVPPYRHPE APDRLDFLIE GAREAGADIK EPRMREDVWQ LVESVHDRSY VELVRRLCRK GDVQIDGDTY VSAGTCDAAA LAVSAVVDAV DRKETALVAA RPPGHHAGFA GRALSAPSQG FCIFNTAAIG ALYVGEGAAV VDIDVHHGNG TQEILYDRDL LYISTHQHPL TLYPGTGYPE EVGVGRGEGY NVNVPLPPRT GDDLYAKAVD EVVVPVLKQY GPRLIIISLG WDAHREDPLA DMNLTLKSYL YVFDAVLRLQ KPTIFLLEGG YNRGVIKRGT KALVRLVDAG EFAPGESQTS TDGHTAKRYE EVMREVKSHV GRYWRL