Gene Information |
Locus tag | Pars_1348 |
Symbol | |
ID | 5056397 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1212303 |
End bp | 1213313 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640468894 |
Product | histone deacetylase superfamily protein |
Protein accession | YP_001153563 |
Protein GI | 145591561 |
COG category | [B] Chromatin structure and dynamics; [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein |
TIGRFAM ID | |
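The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS; the stop codon is assumed not to be counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length_bp = gene_length(1212303, 1213313)   # 1011
length_aa = protein_length(length_bp)       # 336
```

Both results agree with the Gene Length and Protein Length fields in the record.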
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.104059 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACGTGT ACTACTCGGA GATTTTTAAG AAGCACACCC CTCCCTTTAG ACACCCTGAG GCGCCAGATA GGCTTGACTA CGTCATAAAG GGCGTAGTGG AGGCGGGAGG CGTTGTAAAG GAGCCTAAAA TGCGGGAGGA CGCGTGGAGG CTTATATACT CGGCTCATGA TAAGAGCTAC GTAGAGTACG TCAAAAGGCT CTGCGGCGCT GGCCAAGCAG AAATTGATGG CGATACATAC GTCTCTGCGG GTACTTGCGA CGCAGCCGCG CTCGCCGTCT CTGCCGTAAT GGAGGCCCTC GACAAGAGGG AGACAGCCCT CGTCGCGGCG AGGCCGCCCG GGCACCACGC CGGCTTCGCC GGGAGGGCCC TCACGGCGCC TACCCAAGGC TTCTGTATAT TCAACACGGC GGCCATTGGG GCCTTGTACG GGGGAGATGG CATCGCTGTG GTTGACATAG ACGTGCACCA CGGGAATGGA ACGCAAGAAA TGCTCTACGA AAGAGATCTC TTGTACATAT CAACTCATCA ACACCCGGCT ACGCTGTACC CCGGCACTGG GTACCCAGAC GAGGTGGGGA CAGGCAGAGG CGAGGGGTTT AATGCCAACC TCCCCCTGCC GCCGGGGACA GGAGACGACC TGTACATCAA GGCTATTGAT GAAGTGGTCT TGCCGTTGCT GAGGCAGTAC GACCCAAGGG CGGTAATAGT CTCGCTTGGG TGGGACGCCC ACAAGGACGA CCCCCTAGCC GACCTCGCCT TGTCCCTAAA GGGCTACCTC TACGCGTTGA GCGCGATCCT CAGCTTGCAG AAGCCAACTA TATTTCTCCT GGAGGGGGGC TACAACAGAG AGGTGTTGCA GAGGGGGACA AAGGCGCTGG TGCGCCTAGT AGCGGCGGGG GACTTCAGGC CCGAGGAAAC CCAAACAGAT TCGCCTCCCC ACGTGGCGAG GCGATACGAG GAGATAATGC AAGAGGTAAG ACGCCACCTA GGCCGGTACT GGCGCCTATA A
|
Protein sequence | MYVYYSEIFK KHTPPFRHPE APDRLDYVIK GVVEAGGVVK EPKMREDAWR LIYSAHDKSY VEYVKRLCGA GQAEIDGDTY VSAGTCDAAA LAVSAVMEAL DKRETALVAA RPPGHHAGFA GRALTAPTQG FCIFNTAAIG ALYGGDGIAV VDIDVHHGNG TQEMLYERDL LYISTHQHPA TLYPGTGYPD EVGTGRGEGF NANLPLPPGT GDDLYIKAID EVVLPLLRQY DPRAVIVSLG WDAHKDDPLA DLALSLKGYL YALSAILSLQ KPTIFLLEGG YNREVLQRGT KALVRLVAAG DFRPEETQTD SPPHVARRYE EIMQEVRRHL GRYWRL
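The sequences above are shown in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted gene sequence might be cleaned, translated, and scored for GC content. It assumes the gene sequence is given in coding orientation (it starts with ATG even though the gene lies on the minus strand) and that the amino-acid assignments of translation table 11 match the standard code; all names are illustrative.

```python
from itertools import product

# Codon table built in the conventional TCAG order; the amino-acid
# assignments are assumed to be those shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(grouped.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(dna: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop codon."""
    seq = clean_sequence(dna)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three and last three blocks of the gene sequence above.
first_residues = translate("ATGTACGTGT ACTACTCGGA GATTTTTAAG")  # "MYVYYSEIFK"
last_residues = translate("GGCCGGTACT GGCGCCTATA A")            # "GRYWRL*"
```

The two translated fragments match the first and last residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field of the record.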