Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pcal_1126 |
Symbol | |
ID | 4909224 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum calidifontis JCM 11548 |
Kingdom | Archaea |
Replicon accession | NC_009073 |
Strand | + |
Start bp | 1057101 |
End bp | 1058111 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640124880 |
Product | histone deacetylase superfamily protein |
Protein accession | YP_001056017 |
Protein GI | 126459739 |
COG category | [B] Chromatin structure and dynamics [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.546018 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGTGG TGTTCTCCCC CGTGTTTAAG TTGCACAAAC CGCCGTACAG CCACCCCGAG GCTCCCGACA GACTCGACGC AGTGTTAAAA GGCGCCGAGG AGGCCGGCGC CGCCGTCCAA GCGCCAAGGG CCAGAGAGGA CGTGTGGAAC CTCGTGTCGC TGGCCCATGA GAGGGGGTAC ATAGAGTTTG TGAGAAGGCT GTGTAAAGAG GACTATGCCC AAATCGACGG CGACACGTAC GTCTCCTCTG GCACATGCGA GGCAGCGGCC TTGGCAGTAA GCGCTATGGC AGACGCCGTG GACGCAGGCG AGACAGTTTT GCTCGCAGTT AGGCCGCCCG GCCACCATGC GGGGTTTGTG GGGAGGGCGC TGACGGCCCC AACGCAGGGC TTCTGTATCT TCAATACTGC GGCAATTGGC GCATTGTACA AGGGGGAGGG CGTCGCTGTG GTAGACATCG ACGTACACCA CGGAAACGGC ACACAAGAAA TTCTCTACGA CAAGGACCTA CTCTACATCT CTACGCACCA GCACCCAGCC ACGCTGTACC CAGGCACGGG CTACCCAGAC GAAGTAGGCA CAAGCAAGGG GGAAGGCTTC AACGTCAATA TACCGCTCCC GCCTCTCACT GGAGACGACG CGTACGCCGA GGCAGTAGAG GAGGTTATAG AGCCAATCTT GCGGCAGTAC GACCCCAAGA TCATAATCGT GTCCCTGGGA TGGGACGCAC ACAGAGACGA CCCCTTGGCC AACATGGGGC TGACGATAAA CGGCTACCTA CGCGCAATTT CCACCATACT CAAACTCCAA AAGCCCACGA TCTTCTTACT CGAGGGAGGC TACAACAGAG ACGTGTTGAG GAGAGGCACT GCGGCGCTGA TAAGGCTGGT GGACAAGGGC GAAGAAGCCG CCGGAGAAGC GCCTTCAAAG ACCGACGCCA AGGTATACGA GAGGTTTAGG AGAGAGCTCG GCGAAGTAAA GAGGCACGTC TCAAAACACT GGCGCCTATA G
|
Protein sequence | MKVVFSPVFK LHKPPYSHPE APDRLDAVLK GAEEAGAAVQ APRAREDVWN LVSLAHERGY IEFVRRLCKE DYAQIDGDTY VSSGTCEAAA LAVSAMADAV DAGETVLLAV RPPGHHAGFV GRALTAPTQG FCIFNTAAIG ALYKGEGVAV VDIDVHHGNG TQEILYDKDL LYISTHQHPA TLYPGTGYPD EVGTSKGEGF NVNIPLPPLT GDDAYAEAVE EVIEPILRQY DPKIIIVSLG WDAHRDDPLA NMGLTINGYL RAISTILKLQ KPTIFLLEGG YNRDVLRRGT AALIRLVDKG EEAAGEAPSK TDAKVYERFR RELGEVKRHV SKHWRL
|
| |