Gene PICST_32025 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_32025 
SymbolHST2 
ID4839128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp504939 
End bp505919 
Gene Length981 bp 
Protein Length326 aa 
Translation table12 
GC content41% 
IMG OID640390443 
Productputative histone deacetylase-like protein 
Protein accessionXP_001384758 
Protein GI126136469 
COG category[K] Transcription 
COG ID[COG0846] NAD-dependent protein deacetylases, SIR2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.822239 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.176883 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGCGG ACCTTGAAAA GAAGCTCCGA CCGCTCGTAG ATGCCATCCA ATCGGGCAAA 
AAGATCACGT TTTTCAATGG TGCTGGAGTT TCTACCAGTG CAGGGATTCC AGATTTCAGA
TCTCCGAAGA CAGGACTCTA TGCCAATTTG GCAAAACTCG ATTTGCCGTA TGCGGAGGCA
GTTTTTGATA TCGACTACTT TAGAGAGAAT CCAAAGGCTT TTTATACTTT GACCCAAGAA
TTGTATCCTG GCAAATTTGC TCCCACCAAA TTCCACTACT TCGTAAAATT GGTTCAGGAC
AAGAAGTTGT TGAAAAGAGT CTACACTCAG AATATTGATA CATTGGAAAG ACTAGCAGGT
GTAGAGGATG AATACATTGT AGAAGCGCAT GGGTCTTTCG CGCGCAATCA TTGCATAGAC
TGTTCAGAAG AGATGTCTAC CGAGACGTTG ATAGAGCACA TGAACAATAA GGATAAGAAC
GAAGGTATTC CTACATGTTC AGCATGTAAG GGATATGTGA AACCAGATAT TGTTTTCTTT
GGCGAAGGCT TGCCTTCAAG GTTTTTTGAC TTATGGGATG AAGACTCCGA CGAAGTAGAA
GTAGCATTAG TAGCTGGAAC GTCTTTGACT GTATACCCAT TTGCTTCTTT ACCTGCAGAA
GTAGGCAAGA AAACGTTGAG AGTTTTGGTT AACAAGGAGA ACGTAGGTGA CTTCAAAGCT
GGTAAAAGAC GGTCAGATTT AGTACTTCTA CACGATTGCG ACTATGTAGC TGAAAAGTTG
TGTGAACTCT TGAACTGGAA GGATGAACTT GATGCCTACA TTGAAGAGGC TACGAAGAAG
TATTCCAAGA ATAAGGAAAC AGCGGCAGAA CTAGCTGAAG AGATCACCGA GGAAATAAAG
GAGGTAATAG AAAAGATGTC TCCGGCAGCA GAAACAAAAG AAGAAGAATT AGAAGATCAA
ATCAGCAAAT TGAAGATTTA A
 
Protein sequence
MSADLEKKLR PLVDAIQSGK KITFFNGAGV STSAGIPDFR SPKTGLYANL AKLDLPYAEA 
VFDIDYFREN PKAFYTLTQE LYPGKFAPTK FHYFVKLVQD KKLLKRVYTQ NIDTLERLAG
VEDEYIVEAH GSFARNHCID CSEEMSTETL IEHMNNKDKN EGIPTCSACK GYVKPDIVFF
GEGLPSRFFD LWDEDSDEVE VALVAGTSLT VYPFASLPAE VGKKTLRVLV NKENVGDFKA
GKRRSDLVLL HDCDYVAEKL CELLNWKDEL DAYIEEATKK YSKNKETAAE LAEEITEEIK
EVIEKMSPAA ETKEEELEDQ ISKLKI