Gene PICST_40594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_40594 
SymbolSIR3 
ID4836940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp288233 
End bp289408 
Gene Length1176 bp 
Protein Length391 aa 
Translation table12 
GC content47% 
IMG OID640388255 
ProductNAD-dependent histone deacetylase SIR2 (Regulatory protein SIR2) (Silent information regulator 2) 
Protein accessionXP_001382290 
Protein GI150863725 
COG category[K] Transcription 
COG ID[COG0846] NAD-dependent protein deacetylases, SIR2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.326026 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATCTTGGCTT ATCAGCCGGA TCCAGAACTC ATGCAATATT ATCGTAACAC CCTCGTCAAC 
TTTGGACTCA TGAAATTCTT GAAGGACTTT GTTCCCGAGA GAATCACCAA AATAGACCTC
TGTAGATTAG TACTCAACTT GGGCTACCCC AAGGAATCCA TTAGTAACCA ACGATTCTTA
ACTACCAAGC AAGTTGCCGG GATTCTCGTA CTGCTCATAC TTAACGATCC GCAGGTAAAC
CAGAGCCAGA GCTACTCCGA CTTCGCTGCG TACCACCATC CCACTTCAGA CTACACAATT
CCTAAGTTAC TCCTGGACTT GACGCTGGCC AAAAAGATCA TGGTGATATC CGGAGCAGGA
ATTCTGACGT CGTTGGGTAT ACCAGATTTC AGATCGTTCA AAGGGTTGTA TGCGCAATTG
GAACATCTCA ATTTGAAGGA TCCTCAAAAA GTATTTGACA TGGGTGCGTT CCAGAAGGAC
CCCAGCATCT TTTATTCAAT CGCCCATTTG GTACTTCCAC CAGAGGGAAG ATTCTCTATG
CTCCATTCCT TTATAAAATT GCTCCAGGAC AAGGGCAAGT TGCTCCGGAA CTATACACAG
AACATTGATA ACCTTGAGTC CAGAGTCGGG ATCCACCCCG ATAAACTTAT TCAATGCCAC
GGCTCGTTTG GTTCAGCCAG CTGCTTAACT TGTAGCAACA GATTCGCTGG CCACAAGATC
TTTGAGCATA TCAGGCATCA GCATGTTCCA CGGTGTTCCA CCTGTTGGAA AACGATCCAA
GAGGCAGTAA TTATCCATGG GGTTATTAAG CCCGACATCA CGTTTTTTGG TGAGGACTTG
CCAAAGAAGT TCTACCGTTT GCTTGAGCCG GACTGCCAAA CCTGTGACTT GGTCATCGTG
GTGGGGACCT CCTTGAAAGT AGAGCCAGTG TCCAGCATAA TCGACAAGAT TCCCAGAAGC
GTCCCCAGAG TGCTCATCAA CAAAGACCCC ATTCCAGATA GGGACTTTGA CCTCAGCTTG
ATAGGTTTGT GTGATGACGT CGTATGTCAT CTTACTAGAG AGTTGGGAGC TAGCTGGGAC
ATTCCTCACC CCAACTTTGT CCCTTCCACT GAGTTTACTG TCACACCTCA TGAGCTCTAC
AGCAAGAGCT ACAACATTGT CAAGAAGGAA ACATAG
 
Protein sequence
ILAYQPDPEL MQYYRNTLVN FGLMKFLKDF VPERITKIDL CRLVLNLGYP KESISNQRFL 
TTKQVAGILV SLILNDPQVN QSQSYSDFAA YHHPTSDYTI PKLLSDLTSA KKIMVISGAG
ISTSLGIPDF RSFKGLYAQL EHLNLKDPQK VFDMGAFQKD PSIFYSIAHL VLPPEGRFSM
LHSFIKLLQD KGKLLRNYTQ NIDNLESRVG IHPDKLIQCH GSFGSASCLT CSNRFAGHKI
FEHIRHQHVP RCSTCWKTIQ EAVIIHGVIK PDITFFGEDL PKKFYRLLEP DCQTCDLVIV
VGTSLKVEPV SSIIDKIPRS VPRVLINKDP IPDRDFDLSL IGLCDDVVCH LTRELGASWD
IPHPNFVPST EFTVTPHELY SKSYNIVKKE T