Gene PICST_50903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_50903 
Symbol 
ID4841080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp780319 
End bp781977 
Gene Length1659 bp 
Protein Length553 aa 
Translation table12 
GC content39% 
IMG OID640392395 
Productpredicted protein 
Protein accessionXP_001386562 
Protein GI150866834 
COG category[B] Chromatin structure and dynamics 
COG ID[COG5027] Histone acetyltransferase (MYST family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAGA AGACCCTTAT TCTCAATCTC AAGTCATCTT CAAAGCAAGC GACGTCTCGC 
AAACTGAACG ATTTGGATAC ACGAGAATTG CCGTACAGAG GTATATTTCC ATATCCTGAC
TGCACCATCA ACGATACTGA TCCTACCAAA GAGGATCGAG AACTTTTTGA AAAGTTGGCT
GAAGAAGGCA ATGCTCTTCG TTTAAAAGAT ACAAACCAGT TGACCCAGCA AAAGGATGAA
ACTCCAACCA TTAACGATAG AAGTGTAGAA TCAACTCCTA CGCCTGCCTC AATGCCCAAT
TTGCTCAAGT CTCAAATCGA GAAAATTGTT TTCCGCAACT ACGAAATCAA CACCTGGTAC
ACAGCTCCAT ACCCAGAAGA ATACTCTCAA TCCAAAGTAT TATTCATCTG TGAGCATTGC
TTAAAGTATA TGAACTCACC CATGTCCTAC AAGAGACACC AGCTCAAGAA CTGTAACTTT
TCAAATAACC ATCCTCCAGG AGTAGAAATA TATCGTGATT TGGCTACACG GATCCTGATC
TGGGAAGTAG ATGGCCGCAA GAATATTAAC TATTGCCAAA ACTTATGTCT TCTTGCAAAG
CTATTTCTCA ATTCAAAAAC TTTGTATTAC GATGTGGAGC CATTCATATT TTACATATTG
ACAGAAATTG ACGAATTCAA CCTCTCAAAA TACCATTTTG TCGGATACTT TTCCAAAGAG
AAGTTAAATA ATTCCGACTA CAACGTCTCC TGTATTCTCA CGTTGCCTAT TTATCAGAGA
AAAGGTTATG GTAACCTATT GATCGATTTC TCGTATTTGT TGAGCAGACA AGAATTCAAG
TATGGTACAC CAGAAAAGCC ATTGAGTGAT TTAGGTTTAC TCAGTTATAG AAACTACTGG
AGAGTAACCA TCGCCTATAA ACTTAGAGAA CTCTACACAG CATTTGGGTC AGAAGAAGAA
TCCACAACTC CTTCCTCCAT CATTTCCCAC ACAACAATAT CAGTAGATAT ACTTTGTAAA
CTAACTGGTA TGACTTCTTC AGATGTGGTA GTTGGTTTAG AACAGTTAGA TGCATTGATC
AAAAACCCGT CCACCAACAC ATATGCAATT GTTCTTAACT TGAAGAAAAT AAACTATGAA
ATCGCCAGAT GGGAGAAGAA AAACTATACT AAACTCAATT ACTCCAAGCT TCTTTGGAAG
CCAATGCTTT TCGGGCCCTC TGGGGGGATT AATTCAGCAC CAGCGTTTGT AGCTCCTCTT
GCTGCTGGTC ACAATAGTGT CTCTCTGATA GTTGGTTTCT TGAAAGATGA CATCAATAAC
CCATATTCAT ATGAAGAAGA AGCATATAAG GAGATAGAAA TGAGAAGAGA AGTTAGTCTT
CTGAAATCGG ACGACAATGA CAACGCAGAT GATCAAGAAG ATCCCGACGA AGATTTAGAT
AACTATTTGA TATGTTATCC AGGAATACAG TACAGCACCA AGAAGAAACC AATAAAGCTG
TCTACTGAGA TCAAGCAAGT TTCTTTTGTA GACCTCAACA ATCTACTGGA CGAGTTCCCT
GAAATATTCG AAGATGACGA ACCTGCCAGT AGCTCCAGTG AGTCGGAAGA CTATGTGGAA
GCATCAGAAG TGGAAGATGT TGACGAAGAA GAAGAGGAG
 
Protein sequence
MKKKTLILNL KSSSKQATSR KSNDLDTREL PYRGIFPYPD CTINDTDPTK EDRELFEKLA 
EEGNALRLKD TNQLTQQKDE TPTINDRSVE STPTPASMPN LLKSQIEKIV FRNYEINTWY
TAPYPEEYSQ SKVLFICEHC LKYMNSPMSY KRHQLKNCNF SNNHPPGVEI YRDLATRISI
WEVDGRKNIN YCQNLCLLAK LFLNSKTLYY DVEPFIFYIL TEIDEFNLSK YHFVGYFSKE
KLNNSDYNVS CILTLPIYQR KGYGNLLIDF SYLLSRQEFK YGTPEKPLSD LGLLSYRNYW
RVTIAYKLRE LYTAFGSEEE STTPSSIISH TTISVDILCK LTGMTSSDVV VGLEQLDALI
KNPSTNTYAI VLNLKKINYE IARWEKKNYT KLNYSKLLWK PMLFGPSGGI NSAPAFVAPL
AAGHNSVSSI VGFLKDDINN PYSYEEEAYK EIEMRREVSL SKSDDNDNAD DQEDPDEDLD
NYLICYPGIQ YSTKKKPIKS STEIKQVSFV DLNNLSDEFP EIFEDDEPAS SSSESEDYVE
ASEVEDVDEE EEE