Gene PICST_42410 details


Gene Information

Locus tag: PICST_42410
Symbol:
ID: 4837015
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 2020258
End bp: 2021934
Gene Length: 1677 bp
Protein Length: 558 aa
Translation table: 12
GC content: 45%
IMG OID: 640388330
Product: predicted protein
Protein accession: XP_001382617
Protein GI: 150863958
COG category: [B] Chromatin structure and dynamics; [K] Transcription
COG ID: [COG1243] Histone acetyltransferase
TIGRFAM ID: [TIGR01211] histone acetyltransferase, ELP3 family
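
The gene and protein lengths in this record follow from the coordinates. The sketch below reproduces that arithmetic, assuming the start and end positions are 1-based and inclusive, and that the last codon of the unspliced CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 2020258
end_bp = 2021934

# Inclusive coordinates: both end positions count toward the length.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS is a whole number of codons; the final codon is the stop.
assert gene_length_bp % 3 == 0
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```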


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTTCTG TTCAGAGCAC CAAGGGCGGA AAACAGAAGT TGGCTCCTGA AAAGGAGCGA 
TTTTTGCAAT GTTGCGGAGA CATTTCGCTC GAGCTTGTAG CTTCCCTCAA GAACTCCAAA
GACATCAACT TGAACGGCTT GATCATCAGG TATGCGAAAA AGTACAAGTT GAAGCAGCAG
CCCAGACTAA CGGATATCAT CTCGTCCATC CCAGACCAGT ACAAGAAGTA CTTAATTCCA
AAGCTCAAGG CCAAACCGGT CCGTACCGCA TCTGGTATTG CGGTTGTAGC AGTCATGTGT
AAACCTCACA GATGTCCCCA TATAGCCTAC ACGGGAAACA TCTGTGTATA TTGTCCAGGG
GGGCCAGATT CAGACTTTGA ATACTCGACC CAGTCATATA CCGGGTATGA GCCGACTTCA
ATGAGAGCCA TTCGGGCTAG ATATGATCCT TACGAACAGG CTCGAGGCAG ACTAGAGCAG
TTGAGACTGT TGGGCCATTC CATAGACAAA GTTGAGTACA TTATCATGGG TGGAACATTC
ATGTCACTTC CCATCGATTA CAGAGAAGGC TTCATCACCC AGTTACACAA CGCATTAACA
GGTTATAACG GTAAGGACAT TGACGAAGCC ATCAAATATT CCCAACAATC ACAGACTAAG
TGTGTGGGAA TAACCATTGA AACTAGACCC GATTACTGTA CTGAAACCCA TTTGAGCGAC
ATGTTGAAGT ACGGATGTAC CAGATTGGAA ATCGGGGTAC AGTCGGTATA TGAGGATGTA
GCAAGAGACA CGAATAGAGG ACATACGGTT AAGGCTGTCT GTGAAACCTT TGCTGTAGCC
AAAGATGCTG GGTACAAGGT GGTGAGTCAT ATGATGCCTG ACTTGCCCAA TGTAGGCATG
GAAAGAGACT TGGAACAATT TAAGGAATAC TTTGAGAATC CCGAGTTCAG AACTGACGGC
TTGAAGTTGT ACCCCACATT GGTCATTAGA GGCACTGGAT TGTACGAGTT GTGGAAGAAA
GGGTTATATA AGTCATACAA TGCGAATGCC TTGATAGACT TGGTGGCTCG TATCATGGCC
ATGGTACCTC CATGGACACG TATCTATCGT GTGCAAAGAG ATATCCCTAT GCCGTTAGTC
ACGTCGGGTG TAGAAAACGG AAACTTGAGA GAATTGGCTC TTGCCAGAAT GAAAGACTTT
GGCACCACCT GTAGAGACGT ACGTACAAGA GAAGTCGGAA TCCAAGAAGT TCATCACAAA
GTTGTACCAG ACCACGTGGA ATTGATTAGA AGAGATTACT ATGCCAATGG AGGCTGGGAA
ACTTTTTTGT CGTACGAAGA CCCAAAGAAG GATATTTTGA TTGGCTTGTT GAGATTGCGT
AAGGCTTCTA AGAAGTACAC ATACAGAAAG GAATTCACCA ACCAACCTAC CTCTATCATC
AGAGAATTGC ATGTCTACGG TTCTGTTGTG CCCTTGCACT CCAGAGACCC TAGAAAGTTC
CAGCATCAAG GGTTTGGTAC CTTGTTAATG GAAGAAGCTG CCAGAATCGC CAAGGAAGAA
CATGGTTCTG AAAAGATCTC GGTCATTTCG GGTGTAGGTG TAAGAAACTA CTACGCAAAA
CTTGGCTACC ATTTGGATGG TCCATATATG TCTAAATGGC TTAACGACGA GGAATAG
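
The GC content field can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases, and that the blocks of ten bases are separated only by whitespace. The function name `gc_content` is hypothetical and not part of the database.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (the spacing between 10-base blocks and line breaks)
    is ignored; case is normalized to upper case.
    """
    bases = "".join(dna.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)
```

Passing the full gene sequence above to `gc_content` and rounding to a whole percentage should give a value comparable to the GC content field.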
 
Protein sequence
MPSVQSTKGG KQKLAPEKER FLQCCGDISL ELVASLKNSK DINLNGLIIR YAKKYKLKQQ 
PRLTDIISSI PDQYKKYLIP KLKAKPVRTA SGIAVVAVMC KPHRCPHIAY TGNICVYCPG
GPDSDFEYST QSYTGYEPTS MRAIRARYDP YEQARGRLEQ LRSLGHSIDK VEYIIMGGTF
MSLPIDYREG FITQLHNALT GYNGKDIDEA IKYSQQSQTK CVGITIETRP DYCTETHLSD
MLKYGCTRLE IGVQSVYEDV ARDTNRGHTV KAVCETFAVA KDAGYKVVSH MMPDLPNVGM
ERDLEQFKEY FENPEFRTDG LKLYPTLVIR GTGLYELWKK GLYKSYNANA LIDLVARIMA
MVPPWTRIYR VQRDIPMPLV TSGVENGNLR ELALARMKDF GTTCRDVRTR EVGIQEVHHK
VVPDHVELIR RDYYANGGWE TFLSYEDPKK DILIGLLRLR KASKKYTYRK EFTNQPTSII
RELHVYGSVV PLHSRDPRKF QHQGFGTLLM EEAARIAKEE HGSEKISVIS GVGVRNYYAK
LGYHLDGPYM SKWLNDEE
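
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below shows how the gene sequence maps to the protein sequence under that table. It assumes the sequence starts in frame at its first base; the names `TABLE_12` and `translate_table12` are hypothetical helpers, not part of any library.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order, one 16-codon row per first base.
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODONS = ["".join(c) for c in product(BASES, repeat=3)]
TABLE_12 = dict(zip(CODONS, STANDARD_AAS))
TABLE_12["CTG"] = "S"  # the single difference from the standard code


def translate_table12(dna: str) -> str:
    """Translate DNA with table 12, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        aa = TABLE_12[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the gene sequence in this record.
print(translate_table12("ATGCCTTCTG TTCAGAGCAC CAAGGGCGGA"))
```

The effect of table 12 is visible in this gene: the in-frame fragment `TTGAGACTGTTGGGCCATTCC` translates to `LRSLGHS`, matching the listed protein sequence, whereas the standard code would give leucine at the third position.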