Gene PICST_65073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_65073 
SymbolPUP1 
ID4851978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp3345720 
End bp3346852 
Gene Length1133 bp 
Protein Length267 aa 
Translation table 
GC content42% 
IMG OID640393686 
Product20S proteasome, regulatory subunit beta type PSMB7/PSMB10/PUP1 
Protein accessionXP_001386964 
Protein GI126276195 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0638] 20S proteasome, alpha and beta subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.197238 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AGAACACAAG CCTATTGGTT GAAACCGGTC AATTCTAACC TCTACTTATC GCTGATTCAC 
ATAGAAAAGA CTGCAATAAC TTCCAATAGA GTCAGACGGA TCAATACATC AATTTCGTTG
ACATTGAATT AATCATTGAA TATCACCGAA TCCCGATCAA CCTCGTTTAA ACCTCATATT
AATCACAAAT TGTATTGAAT TGAGAGTAGA TTTTTCATTC ACGACTTCAA ACCATTACAA
ATTCAACCGT TGAGCTCATA TAGAAGTCAA CAATTGAAAC TTAAAACAAT AGATTTAATC
AAATCATTAA CAAATTATCA TAACATACAA TGCCTGGCTT GAACTTCGAC AACTACCAGA
GAAACTCGTA TCTCACCACT AAGGGTTACG GAACTCCCAA GGCTACCTCT ACTGGTACAA
CTATTGTAGG CTGTAAGTTT AAAGGAGGGG TGGTGATTGC TGCTGATACT CGTGCTACGG
CCGGAAGCAT CGTGGCCGAT AAGAACTGTG AGAAATTACA TAGACTAGCA CCCAAGATCT
GGTGTGCTGG TGCCGGTACA GCCGCTGATA CTGAGATGGT AACTCAATTG ATAGCTTCAA
ACTTGGAGTT GCACGGACTT TACCAGAATA GGCAACCCCG AGTCATCACC GCTTTAACGA
TGTTAAAGCA ACACTTGTTC AAGTACCAGG GCCATTTGGG TGCCTATTTG ATTGTAGCTG
GTGTAGATCC AACTGGCGCT CATTTGTTGT CGGTACAAGC TCACGGTTCT ACCGATATCG
GCAAGTACCA GTCGTTGGGT TCTGGTTCGT TGGCAGCCAT GGCTGTATTG GAAACTAATT
TCAAGGAAGA CATGACCAAG GAAGAGGCCA TCAAGTTATG TGCAGATGCT ATTGAGCTGG
GTATCTGGAA TGATTTGGGT TCCGGTTCGA ATGTAGACAT ATGTGTGATG GAAGTAGGCA
AAGATGCTGA ATTGTACAGA AACTACTTGA CTCCAAATGT CAGATCAGAG AAGGCAAGAT
CGTACAAGTT TGCTAGAGGA TCTACTGCTG TGTTGAGAGA AACTGTACGT GATATTTTGG
ATGTAGAGGA AACGGTTGTC ACATTTGGTG ATGCTATGGA GGTGGATGCA TAG
 
Protein sequence
MPGLNFDNYQ RNSYLTTKGY GTPKATSTGT TIVGCKFKGG VVIAADTRAT AGSIVADKNC 
EKLHRLAPKI WCAGAGTAAD TEMVTQLIAS NLELHGLYQN RQPRVITALT MLKQHLFKYQ
GHLGAYLIVA GVDPTGAHLL SVQAHGSTDI GKYQSLGSGS LAAMAVLETN FKEDMTKEEA
IKLCADAIEL GIWNDLGSGS NVDICVMEVG KDAELYRNYL TPNVRSEKAR SYKFARGSTA
VLRETVRDIL DVEETVVTFG DAMEVDA