Gene PICST_54032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_54032 
SymbolPRS4 
ID4837156 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp613838 
End bp615142 
Gene Length1305 bp 
Protein Length434 aa 
Translation table12 
GC content46% 
IMG OID640388471 
Product26S protease regulatory subunit 4 homolog (TAT-binding homolog 5) 
Protein accessionXP_001382353 
Protein GI126131656 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1222] ATP-dependent 26S proteasome regulatory subunit 
TIGRFAM ID[TIGR01242] 26S proteasome subunit P45 family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0833723 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGGAG GCGATGGCTT CAACAAAAAG AAAGACGACA AGAAAAAAGA AAAGCCCAAG 
TATGAACCTC CCGTAGAATC CAAGTTTGGC AAAAAGAGAA GAAAGGGACC CGACACCGCT
GTGAAATTGC CTTCAGTATA TCCCAACACT AGATGCAAAT TGAAGCTCTT GAAGTTGGAA
AGAATCAAAG ACCATTTGCT TTTGGAAGAA GAGTTTGTGA CCAATCAGGA AGCTTTCCAG
CCTACTGAGG CTAAACAAGC CGAAGAAAGG GAGAAGGTAG ATGAACTTCG TGGCTATCCA
ATGTCTATTG GGACTTTGGA AGAAATTATT GATGATGACC ATGCCATTGT TTCCAGCACA
GCAGGATCTG AGTACTATGT ATCGATTATG TCGTTCGTAG ACAAGGGCTT GTTGGAGCCT
GGCTGCTCTG TGCTTTTGCA CCACAAGACT GTATCTGTTG TGGGGGTCTT GCAAGACGAT
GCTGATCCTA TGGTATCCGT GATGAAGTTG GACAAGAGTC CCACGGAGTC GTATGCCGAT
ATCGGTGGTC TCGAATCCCA AATCCAGGAG ATCAAGGAGG CAGTAGAGTT GCCGTTAACC
CACCCAGAGT TGTATGAAGA AATGGGTATA AAACCGCCTA AGGGTGTCAT TTTGTATGGT
GCTCCGGGTA CGGGTAAGAC GTTGTTGGCG AAGGCTGTAG CTAACCAGAC CAGTGCGACG
TTTTTGCGTA TTGTAGGCTC AGAGTTGATC CAGAAGTACT TGGGTGATGG TCCTAGATTG
TGTAGACAAA TTTTCCAGAT CGCTGGGGAA CACGCTCCCT CCATCGTTTT CATTGATGAG
ATCGATGCCA TTGGTACAAA GAGATACGAA TCGACATCAG GTGGGGAACG TGAAATCCAG
AGAACAATGT TAGAGTTGTT GAACCAGCTA GACGGGTTTG ACGATAGAGG AGACATCAAG
GTCATTATGG CCACCAACAA GATCGAGTCA TTGGATCCAG CGTTGATCAG ACCTGGAAGA
ATTGACAGAA AGATCTTGTT TGAGAATCCC GATGCTAACA CGAAGAAGAA GATCTTAACC
ATTCACACGT CGAAGATGTC CTTGGCTGAC GATGTCAACT TGGACGAGTT AGTTACTTCC
AAGGACGACT TATCTGGAGC TGATATCAAG GCCATTTGTA CGGAAGCTGG TTTGTTGGCG
TTGAGAGAAA GAAGAATGCA AGTCAAGGCT GACGACTTCA AGTCAGCCAA AGAGAGAGTA
CTCAAGAATA AGGTGGAGGA GAACCTTGAG GGGTTGTACT TGTGA
 
Protein sequence
MPGGDGFNKK KDDKKKEKPK YEPPVESKFG KKRRKGPDTA VKLPSVYPNT RCKLKLLKLE 
RIKDHLLLEE EFVTNQEAFQ PTEAKQAEER EKVDELRGYP MSIGTLEEII DDDHAIVSST
AGSEYYVSIM SFVDKGLLEP GCSVLLHHKT VSVVGVLQDD ADPMVSVMKL DKSPTESYAD
IGGLESQIQE IKEAVELPLT HPELYEEMGI KPPKGVILYG APGTGKTLLA KAVANQTSAT
FLRIVGSELI QKYLGDGPRL CRQIFQIAGE HAPSIVFIDE IDAIGTKRYE STSGGEREIQ
RTMLELLNQL DGFDDRGDIK VIMATNKIES LDPALIRPGR IDRKILFENP DANTKKKILT
IHTSKMSLAD DVNLDELVTS KDDLSGADIK AICTEAGLLA LRERRMQVKA DDFKSAKERV
LKNKVEENLE GLYL