Gene Information |
Locus tag | PICST_54032 |
Symbol | PRS4 |
ID | 4837156 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 613838 |
End bp | 615142 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 12 |
GC content | 46% |
IMG OID | 640388471 |
Product | 26S protease regulatory subunit 4 homolog (TAT-binding homolog 5) |
Protein accession | XP_001382353 |
Protein GI | 126131656 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1222] ATP-dependent 26S proteasome regulatory subunit |
TIGRFAM ID | [TIGR01242] 26S proteasome subunit P45 family |
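The coordinate and length fields in the table above can be cross-checked against one another. The short Python sketch below does so, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. Neither convention is stated in the record. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 613838
end_bp = 615142
gene_length_bp = 1305
protein_length_aa = 434

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (the record says so) and the gene
# length includes the stop codon, which encodes no residue.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates agree with length
print(remainder == 0)                                # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # codons minus stop = residues
```

Under these assumptions all three checks hold for this record: 1305 bp is 435 codons, one of which is the stop codon.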
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0833723 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGGAG GCGATGGCTT CAACAAAAAG AAAGACGACA AGAAAAAAGA AAAGCCCAAG TATGAACCTC CCGTAGAATC CAAGTTTGGC AAAAAGAGAA GAAAGGGACC CGACACCGCT GTGAAATTGC CTTCAGTATA TCCCAACACT AGATGCAAAT TGAAGCTCTT GAAGTTGGAA AGAATCAAAG ACCATTTGCT TTTGGAAGAA GAGTTTGTGA CCAATCAGGA AGCTTTCCAG CCTACTGAGG CTAAACAAGC CGAAGAAAGG GAGAAGGTAG ATGAACTTCG TGGCTATCCA ATGTCTATTG GGACTTTGGA AGAAATTATT GATGATGACC ATGCCATTGT TTCCAGCACA GCAGGATCTG AGTACTATGT ATCGATTATG TCGTTCGTAG ACAAGGGCTT GTTGGAGCCT GGCTGCTCTG TGCTTTTGCA CCACAAGACT GTATCTGTTG TGGGGGTCTT GCAAGACGAT GCTGATCCTA TGGTATCCGT GATGAAGTTG GACAAGAGTC CCACGGAGTC GTATGCCGAT ATCGGTGGTC TCGAATCCCA AATCCAGGAG ATCAAGGAGG CAGTAGAGTT GCCGTTAACC CACCCAGAGT TGTATGAAGA AATGGGTATA AAACCGCCTA AGGGTGTCAT TTTGTATGGT GCTCCGGGTA CGGGTAAGAC GTTGTTGGCG AAGGCTGTAG CTAACCAGAC CAGTGCGACG TTTTTGCGTA TTGTAGGCTC AGAGTTGATC CAGAAGTACT TGGGTGATGG TCCTAGATTG TGTAGACAAA TTTTCCAGAT CGCTGGGGAA CACGCTCCCT CCATCGTTTT CATTGATGAG ATCGATGCCA TTGGTACAAA GAGATACGAA TCGACATCAG GTGGGGAACG TGAAATCCAG AGAACAATGT TAGAGTTGTT GAACCAGCTA GACGGGTTTG ACGATAGAGG AGACATCAAG GTCATTATGG CCACCAACAA GATCGAGTCA TTGGATCCAG CGTTGATCAG ACCTGGAAGA ATTGACAGAA AGATCTTGTT TGAGAATCCC GATGCTAACA CGAAGAAGAA GATCTTAACC ATTCACACGT CGAAGATGTC CTTGGCTGAC GATGTCAACT TGGACGAGTT AGTTACTTCC AAGGACGACT TATCTGGAGC TGATATCAAG GCCATTTGTA CGGAAGCTGG TTTGTTGGCG TTGAGAGAAA GAAGAATGCA AGTCAAGGCT GACGACTTCA AGTCAGCCAA AGAGAGAGTA CTCAAGAATA AGGTGGAGGA GAACCTTGAG GGGTTGTACT TGTGA
|
Protein sequence | MPGGDGFNKK KDDKKKEKPK YEPPVESKFG KKRRKGPDTA VKLPSVYPNT RCKLKLLKLE RIKDHLLLEE EFVTNQEAFQ PTEAKQAEER EKVDELRGYP MSIGTLEEII DDDHAIVSST AGSEYYVSIM SFVDKGLLEP GCSVLLHHKT VSVVGVLQDD ADPMVSVMKL DKSPTESYAD IGGLESQIQE IKEAVELPLT HPELYEEMGI KPPKGVILYG APGTGKTLLA KAVANQTSAT FLRIVGSELI QKYLGDGPRL CRQIFQIAGE HAPSIVFIDE IDAIGTKRYE STSGGEREIQ RTMLELLNQL DGFDDRGDIK VIMATNKIES LDPALIRPGR IDRKILFENP DANTKKKILT IHTSKMSLAD DVNLDELVTS KDDLSGADIK AICTEAGLLA LRERRMQVKA DDFKSAKERV LKNKVEENLE GLYL
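The record lists translation table 12, the alternative yeast nuclear code, in which CTG (CUG) is read as serine rather than leucine. As a minimal sketch of how the protein sequence relates to the gene sequence, the Python below builds the standard codon table, applies that one reassignment, and translates the first and last blocks of the gene sequence shown above. The helper names (`codon_table_12`, `clean`, `translate`, `gc_fraction`) are hypothetical and not part of any database tool. Alternative start codons are not handled here.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table_12():
    """Standard code with the table 12 reassignment: CTG -> Ser."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD_AA))
    table["CTG"] = "S"
    return table


def clean(seq):
    """Remove the spaces between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    table = codon_table_12()
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = table[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 and last 15 bases of the gene sequence in this record.
gene_start = "ATGCCTGGAG GCGATGGCTT CAACAAAAAG"
gene_end = "GGGTTGTACT TGTGA"

print(translate(gene_start))  # MPGGDGFNKK
print(translate(gene_end))    # GLYL
```

The two translated fragments match the first ten and last four residues of the protein sequence. Passing the full 1305-base gene sequence to `gc_fraction` would give the value that the record rounds to 46%, assuming the database computes GC content over the same span.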