Gene Information |
Locus tag | PICST_89662 |
Symbol | PRE4 |
ID | 4839356 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 571744 |
End bp | 572671 |
Gene Length | 928 bp |
Protein Length | 281 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640390671 |
Product | B-type subunit of proteasome |
Protein accession | XP_001384774 |
Protein GI | 126136501 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0638] 20S proteasome, alpha and beta subunits |
TIGRFAM ID | |
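The coordinates and lengths above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the reported gene length) and that the coding region ends with a single stop codon. The function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 571744
END_BP = 572671
PROTEIN_LENGTH_AA = 281


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon."""
    return 3 * (protein_aa + 1)


gene_length = span_length(START_BP, END_BP)
cds_length = coding_length(PROTEIN_LENGTH_AA)
trailing_bases = gene_length - cds_length

print(gene_length)     # 928
print(cds_length)      # 846
print(trailing_bases)  # 82
```

Under these assumptions the 928 bp span is 82 bp longer than the 846 bp needed for 281 codons plus a stop codon, which suggests that the reported gene region extends beyond the stop codon.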
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.596472 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAACCACG ATCCATTCCA ATGGGGAAGA CCCAGCAACG AAACATACGG TGCCTACAAC CATCATATTG CCAATGCATC TGTTCGTGAG ACTGATCACC AGAGCTTGGA CAACTTTCCA AAGATGAACA CACAACAGCC TATCATCACT GGTACGTCTG TGATCTCCGT CAAGTTCAAG GATGGGGTCA TATTGGCTGC TGACAACTTG GGCTCATATG GATCCTTGTT GAGATTCAAC AATATCGAAA GATTGATAAA AGTTGGCCAG GAAACAGTTG TTGGCATTTC TGGTGACATC TCGGACTTGC AACAGATCGA GAGAATATTG GACGAGTTGG AAACAACGGA AGAAGTTTAC GACAGCGATG GGGGCCACAA CTTGAGAGCT CCTCACGTAC ACGAGTACTT ATCGAGAGTA TTATACAACA GGAGATCCAA AATGAATCCC TTGTGGAATG CCATTATTGT GGGAGGCTTC AACGATGACA GAACTCCTTT CTTGAAATAT ATTGACTTGT TAGGAGTTAC CTACGGCGCT TCAACGTTGG CTACAGGATT CGGTTCTCAT TTGGCGGTGC CTTTGTTGAG ACAATTGATT CCGAATGATG TAGACTATGT AAACGTGTCG GAAGAACAAG CAAGAAAAGT GGTAGAAGAC TGTATGAGAG TGTTGTTCTA CCGTGACGCC AGGGCCTCTG ACAAGTTCTC GTTGGTGACA ATTAAGAGGT CTGCTGAAGA GCCAGATACA TCTTACACAT TCAACTTCGA GAAGGAGTTG AAGGTAGAGA ACCAGAGCTG GAGATTCGCA AAGGACATCA GAGGTTACGG CAGTCAACAG CAGTAGAGAC CATAGAAGTA GGTACCACTA AATATGTCAT CTGTATTACC CATATAGTGT CAGTTATCAA TCAATAATGA TTATCATC
Protein sequence | MNHDPFQWGR PSNETYGAYN HHIANASVRE TDHQSLDNFP KMNTQQPIIT GTSVISVKFK DGVILAADNL GSYGSLLRFN NIERLIKVGQ ETVVGISGDI SDLQQIERIL DELETTEEVY DSDGGHNLRA PHVHEYLSRV LYNRRSKMNP LWNAIIVGGF NDDRTPFLKY IDLLGVTYGA STLATGFGSH LAVPLLRQLI PNDVDYVNVS EEQARKVVED CMRVLFYRDA RASDKFSLVT IKRSAEEPDT SYTFNFEKEL KVENQSWRFA KDIRGYGSQQ Q
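The record lists translation table 12, the alternative yeast nuclear code, which differs from the standard code in reading CTG as serine rather than leucine. The sketch below illustrates how the displayed gene sequence could be translated under that table. It is an illustration only, not the database's own pipeline: the helper names (`codon_map`, `clean`, `translate`) are hypothetical, the genetic code strings follow the usual T, C, A, G codon ordering, and only unambiguous bases are handled.

```python
# Genetic code strings, with codons ordered by bases T, C, A, G.
BASES = "TCAG"
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Table 12 (alternative yeast nuclear code): same as standard except CTG -> Ser.
TABLE_12 = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_map(amino_acids: str) -> dict[str, str]:
    """Map each of the 64 codons to its one-letter amino acid."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, amino_acids))


def clean(seq: str) -> str:
    """Remove the spacing used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str, amino_acids: str = TABLE_12) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N.
    """
    table = codon_map(amino_acids)
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence above.
first_blocks = "ATGAACCACG ATCCATTCCA ATGGGGAAGA"
print(translate(first_blocks))                        # MNHDPFQWGR
print(translate("CTG"), translate("CTG", STANDARD))   # S L
```

The first ten residues produced this way match the start of the listed protein sequence. Passing the full gene sequence to `translate` would be the natural next step for checking the record against its 281 aa protein.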