Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_78867 |
Symbol | |
ID | 4840265 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | - |
Start bp | 1446589 |
End bp | 1447812 |
Gene Length | 1224 bp |
Protein Length | 399 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640391580 |
Product | predicted protein |
Protein accession | XP_001385975 |
Protein GI | 150866394 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5187] 26S proteasome regulatory complex component, contains PCI domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.505457 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCTCG ATTCAGATGT GCCCCACATT CCAGACTACC GGTTGTCGGA AAAAGAGTTT CTTCTTTCCC AGACTGTTGA TGCTGGCGTG AGAGCTTCCA TTTTCCACGA TTTAACAGAG GCCATAACCA AAGATGATTT GGCACCATAC TATCTCCATT TACATACTGA GTATGAAGAT TTCCCGTATG ATGAAAAGGT TTACCAAGAA TTGGCAGCAA AAAACGAAGC CATTGCTGGC GACTTGAAGC TGAAGCTCAA GGAAGTTGAA GGTGAAGACG AAACAGAGTT GGACATCTTG GCTACGACGA TCCAATTGGC TGAACACTAC ACGCGTATTG TAGATAGAAC GAACGCATCT GAGACGTTGA AGAAGGCATT GGACTTGTCG CAAGGTACTG GCTCGAAGAT CGATCTTCTC TTGACTTTGA CGCGACTCGA ATTTTTCTTC AATGACTATG TGTTAGTCTC AAAGTATTTG GACCAGATCA AGACGTTGAT CGATAAGGGA GGAGACTGGG AACGTCGTAA CAGATTCAAG ACGTACCAGG GTATCTACTT ATTGGCGACA CGTAACTTTG CCGAAGCTGC CAAGTTGTTG ATCGACTCGT TGGCAACCTT TACTTCTACC GAGCTCTGTA GTTACGAGCA AGTAGCACAA TATGCGATTA CGGCTGGTGT TTTGTCGTTG GACAGAGTAG ACTTGAAGGA AAAGATCATC GATTCGCCCG AGATTCTTCT GATCTACTCG TCGGCTCCAG AGACGGAACC ATTGCTCAAC TTGACCAACT CCTTGTACAC ATGTCAGTAC AACTACTTTT TCCAGTACCT CTTGGAATCG TACGACAAGC TCCTTGTACC CAACAAGTAT TTGCACAAGC ACGCCAGCTA CTTCTTGCGT GAAATGCGGT GTAAAGCATA CGGCCAGTTA TTGGAGAGCT ACAAGTCTTT GTCGCTTAAG TCTATGGCCC AAAACTTCAA CATCTCGGAA GACTTCTTGG ACCAAGACTT GTGCAAGTTC ATTCCCAACA AGAAATTGAA TTGTACTATT GACAAGGTGA ACGGCATTAT TGAGACAAAT AGACCGGACA ATAAGAACAA CCAGTACCAT TTGTTGATCA AGCAGGGTGA TGGCTTGTTG ACGAAATTGC AGAAGTACGG TACAGCTGTG AAGTTGAGTG GGGCCGAGAG AGTAGCCTAG ATGTAATATA TGTTGATAAA TGGT
|
Protein sequence | MDLDSDVPHI PDYRLSEKEF LLSQTVDAGV RASIFHDLTE AITKDDLAPY YLHLHTEYED FPYDEKVYQE LAAKNEAIAG DLKSKLKEVE GEDETELDIL ATTIQLAEHY TRIVDRTNAS ETLKKALDLS QGTGSKIDLL LTLTRLEFFF NDYVLVSKYL DQIKTLIDKG GDWERRNRFK TYQGIYLLAT RNFAEAAKLL IDSLATFTST ELCSYEQVAQ YAITAGVLSL DRVDLKEKII DSPEILSIYS SAPETEPLLN LTNSLYTCQY NYFFQYLLES YDKLLVPNKY LHKHASYFLR EMRCKAYGQL LESYKSLSLK SMAQNFNISE DFLDQDLCKF IPNKKLNCTI DKVNGIIETN RPDNKNNQYH LLIKQGDGLL TKLQKYGTAV KLSGAERVA
|
| |