Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_81390 |
Symbol | |
ID | 4837477 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 1768781 |
End bp | 1771977 |
Gene Length | 3197 bp |
Protein Length | 955 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640388792 |
Product | predicted protein |
Protein accession | XP_001383108 |
Protein GI | 150864336 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5116] 26S proteasome regulatory complex component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCGAAGTGTC TGATTAGTAG ACTAGTGTTG GGAATTGAAT CTTTATCGCT GATTGTAACC ATCCAGAGCC TTCAGACTAT TGCTATCGAG TCAGAGAAAG TTAAAAATCA TTTTCTTATC CTGAGTCATT TACATTCTTC ATATCTCTCG GTGATTGCAT TCAACGATTT ACACAACATC CACCACATCT CTATAAATCA TTAATAAAAT GGCTTTGGTA TCTGCTGCTC CGTACTTGGC TCTTTTAGTT GAGCAAGATG ACAGCTTGAA GTCATATGCC TTACAATCGT TGAACAACGT TGTTGACCAG TTGTGGGCTG AGATCGCTAA CAACATTACA GACTTGGAAG AACTCTACGA AAACGAAAAC TTTGTCAGCA GATCGTTGGC TGCACTTATT GTCTCAAAAG TCTACTACAA TTTGGGCGAT TTTGAAGCTT CCGTCAAGTA CTCGTTATTT GCTGGTGATG AGTTCAACAT TGAAGAACAG TCTCAGTATA TTGAAACCAT TGTTTCGCAA TGTATCAATC TCTACAACTC ATTATCGCAG AAGAAGTTTT CTGATGACTC CGTAGAGATC GACACTCGCT TGGCTGCAGT CTTCAACAAG ATGTTGGAAA AGTGTATCTC TGCCAACGAG TTGAAGTTGG CCCTTGGTAT CTCATTGGAA AGCTTCAGAT TGGATATTGT AGAAGACATA TTGAAACAGC AAATCAAGAG CAACGAAGAA AACGCATTGA ACTTAATAAA CTACGTGTTG GTGTGTTCTA ACACTGTTAT TAACAATACG ACTTTCAGAA CGAAAGTGTT GAATTCGTTG ATCCAGTTGT TGATGACTTT GTCCAACAAT CACGATTTCT TCACCGTGAT CAAGATCATT GTCCAGTTGA ATGACTCAAC TTTGGCTATA GAACTTTTTA AAGAATTGGT AGACAAGAAG GAAGACTTGA TTGCCTACCA GGCTTCGTTT GACTTGGTAA ATACTGCTTC GCAGGAATTG TTGGATAATG TGATCAATGT CTTAAGTAGC GACAAGACGC TTGATCAGAC CAATGCCATT CTCAAGAAGA TCTTGACGAT CTTGTCCGGT GTACCTACAT GTGATTTGGA TATCACCTTC TTATACAAAA ACAACAACAC AGACATCACC ATTTTGAACA AGACCAAGAA CTTGTTGGAA GGTAGATCAT CTATTTTCCA TTCTGCCGTA ACCTTTGCCA ATGCATTTAT GCATGCTGGT ACGACAGACG ACTCTTTCTT CAGAAAGAAT TTGGAATGGT TGGGCAGAGC TACCAATTGG TCCAAATTTT CTGCTACAGC AGCCTTGGGT GTAATTCACA AGGGCAACTT ATCTCAAGGA CGTAGTATCT TAAAGCCATA TCTTCCTGGT TCTTCTGGTG CTCCTCATAC TAAAGGTGGT TCTTTGTTTG CATTGGGTTT GATCTACGCT GGTCACGGAA GAGAAATTAT TGACTACTTG AAGCTGTATA TTGATGAACA CGGAAACTCC GCAGGAAGCA ATGATACCGA TGTCATGTTG CATGGTGCTG CTTTGGGGGC TGGTGTAGCA GGTATGGCTT CTGGAAGCGA AAGTCTTTAC GAGGCTCTTA AGGTAGTCTT GTATTCTGAT TCGGCTATTT CTGGACAAGC TGCTGGTTTG GCTATGGGTT TGGTGATGTT GGGTTCTGGT AACGAAAACG CCATAAACGA TATGCTCACC TATGCTCAAG AGACCCAGCA TGAGAATATC ATCCGTGGTT TGGCTATCGG TATTGCATTG TTGAACTATG GTCGTGAAGA GAAGGCTGAT GGTATAATTG ACAAGTTGAT GACTCAAGAG TCTTCTATCT TAAGATATGG TGGTGCTTTC ACTATTTCTT TGGCATATGC GGGTACCGGC AGCAACTCTG CCATAAAGAA ATTGTTGCAT TATGCTGTTT CCGATCCATC TGATGACGTC AGAAGAGCCT CCGTTCTTGG TTTAGGATTC TTGTTGATCC GTGATTACAC AGCTGCCCCA CAAATTGTGG AATTGTTGTC TCAATCTCAT AATCCACACG TTCGTTATGG TACTGCTCTT GCTTTGGGTA TTTCTTGTGC TGGTAGGGCC TATGCTCCGG CAATTGAAGT TCTTGAGCCA TTGACTAAGG ATCCTGTTGA TTTTGTAAGA CAGGGAGCTT TGATTGCCAG CTCCATGATT TTGATACAAC AGAACGAATT CGCCTATCCA AAGGTTAAGG ACTTCACCAA ACAGCTTGCT GATACCATTA AGAATAAACA CGAAGATGCT TTGGCTAAGT TTGGTGCTAC TTTGGCTCAG GGTATAATAG ATGCAGGTGG GCGTAATGTT ACTATTCATT TGGAGAATGC CCAGACTAAC ACCTTGAATA TCAAGGCTAT TGTTGGTTTG ACGGTGTTTG TTCAATCCTG GTACTGGTTC CCATTGGCAC ACTTTTTGTC GTTGTCTTTT GCTCCTACAT CGATTATTGG TGTTAGAGGC GACTTGAAGG CACCTCAGTT CGAGTTCAAC TGCCACACCA AACCAGAATT ATTCCAGTAC CCTCCAAAGG TGGAAGAGGC TAAGGAGAAA CAACCGGACA AGATAGCAAC TGCTGTGTTG TCTACTACTG CAAGAGCAAA AACTAGAGCC AAGAAGAAGT TGGGCAAGAA GCACGAAGAC GACGAAAAAC CTGAAGAAAA GCCCAAGGTT GAAGTATTAA GTGATGAAAA GAAAGATAAG GACGAAGCAA AGGACAAGAG TGACAAATCT GAAGATTCGA AGGACAACAA GAATGAGTCT GTGCCAGTTC GTTACACTAA GACAGCATTC AAGGTTTCTA ATCTTACCAG AGTGTTGCCT GCTCAGTCCA ATTATGTTTC TTTCATTAAA GATGATAGAT TTGTACCAAT AAGGAAATTC AGAGGCACCA GTGGTATTAT TGTTTTGGAA GATACGAAGC CTGAAGAGCC AGTTGAAATA ATACGGACTG TGCGCCAATT GAATACAACG GAAGCTCCTA TTCCTGAACC TTTCACTTTG AGTGCTGAAG ACTTAAAAGA ATTGGAAGAA GAATAATGAG TACGTTAGAG AAACATGTTC TAAATAGTTG TTGATTCTTT AGACTCGATT CTTTGATAGT TACAATTCTT CGTACACGTC ATATATTGCT ATCATTAATA TAAAGTTTTT TATATCT
|
Protein sequence | MALVSAAPYL ALLVEQDDSL KSYALQSLNN VVDQLWAEIA NNITDLEELY ENENFVSRSL AALIVSKVYY NLGDFEASVK YSLFAGDEFN IEEQSQYIET IVSQCINLYN SLSQKKFSDD SVEIDTRLAA VFNKMLEKCI SANELKLALG ISLESFRLDI VEDILKQQIK SNEENALNLI NYVLVCSNTV INNTTFRTKV LNSLIQLLMT LSNNHDFFTV IKIIVQLNDS TLAIELFKEL VDKKEDLIAY QASFDLVNTA SQELLDNVIN VLSSDKTLDQ TNAILKKILT ILSGVPTCDL DITFLYKNNN TDITILNKTK NLLEGRSSIF HSAVTFANAF MHAGTTDDSF FRKNLEWLGR ATNWSKFSAT AALGVIHKGN LSQGRSILKP YLPGSSGAPH TKGGSLFALG LIYAGHGREI IDYLKSYIDE HGNSAGSNDT DVMLHGAALG AGVAGMASGS ESLYEALKVV LYSDSAISGQ AAGLAMGLVM LGSGNENAIN DMLTYAQETQ HENIIRGLAI GIALLNYGRE EKADGIIDKL MTQESSILRY GGAFTISLAY AGTGSNSAIK KLLHYAVSDP SDDVRRASVL GLGFLLIRDY TAAPQIVELL SQSHNPHVRY GTALALGISC AGRAYAPAIE VLEPLTKDPV DFVRQGALIA SSMILIQQNE FAYPKVKDFT KQLADTIKNK HEDALAKFGA TLAQGIIDAG GRNVTIHLEN AQTNTLNIKA IVGLTVFVQS WYWFPLAHFL SLSFAPTSII GVRGDLKAPQ FEFNCHTKPE LFQYPPKVEE AKEKQPDKIA TAVLSTTARA KTRAKKKLGK KHEDDEKPEE KPKVEVLSDE KKDKDEAKDK SDKSEDSKDN KNESVPVRYT KTAFKVSNLT RVLPAQSNYV SFIKDDRFVP IRKFRGTSGI IVLEDTKPEE PVEIIRTVRQ LNTTEAPIPE PFTLSAEDLK ELEEE
|
| |