Gene PICST_81390 details

Gene Information

Locus tag: PICST_81390
Symbol:
ID: 4837477
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 1768781
End bp: 1771977
Gene Length: 3197 bp
Protein Length: 955 aa
Translation table: 12
GC content: 41%
IMG OID: 640388792
Product: predicted protein
Protein accession: XP_001383108
Protein GI: 150864336
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG5116] 26S proteasome regulatory complex component
TIGRFAM ID:
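The start, end, and gene length fields in the record can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state but which agrees with the listed length; the function name `span_length` is illustrative only.

```python
# Coordinates copied from the record; 1-based inclusive numbering is an assumption.
START_BP = 1768781
END_BP = 1771977


def span_length(start: int, end: int) -> int:
    """Return the length of a 1-based, inclusive coordinate range."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


print(span_length(START_BP, END_BP))  # 3197, matching the Gene Length field
```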


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CCGAAGTGTC TGATTAGTAG ACTAGTGTTG GGAATTGAAT CTTTATCGCT GATTGTAACC 
ATCCAGAGCC TTCAGACTAT TGCTATCGAG TCAGAGAAAG TTAAAAATCA TTTTCTTATC
CTGAGTCATT TACATTCTTC ATATCTCTCG GTGATTGCAT TCAACGATTT ACACAACATC
CACCACATCT CTATAAATCA TTAATAAAAT GGCTTTGGTA TCTGCTGCTC CGTACTTGGC
TCTTTTAGTT GAGCAAGATG ACAGCTTGAA GTCATATGCC TTACAATCGT TGAACAACGT
TGTTGACCAG TTGTGGGCTG AGATCGCTAA CAACATTACA GACTTGGAAG AACTCTACGA
AAACGAAAAC TTTGTCAGCA GATCGTTGGC TGCACTTATT GTCTCAAAAG TCTACTACAA
TTTGGGCGAT TTTGAAGCTT CCGTCAAGTA CTCGTTATTT GCTGGTGATG AGTTCAACAT
TGAAGAACAG TCTCAGTATA TTGAAACCAT TGTTTCGCAA TGTATCAATC TCTACAACTC
ATTATCGCAG AAGAAGTTTT CTGATGACTC CGTAGAGATC GACACTCGCT TGGCTGCAGT
CTTCAACAAG ATGTTGGAAA AGTGTATCTC TGCCAACGAG TTGAAGTTGG CCCTTGGTAT
CTCATTGGAA AGCTTCAGAT TGGATATTGT AGAAGACATA TTGAAACAGC AAATCAAGAG
CAACGAAGAA AACGCATTGA ACTTAATAAA CTACGTGTTG GTGTGTTCTA ACACTGTTAT
TAACAATACG ACTTTCAGAA CGAAAGTGTT GAATTCGTTG ATCCAGTTGT TGATGACTTT
GTCCAACAAT CACGATTTCT TCACCGTGAT CAAGATCATT GTCCAGTTGA ATGACTCAAC
TTTGGCTATA GAACTTTTTA AAGAATTGGT AGACAAGAAG GAAGACTTGA TTGCCTACCA
GGCTTCGTTT GACTTGGTAA ATACTGCTTC GCAGGAATTG TTGGATAATG TGATCAATGT
CTTAAGTAGC GACAAGACGC TTGATCAGAC CAATGCCATT CTCAAGAAGA TCTTGACGAT
CTTGTCCGGT GTACCTACAT GTGATTTGGA TATCACCTTC TTATACAAAA ACAACAACAC
AGACATCACC ATTTTGAACA AGACCAAGAA CTTGTTGGAA GGTAGATCAT CTATTTTCCA
TTCTGCCGTA ACCTTTGCCA ATGCATTTAT GCATGCTGGT ACGACAGACG ACTCTTTCTT
CAGAAAGAAT TTGGAATGGT TGGGCAGAGC TACCAATTGG TCCAAATTTT CTGCTACAGC
AGCCTTGGGT GTAATTCACA AGGGCAACTT ATCTCAAGGA CGTAGTATCT TAAAGCCATA
TCTTCCTGGT TCTTCTGGTG CTCCTCATAC TAAAGGTGGT TCTTTGTTTG CATTGGGTTT
GATCTACGCT GGTCACGGAA GAGAAATTAT TGACTACTTG AAGCTGTATA TTGATGAACA
CGGAAACTCC GCAGGAAGCA ATGATACCGA TGTCATGTTG CATGGTGCTG CTTTGGGGGC
TGGTGTAGCA GGTATGGCTT CTGGAAGCGA AAGTCTTTAC GAGGCTCTTA AGGTAGTCTT
GTATTCTGAT TCGGCTATTT CTGGACAAGC TGCTGGTTTG GCTATGGGTT TGGTGATGTT
GGGTTCTGGT AACGAAAACG CCATAAACGA TATGCTCACC TATGCTCAAG AGACCCAGCA
TGAGAATATC ATCCGTGGTT TGGCTATCGG TATTGCATTG TTGAACTATG GTCGTGAAGA
GAAGGCTGAT GGTATAATTG ACAAGTTGAT GACTCAAGAG TCTTCTATCT TAAGATATGG
TGGTGCTTTC ACTATTTCTT TGGCATATGC GGGTACCGGC AGCAACTCTG CCATAAAGAA
ATTGTTGCAT TATGCTGTTT CCGATCCATC TGATGACGTC AGAAGAGCCT CCGTTCTTGG
TTTAGGATTC TTGTTGATCC GTGATTACAC AGCTGCCCCA CAAATTGTGG AATTGTTGTC
TCAATCTCAT AATCCACACG TTCGTTATGG TACTGCTCTT GCTTTGGGTA TTTCTTGTGC
TGGTAGGGCC TATGCTCCGG CAATTGAAGT TCTTGAGCCA TTGACTAAGG ATCCTGTTGA
TTTTGTAAGA CAGGGAGCTT TGATTGCCAG CTCCATGATT TTGATACAAC AGAACGAATT
CGCCTATCCA AAGGTTAAGG ACTTCACCAA ACAGCTTGCT GATACCATTA AGAATAAACA
CGAAGATGCT TTGGCTAAGT TTGGTGCTAC TTTGGCTCAG GGTATAATAG ATGCAGGTGG
GCGTAATGTT ACTATTCATT TGGAGAATGC CCAGACTAAC ACCTTGAATA TCAAGGCTAT
TGTTGGTTTG ACGGTGTTTG TTCAATCCTG GTACTGGTTC CCATTGGCAC ACTTTTTGTC
GTTGTCTTTT GCTCCTACAT CGATTATTGG TGTTAGAGGC GACTTGAAGG CACCTCAGTT
CGAGTTCAAC TGCCACACCA AACCAGAATT ATTCCAGTAC CCTCCAAAGG TGGAAGAGGC
TAAGGAGAAA CAACCGGACA AGATAGCAAC TGCTGTGTTG TCTACTACTG CAAGAGCAAA
AACTAGAGCC AAGAAGAAGT TGGGCAAGAA GCACGAAGAC GACGAAAAAC CTGAAGAAAA
GCCCAAGGTT GAAGTATTAA GTGATGAAAA GAAAGATAAG GACGAAGCAA AGGACAAGAG
TGACAAATCT GAAGATTCGA AGGACAACAA GAATGAGTCT GTGCCAGTTC GTTACACTAA
GACAGCATTC AAGGTTTCTA ATCTTACCAG AGTGTTGCCT GCTCAGTCCA ATTATGTTTC
TTTCATTAAA GATGATAGAT TTGTACCAAT AAGGAAATTC AGAGGCACCA GTGGTATTAT
TGTTTTGGAA GATACGAAGC CTGAAGAGCC AGTTGAAATA ATACGGACTG TGCGCCAATT
GAATACAACG GAAGCTCCTA TTCCTGAACC TTTCACTTTG AGTGCTGAAG ACTTAAAAGA
ATTGGAAGAA GAATAATGAG TACGTTAGAG AAACATGTTC TAAATAGTTG TTGATTCTTT
AGACTCGATT CTTTGATAGT TACAATTCTT CGTACACGTC ATATATTGCT ATCATTAATA
TAAAGTTTTT TATATCT
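The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before computing anything from it. A minimal sketch of a GC-content calculation over such text follows; the helper name `gc_percent` is hypothetical, and how the GC content field in the record was derived or rounded is not stated in the source.

```python
def gc_percent(seq_text: str) -> float:
    """Return the percentage of G and C bases in block-formatted sequence text."""
    # Drop spaces and line breaks used for layout, then normalize case.
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Example on the first line of the gene sequence only, not the whole gene.
first_line = "CCGAAGTGTC TGATTAGTAG ACTAGTGTTG GGAATTGAAT CTTTATCGCT GATTGTAACC"
print(round(gc_percent(first_line), 1))
```

Pasting the full sequence text into `gc_percent` gives a value that can be compared with the GC content field.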
 
Protein sequence
MALVSAAPYL ALLVEQDDSL KSYALQSLNN VVDQLWAEIA NNITDLEELY ENENFVSRSL 
AALIVSKVYY NLGDFEASVK YSLFAGDEFN IEEQSQYIET IVSQCINLYN SLSQKKFSDD
SVEIDTRLAA VFNKMLEKCI SANELKLALG ISLESFRLDI VEDILKQQIK SNEENALNLI
NYVLVCSNTV INNTTFRTKV LNSLIQLLMT LSNNHDFFTV IKIIVQLNDS TLAIELFKEL
VDKKEDLIAY QASFDLVNTA SQELLDNVIN VLSSDKTLDQ TNAILKKILT ILSGVPTCDL
DITFLYKNNN TDITILNKTK NLLEGRSSIF HSAVTFANAF MHAGTTDDSF FRKNLEWLGR
ATNWSKFSAT AALGVIHKGN LSQGRSILKP YLPGSSGAPH TKGGSLFALG LIYAGHGREI
IDYLKSYIDE HGNSAGSNDT DVMLHGAALG AGVAGMASGS ESLYEALKVV LYSDSAISGQ
AAGLAMGLVM LGSGNENAIN DMLTYAQETQ HENIIRGLAI GIALLNYGRE EKADGIIDKL
MTQESSILRY GGAFTISLAY AGTGSNSAIK KLLHYAVSDP SDDVRRASVL GLGFLLIRDY
TAAPQIVELL SQSHNPHVRY GTALALGISC AGRAYAPAIE VLEPLTKDPV DFVRQGALIA
SSMILIQQNE FAYPKVKDFT KQLADTIKNK HEDALAKFGA TLAQGIIDAG GRNVTIHLEN
AQTNTLNIKA IVGLTVFVQS WYWFPLAHFL SLSFAPTSII GVRGDLKAPQ FEFNCHTKPE
LFQYPPKVEE AKEKQPDKIA TAVLSTTARA KTRAKKKLGK KHEDDEKPEE KPKVEVLSDE
KKDKDEAKDK SDKSEDSKDN KNESVPVRYT KTAFKVSNLT RVLPAQSNYV SFIKDDRFVP
IRKFRGTSGI IVLEDTKPEE PVEIIRTVRQ LNTTEAPIPE PFTLSAEDLK ELEEE
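The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below builds that table from the standard code with this single sense-codon change (start-codon differences are ignored) and translates a short stretch of the gene sequence. The helper names are hypothetical, and the fragment was picked by eye from the sequence above, so its reading frame is an assumption.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table_12() -> dict:
    """Codon table for translation table 12: standard code with CTG -> Ser."""
    codons = ["".join(triplet) for triplet in product(BASES, repeat=3)]
    table = dict(zip(codons, STANDARD_AAS))
    table["CTG"] = "S"  # the one sense-codon difference from the standard code
    return table


def translate(seq_text: str, table: dict) -> str:
    """Translate block-formatted DNA text in frame 0, ignoring whitespace."""
    seq = "".join(seq_text.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(table[seq[i:i + 3]] for i in range(0, usable, 3))


# Fragment taken from the gene sequence; it contains one CTG codon.
fragment = "AAGCTGTATA TTGATGAACA C"
print(translate(fragment, build_table_12()))  # KSYIDEH
```

The output corresponds to the `KSYIDEH` stretch in the protein sequence; under the standard code the same fragment would start `KLY` instead.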