Gene PICST_88209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_88209 
SymbolFST1 
ID4838339 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp359001 
End bp362941 
Gene Length3941 bp 
Protein Length1028 aa 
Translation table12 
GC content41% 
IMG OID640389654 
ProductFungal specific transcription factor 
Protein accessionXP_001383698 
Protein GI150864738 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0520649 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TATTTAGACT CTTTTTTCGA ACCATACAAT TTTTCAATTT ATTCACTATT TGAAAACTAT 
TGACTTCTAC TGAACATAGC CAACATGGTA GACGACGCCC AGGCCCGCAA ACGAAACCGG
TTGTCGCTCA GCTGCAATTA CTGCAAAAAG AGGAAAGTCA AGTGTGATCG TGGTAGACCT
TGTTCTTCTT GTGTGCGGTA CAATGTAGCC AATCTCTGTG AATACTCGGA CCCTATATGG
CTGGAGCCGG TGGCCATAAC CGGTAATGCC GCCATGGCGG CCACCTCCAA TCAAAACTCA
CGAAAATCCC CTCAATCACA TAGCCTTTCT GCTTTCCACG TCAACGGTAG CAGCAATAGT
CCGCATACGT CATCTCATTC TCACAGCTCG AACTTGCCGT CTTTTTCTGT GCAACGTTTC
CATCCACCAG CAGCTTCTAC CAAACCGCCA CAGGCAAATG GAGGTGTTCA AACCTCTTTC
CACGTGCCGC CACCTGGCCC TTCGAATCAC TCCGGAAACC ATTCTTCACG AGGTACAGCT
AGCCACCAAG ATCCCTCTCT GGCAGCTCCG GCTGTGCAAT CTGAGTTGGA AACGCTCAAA
AACAAAATCA AGGAAATCGA GGCCTCGCTC AAGCCACAGT CATCGCAACA TAATCACAAT
CAACATAATC ACAATCTTGT GGATCATAGT CACAATTCAC GAAGTGTTCA TACTTCAACA
AACTCATTCT CATCTCAGCC ATCTCTATCT ACAGCAAACG GTAATGTCCA TAACAGAACT
TCTCCAGAAG GCTCACTTTC TTCGGCTTCT TCTAATTCAT CGGCCAATGC TATACCATAT
TCGTACTTAT CTGGTTCTAG ACCTACCGGT GGAAATGGGA ATCTTCCAAT TCAGCTTCCA
CCTCTTCAGT GGACTCATAA TGCAGACCGT CCAGATGTAG ACTATACCGG AAGACCTCAA
TATGTCAACA GATACGTTTT AAAGTTTGAG GACTTCAAGT ATAATCCAGA CAAAAACCCT
ACGTTCATTG GGATAAATCC CTTTTCTGCT GATGACGACT TGATCAATTT ATACGAAGGC
TATACACCTA TTCATATCCG AGACGCCTCC AGACGAATGA ATTACGGTCC ATTTGCTTGG
CTTTCGATAA TGAAAAAGGA TCCTGGTCTT TTGTCTCTCT GGAAATTCAT GATGTCTAAA
AAGCATGAAA GACATCTTGC CATTCAGAGA TTGGCAAGAA CCGCACCAGC TCCGGAATTG
GGACTATTAC CCGAAAAGGA ACCAGAGCAG ACTTCAGCTC CTGCCGATTC TCCCAAAGGT
GCCGACGACC AAGAAAAGAT CTTCCGAGAA AAGGCTATAG ACAGGGATGG TTTCAATGAC
TTGAGATTGT ACGGAAATGT GGCTAAGGCG AGTGACACCA GTGGTAACCT GAAGGAGAAG
ATCCACTCTG GAGGATACTC AAAGGTTGAA TTACATAATC CAAAGGCTCA CCAAAAAACC
CAAATGAACA AACTGGGTAT AGCGCTTGGT CTAACTATGT ATGAGGGACA GATTGGCAGA
GAGTTAGAGT TGATTGAGAA AATTAAAATG ATCTTACCCA AACAAAAAGT GATCTGGAAG
TTGATCAACA AGTTCTTCAA GAGTGTCTAT CCTTACATGC CATTCATAGA TGAGAACTAT
TTCAAGTTGG AGATGGCTAG AATCCTTGGT CCAGAAGCTT ACTCTGATAG TCGCTTGGGA
GACTTGAAGA TCGAGAAGAG ATTAGACTTG GCTCAGATTG GAATATTGTT GATCATCCTC
AGAATCTCCT ATTTGTCTCT TTTCTCCAAC AGAAGAGCTG TAAACCAAAA TAACTTGAAT
TCTAACGATC CAGCCCCACC TGCTATTGAG ATGAAGTATC TATTGTCTAA TCCAATCAAT
ATAGATGTCA TAGACGTAGC CCAGTTGTGT CTTGACCAAT TCGAATTGTT GCGTAAATCC
AGTTTGGTTA TCTTGCAATG TGCCTATTTC ATGAGGTTGT ATCATATGTT TTCTCCTGAA
GATGGTGATG GAGCCGACGG TGGAGACTCA CAGATATTTA ATGGAATGTT AGTACAAATG
GCTTATTCCA TGGGTATCAA CAGAGAACCG GATACCTTTC CAGATATCTG TAATGACGAG
AAGGTTAACA ATCTTGGTAG AAAGATTTGG TTCTTTTTGA TTATCAACGA TTTGATCCAG
GCTTACCAAT ACGGCAACCC CTTGACTATC AGAGAGAAAT ACTACGATAC CAAGTTACCA
TACTATAAAA AGGGTAACGA AAATATTTCA GATGTGTCTA TGGAACAGCA TGTCATCAGT
ACATTTGCCT ATTTTGAAAA GTACTACTAT AGGTTGACAG CTATTCTTGA TATCTGCTTG
GATATCAGGA AGAAGGTGAA GGTTTCACAA TTGTCAAACT TGATGGGTGA CTTTGAACAT
CATTTAAATG ATCATTATGG AACATTGCGA TACTTCTTGG TGCCATTCCA ACAGGACAAT
TACGTTTACC CTTTCCTCAA GACAATGAAG TGTAAGAACT ATATCAACAT GAAAGGCTTT
TTACTCACCA TATATTTCCA CATGTACCTT TATTACGAAA GTAAGAAGAA GACCGACTTT
GCCTACTTCT ATGTCAAAAA GATATTTGCG ACAACGTGTG GTGAGTTTGT TCCTTCTTTC
TTCCCTTTGA TCACGAACAA CTATATCAAC TTTGGTGAAA CGGCAGACTT GATCTTGAAC
CCCACCATTG AGAGTATGAT TCACAAGACT ACGCAGATGA ACTTTGCGAT TTTGGTGAGA
CTAAATTCGA CCATTTATCG TATGAAGCAC AATGCAAATC ATGGCTTAAA TTTGAGAGAC
AACTTTGGCT ACAAGTTGAA ATTTGCCAAA TTATGTAAGC TTTCTAAAAT TTTGGAGAAG
ATTATCAAGT TCTGTATAGC TGCGATGTCC AGATTGAGTC AGCGTTACTA CTATGCCTGG
CGTGTTACCA AGGCACATAC TTTCTTGTTG AAGATGATTA TTGGTGAAGA ATTCTACAAG
TATTGCCAGG ATGATGAGCA GATAATCTTC TTAGACTTGA GCAATGATCA GCTCAATGAA
TTGACAGAAA TTTGTGAAGT CACATTGAAG AAGTTCGGCA AGACTAAGAT CTTCCATGCA
GCAAGTGAAT TGTTTGACGA TATCAACAAT GAAGATATTG AAGCTCCAGT TCTTGATGGA
GCCAACTCTC GTGTCAGTGT TTCTTCTACC CCACAAGCTG CTTTTGATGG TACTAATGGT
TCAGAACCAA ATGCAGCCCA TATGGCTGGA GTCATAAATG TTCCATCATC AGTTGATTCC
ATAGGCAGTG CAGATATGGA TGATTTCCAA TTTATTGAAA ATGTTGAAAT CGATAAATTG
TGGTACCAAA TGGCTTCCAT CAAGAACGAA AACAATAACA ACTCCAACAG CAATACCAAT
ACCAACAACA ATAGCACTGG CAAGGCTACG GGATCAGCAG GGTTGTTCAA CTTTGGCCTC
AACGTTGTTT CTGGAAGCTC TGCGCAGGAC GATGGGCCCT TCAAGTTACC TGGACAGAAA
GCCAGTGTTG TTGATCCTAA CCCTGCTGAA GGCGGTTCTA CTGGTTATAC CCCTATGAAC
TTGTCTATTC CGACTCCTTT GATGAATCCC TTGAGTCCAG AGAGTTTGAA CACAAATTCA
GGAGGGCAGC CATCTAACGC TGCTGCTGAA GAGGAGTTGA CAACGGCAGA TTTTTACTCC
ATCGATATTT TCAACAATTT ACCTATTGAC CAGTTACTTG GTTTCCGTGA CGAAGTAGAT
GAAAGTTGGT AGAAATAAGT ACCATAGACA TAAGATTAAG AAGAGTATCA ATATCTGTAT
ATTACTGTAT CTATGTAATT AGAAAAGGAT GTTCTTCCTT G
 
Protein sequence
MVDDAQARKR NRLSLSCNYC KKRKVKCDRG RPCSSCVRYN VANLCEYSDP IWSEPVAITA 
SHQDPSSAAP AVQSELETLK NKIKEIEASL KPQSSQHNHN QHNHNLFEDF KYNPDKNPTF
IGINPFSADD DLINLYEGYT PIHIRDASRR MNYGPFAWLS IMKKDPGLLS LWKFMMSKKH
ERHLAIQRLA RTAPAPELGL LPEKEPEQTS APADSPKGAD DQEKIFREKA IDRDGFNDLR
LYGNVAKARY SKVELHNPKA HQKTQMNKSG IALGLTMYEG QIGRELELIE KIKMILPKQK
VIWKLINKFF KSVYPYMPFI DENYFKLEMA RILGPEAYSD SRLGDLKIEK RLDLAQIGIL
LIILRISYLS LFSNRRAVNQ NNLNSNDPAP PAIEMKYLLS NPINIDVIDV AQLCLDQFEL
LRKSSLVILQ CAYFMRLYHM FSPEDGDGAD GGDSQIFNGM LVQMAYSMGI NREPDTFPDI
CNDEKVNNLG RKIWFFLIIN DLIQAYQYGN PLTIREKYYD TKLPYYKKGN ENISDVSMEQ
HVISTFAYFE KYYYRLTAIL DICLDIRKKV KVSQLSNLMG DFEHHLNDHY GTLRYFLVPF
QQDNYVYPFL KTMKCKNYIN MKGFLLTIYF HMYLYYESKK KTDFAYFYVK KIFATTCGEF
VPSFFPLITN NYINFGETAD LILNPTIESM IHKTTQMNFA ILVRLNSTIY RMKHNANHGL
NLRDNFGYKL KFAKLCKLSK ILEKIIKFCI AAMSRLSQRY YYAWRVTKAH TFLLKMIIGE
EFYKYCQDDE QIIFLDLSND QLNELTEICE VTLKKFGKTK IFHAATNSRV SVSSTPQAAF
DGTNGSEPNA AHMAGVINVP SSVDSIGSAD MDDFQFIENV EIDKLWYQMA SIKNENNNNS
NSNTNTNNNS TGKATGSAGL FNFGLNVVSG SSAQDDGPFK LPGQKASVVD PNPAEGGSTG
YTPMNLSIPT PLMNPLSPES LNTNSGGQPS NAAAEEELTT ADFYSIDIFN NLPIDQLLGF
RDEVDESW