Gene PICST_76670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_76670 
SymbolFST5 
ID4837758 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp1742226 
End bp1745640 
Gene Length3415 bp 
Protein Length1026 aa 
Translation table12 
GC content43% 
IMG OID640389073 
ProductFungal transcriptional regulatory protein 
Protein accessionXP_001383619 
Protein GI150864684 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTTTAACCAT TTTTCCTACG TCCTAAGCAC AACGCTGGTA CTGGCGTCAC TCAGCGAGTC 
CCGTGACGTT TTTCCTTACC CTGCCACCTC TTCCCACCGC TTCCATCCAT GGCCCAATAG
CCATGTCTAC CAAGAAGCGC AATAGAGCCA CTAAGGTGTG CGACTTCTGT AGAAGGCGCA
AAGTTAAGTG TGATTTGGGG AACCCCTGCT CCACCTGTGT CAAGTACGGC CACCGCAATT
GCCGATACAA GGAACAAGAC AACGAGGCCG ACTCAGACTT CAAAGTCCAG GATCAGTTGC
TCCAGTTAAA GGAGAAATTG AGACTGCTCG AAGAGTCTGT AAATCAGAAC CCAGCCTTCA
GTGAGTACCA CAAGGAAAAG AGCAGCATCA GCAGTAGCAG CAGCAGCAGC AGCAACATTA
ATAACTATAA CAGTAATCGT AACATTTCTA GTAACAACAG TGTCAATAAC AGTAGTAGCG
TGAATAGCAA GATCAATAAC AATCTAAACA ACAATCTTAG CAATCTTAAC ATTAACAACA
ACCCTAACAG CATCAATGTT AATGACAACC TCAATATCAA CAACGAAGAT GTCAACTTCA
ACAGCAATAT CACCAACAAT AACAAACGAC AACATCTTGC GGCCAGCAGT TCGCCACCAT
TTGAGAACCC TGGTTTTGGA ATCTTGGATG TACGGAGCGT CTTAGGGAGA AACCCCGTTG
CATCAGAGGA GGATGTAATC AACTTCTACG ATGGATACTC GTCCGTCTAC GACAAGGAAC
CAATTAGAAG ACGGAATTTC GGGCCCATGG CATGGGTCAC TTTAGTAAAA GTGGATAACT
GTAGTGGGAA GCTCTGGGAC TATATACATT GTGTCAAGAA CTCGAACAAG GAAATCCGGG
GCGCCAAGTT TAACGGCGAG ATGTTAAGCA AAGTAGACGT TTTTCCCACA GCCAACGCAG
TGGACCAGGA TTTCCGAGAA AAGGTCACCG AAGATGAAGG GTTCAACGAA GTGAGGCCAT
ATAAAGATGT TGCCTATAAA CTCTCACCGG GACACAATAA GTCTTCTATT GACAATAAGC
ATTCTCAGAA TACTTTGAAT GAGAAGGCCA AGAGCTTAGG CTTGATGTTC TACCAAGGGG
GATTGGACGA GGAGTTGGCG TTGATCGAAA AGATCCGCTT GGTCTTGCCC AAACGTAAAG
TCATATGGCT ATTGTATAAG CGATACTTCA CCCACTTGTA TCTGGCCTTG CCATTGCTTG
ACGAAGTCGT CTTCAAGGAA AAGATCCAGA AGCTCGTGGG CCAGGAAAGC TACGAGGATG
TAGATGTCAA AGTGCAAGTG GAGATGCGAT TGGATTTCGC CCAGTTGGGA TTGTTGCTCA
TAGTACTTCG TTTGAGTTAC TTGACGTTGT TCACGAATGT CGCTTCTATC AATGAGGCGA
ACTTGATTCT GAATGATCCA TCACCAAGAG CCCAGGAAAT CAAGTATTTA TTGAACAATC
CCATCAACAT CGACGTTATA GATGTGGCTC AGAGCTGTCT TAACCAGTTT AACCTAATGA
AGAATGTCAA CATGACCACC ATGCAATTGG CCATGTACAT GAGGATCTAC CACATGTATG
CTCCTGAAGA AGGCGACGGT ATCGACGGTG GAGATGCGCA GGTGTTTAAT GCCATGTTGA
TTCAGATGGC TTACTCTCTA GGCTTGCATC GTGAACCAGA CAAATTTCCC AAAGAATTGA
ACGACGAAAA GACAAACAAC CTTGGAAGAA AAATGTGGTA CATGTTGTTG GTGTTTGACA
TGAACAACTG TATGGCTAAT GGTACTCCCA TGAATGTACA CAGGTTGTCT TTCGACACCG
TATTCCCTTT TTATAGACCA GGAAACGAGA ACGTCATAGA TGTGGAGGTA GAAAAAAGCG
TCTTAAGTTC ATTCAACCGT TTCAACAACG TGTACGCTCC CATGACAGAA ATTCTCGATA
TGATTGTCAA AGTCAAGGGA GGGGTCAAGA TGACCGATCT TGCTGAGAAA TTGTCGTATA
TGGAATCTCA CTTTGTAGAG GAGTATGGCA AGATAGGCAA TCATTTCGAC GCTGGAAACT
TGAGTCAAGT TGAAGTATAC CAGACAACAT TAAAAGTCAA AATCCATCTA ACCTCCAATA
TGTTTGTTGT CACAATTTTT TTCCACATCT TCAACTATTA CGAGAAGAAA GGATTTTCTG
AGTTAGCCTT CTTTTACCTC AAGAAGATTT TCTTGATCAC CATCTTAGAC TTAATGCCTT
TCTATTTTGA ATTCTTAGAC CGCTCCCATA TTATTTTCAA GAATTCCACG GATTTGAGCA
TTACTCCCGC ATTTGAAATG GTGACTCATA AGGCTCTTAT TGTATTGATG AGTATACTAT
TAAGAGTTAG ATTTGCTATC AAGACGAGCG AGGACTCCTT TGATCACAAT ATGAAATTAA
TCAAGGTTAT CAGCTACAAG ATATATTACG ACTGTTTGAT TAGAATTAAG GGTCTTCTCG
AAAAGTGTTT GGGCGTTTTC AGAGAGAGCA TTGCAAAGTT AAGTCACAGA TATTACTATT
CGTGGAGAAT TACCAAGGCA CAGAACTTCT TAAACACACT TTTGACAAGC GACCAGTTAT
ACGAGACTTA TGCTCCCCGC ATGAGCGGGC CCAGATTGGA ATTCACTAAT GCCATGTTGG
AAGAATTGAC CACTATTCTT GAAAAGGCTT TATACAAAGT GAAGGAACAC AAGAAGGCCC
AAAAAGGTGG GTCTCAAGCA TCTAAAGCAG ATCCAACGGT CAAGAGGGAT CCTGCGAGCT
ACACCGTTAG TGGATCTGTT GGACTGAGTG ACGATCGCTG TAATTCATAC GAAGATCGTG
ACAAGATTAC ACCTACTTTT TCGAGTACTT CTGTTGGATC AACTACCTCT AATGACAACT
ATTTTGACGG TGACTACTTG CCCAATGACC AGATCGACTC CATTTGGTTA CAGATGATGT
CGTTGAAGAG TCAGGCGGCA GATCCCACAG CCAACTTCAA CACCCCTGCC CCACTTGGAG
TGGGCATTCC TGGATCAAGT ATGTATGACG CATTGGGAAC GCCTGGTGCG TCCCTTGGTG
GGGACATGGG TCCTTATCTT GAGACAGATA ATCTCAACCT CTATCAGCGA GACGATTTCT
TTGGAAACAT GCCTCTTGAA GAGATCTTCA AAGACTTCAG CTGATTTTTT CGTTTTTTTT
CCACGACTTG TTTTACTATT GCTTTTGAAA CTAGAAATTA CCATAAAAAC CATCAAAAAG
ACTAAAAAAA AGAATGCTAA TTGAATTCAG GAGATCTATT CCAGATTTTG TATATATCAT
TGTTTATTAA CTTGTAAATA GAACGATTGT GAATGAATGA ATGTCTATGT AAATG
 
Protein sequence
MSTKKRNRAT KVCDFCRRRK VKCDLGNPCS TCVKYGHRNC RYKEQDNEAD SDFKVQDQLL 
QLKEKLRSLE ESVNQNPAFS EYHKEKSSIS SSSSSSSNIN NYNSNRNISS NNSVNNSSSV
NSKINNNLNN NLSNLNINNN PNSINVNDNL NINNEDVNFN SNITNNNKRQ HLAASSSPPF
ENPGFGILDV RSVLGRNPVA SEEDVINFYD GYSSVYDKEP IRRRNFGPMA WVTLVKVDNC
SGKLWDYIHC VKNSNKEIRG ANKVDVFPTA NAVDQDFREK VTEDEGFNEV RPYKDVAYKL
SPGHNKSSID NKHSQNTLNE KAKSLGLMFY QGGLDEELAL IEKIRLVLPK RKVIWLLYKR
YFTHLYSALP LLDEVVFKEK IQKLVGQESY EDVDVKVQVE MRLDFAQLGL LLIVLRLSYL
TLFTNVASIN EANLISNDPS PRAQEIKYLL NNPINIDVID VAQSCLNQFN LMKNVNMTTM
QLAMYMRIYH MYAPEEGDGI DGGDAQVFNA MLIQMAYSLG LHREPDKFPK ELNDEKTNNL
GRKMWYMLLV FDMNNCMANG TPMNVHRLSF DTVFPFYRPG NENVIDVEVE KSVLSSFNRF
NNVYAPMTEI LDMIVKVKGG VKMTDLAEKL SYMESHFVEE YGKIGNHFDA GNLSQVEVYQ
TTLKVKIHLT SNMFVVTIFF HIFNYYEKKG FSELAFFYLK KIFLITILDL MPFYFEFLDR
SHIIFKNSTD LSITPAFEMV THKALIVLMS ILLRVRFAIK TSEDSFDHNM KLIKVISYKI
YYDCLIRIKG LLEKCLGVFR ESIAKLSHRY YYSWRITKAQ NFLNTLLTSD QLYETYAPRM
SGPRLEFTNA MLEELTTILE KALYKVKEHK KAQKGGSQAS KADPTVKRDP ASYTVSGSVG
SSDDRCNSYE DRDKITPTFS STSVGSTTSN DNYFDGDYLP NDQIDSIWLQ MMSLKSQAAD
PTANFNTPAP LGVGIPGSSM YDALGTPGAS LGGDMGPYLE TDNLNLYQRD DFFGNMPLEE
IFKDFS