Gene PICST_935 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_935 
SymbolFST4 
ID4840929 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp419169 
End bp422231 
Gene Length3063 bp 
Protein Length760 aa 
Translation table12 
GC content39% 
IMG OID640392244 
ProductFungal transcriptional regulatory protein 
Protein accessionXP_001386474 
Protein GI150866770 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.664373 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GCGCGTAAAA GGAACAGAAT AACGTTGGTC TGTAACGTGT GCAAGTTCAG AAAGGTTAAA 
TGCGATAGGG GACAGCCATG CTCATCGTGT ACAAAGTACA ACACGTCTCA TATCTGTGCT
TACAATAAGC CCTACTATGG AGATGCTTCG GTAGATGTTG TTAATGTAGA ACCAGATGTT
CTGCAAAAGA ATTCACTCTC TGAATCTTCG TCTGTTAACT CGGCTCAAAC TGCTAGTGGT
ATAGTAGATT CCCCCAATTC CGGACCATCA TCCACGTCTA CATCATCTAC ATCTGTTTCG
GCTATTTCGG CAAATCAGCC TGCTAGTTCT TCTGTATTTT CTAAAGTGAT TACACTGGCT
ACTGCCCAAG ACACCACGTC TCTGAATTTG TTCAATAGAA ATGCCGTTCC AACTCCTTTA
GAAAGCCACG ATAGATTATC CAATAGCAAT ACTTCGGCAT TCAAACCTGA CTACCAAACA
CCTAGACGAA TAACACTGAA GCAAAAACTC CAAGCTCAGC ATCCTCAATT AGGTCTTCAA
CAACCTCAAC AAGCACTAGG ACAACAACCA TTGGCACAGC AACCGTCATC ACAACAACCA
TTGTCACAAC AACCCTTAGA TCAACAGCCC CAAGCTTTAC GCCATCAAAA TACTACTCTT
CCTACTCCAC AACGACCACC TCCACAATTA ACACATGTGC CACAGCAAGA TGCAAGATTG
CGAAGGGTGC ATCTGTCGGT TTCGCTGGAA TCGTATAAAT CTTCAGAGAG ACCATACAAG
CTGTCAATTT CATCAACTGG TTCCTATTCA ACAACATCAC CAGTTGTAAC TGGCAGAAGT
CCGCATGGCC TAAATAATAT CAATAGTGTA GATCAGTCCT TAAACAAACT TCTGGAAGTA
CAGTCCAATG ATTCCGACAA AGGTAGTCAA CAATTCACTT CTCCGTCCTA TGCCTCTGCA
ACAACTCATG CCAGTACAGG ACCTACATCC GTTAGTTCAA CTACCCATCT TGACTACAAA
GAGCCGAACT TGACCCCAAA TTCTAGAGTA TACCTTGTAG CGAAACAGTA TGTAGACATA
ATGAAGCATT TCCAGGACTA TAAATCAGTG ATTGGAATCA ACCCGGTTAG CTCACCTTCA
GACACAATTA ATTTCTTTGA TTATCCGCCT ATTTCCACAA CCCCAGACAA ATATGAAGTC
AACCACGGAC CTTTTACGTG GCATGCATTC TTGAAGATGG ACATGGGGTT GTCTGATTTG
TGGGCATTCA TGTCAACTAA GTCTCAGAGC TTCCACGAGG CTAAGACTAA ATTTATGGAT
TGTGTGCCTC CTGGCAAGAA CAAAGAAAGT TCAAGATCTT TTGAAAATCA GTCAGTTTTG
AACAAAACAC CAATCCAAGT ACTTCAAAGA TTGAATATAG CAGCTAAGAA AAAATTTAAC
ACTTCTAAGC ATAATATCAA TGAAGGTATA CCTTTGGGAC TTAACTTCAT TACCGACCAT
GAATATTTAA ACAATAATGA TATTGAATTG TCTTCCAAGA TCTTGACAGT ATTGCCAAGC
AAGAGAATTG TTTGGATCCA TATTAATAGG TTTTTCAAAT TTATCTACCC TTATATTCCT
TTCCTAGATG AAACTTCGTT CAAGAAGTCT ATTGAGGCTA TTATTGGTCC AGAGTCATTT
ACGGAAGAAA AGATAGAAGG CCTAAAGATA GAAGATAATA TTGACTATGC CAAACTTGGA
ATCATGTTGA TTATTCTCAG AATGAGTTAT CTTTCGTTAA TCACCAACAA CCAGGAAGTC
AACGAGTTCT TCTTGGGGTA TGACGGGCAG AAGCGCAATA CATTCTATAA AGAAAAGCAG
AAATTGACGG ATTCGGAATT GCGTAGCTTG AAAATGCTAC TCCTTAACTC TATTGGAATT
GATGTCATTG TATTAGCAAG AAACTGTTTG AACAAATTCC AGTTATTCCA GAAGATGAAC
CTATCCATTT TGCAATTGGC ATTGTTCATG AGAATCTACG TTTTCATTGC GCCAGAAGAT
TCCGAAGGTC CTGATCGCAA CCAGTTTCAG ATTTACAATG GCACCTTGCT TCTGATGGCT
TATTCAATAG GATTGAATAG GGAACCGGAT TCGTTCAAGT CCGTCTTAAA TGATCCAAAG
GAAAATAATA TCAGAAGAAA GATTTGGTTC CATTTGAACT ATATCGATTT ATTACATGGA
TTTTCTTCCG GTAACCCATT AACCACTAAT CCAATAGCTT CAGATGTAAA GTTTCCTTTC
CACGAGGATG GAAATGAAAA CATTAGTTTT GAAGCTCTTG ACTCTTATAC ACATGTTGCT
TGGAAATTTT TGCAACCGCT CAGTACCAAA ATGAGAAATA TTCTTAATAT CGTGGGAAAT
GTCAGTGAAG GAGTCAAAAT TATAGAATTA GTAGAAGCAT TGAATGACTT CGAGATATTT
GTGGCTACTA ATCTCGGCAC CCACATTTCT GAATTTATAG ATGCTTTCAA GGAGGATTTT
TTTAATGACA ACTTCAGAAG AGTCTTGCAA GTTCATTACT ATATCCAAGT TAAATTTTTC
ATGGTATCTA TCTATCATCA TCTTTTTATT CATTACGAGA CAGAAAAGAA TCAGGAATTG
AGCTTTTTCT ATCTCAAAAA GATGTTAATG GTATTGACTG AAGAATTTCT TCCTAGTTAC
TACACTATTT TGGAACATGA GCATTATTAC TTCAGCTACA GTGCAAACCT TTTTTTGAAT
CCTGTCATTG AGTCGGCAAT GCATAAGTCT AATGGTATCT TGTTCAGTCT CATTATCCGC
ATGAGCTACA CAATCAGAGC CATGGAGGGA GCTAATCCTT CTGCTCATGT TGAAAAAATG
GCTACGGATG AGTCGTATAA ATCATACTTC GATGATATGC TGCAATTGCA ATACTATCTC
ACAAAGTGTG CTAAACTCGG GACTCTAGGA ATATCAAAGC TTGGTAACCG TTACTTGTAC
GCTTGGAGAA TTAGCAAATC CAACTTGTTT ATTCTTCGGG CTATTCTGAG TGACGAGTTT
TAC
 
Protein sequence
ARKRNRITLV CNVCKFRKVK CDRGQPCSSC TKYNTSHICA YNKPYYGDAS VDVVNVEPDV 
SQKNSLSESS SVNSAQTASG IVDSPNSGPS STSTSSTSVS AISANQPAMI GINPVSSPSD
TINFFDYPPI STTPDKYEVN HGPFTWHAFL KMDMGLSDLW AFMSTKSQSF HEAKTKFMDC
VPPGKNKESS RSFENQSVLN KTPIQVLQRL NIAAKKKFNT SKHNINEGIP LGLNFITDHE
YLNNNDIELS SKILTVLPSK RIVWIHINRF FKFIYPYIPF LDETSFKKSI EAIIGPESFT
EEKIEGLKIE DNIDYAKLGI MLIILRMSYL SLITNNQEVN EFFLGYDGQK RNTFYKEKQK
LTDSELRSLK MLLLNSIGID VIVLARNCLN KFQLFQKMNL SILQLALFMR IYVFIAPEDS
EGPDRNQFQI YNGTLLSMAY SIGLNREPDS FKSVLNDPKE NNIRRKIWFH LNYIDLLHGF
SSGNPLTTNP IASDVKFPFH EDGNENISFE ALDSYTHVAW KFLQPLSTKM RNILNIVGNV
SEGVKIIELV EALNDFEIFV ATNLGTHISE FIDAFKEDFF NDNFRRVLQV HYYIQVKFFM
VSIYHHLFIH YETEKNQELS FFYLKKMLMV LTEEFLPSYY TILEHEHYYF SYSANLFLNP
VIESAMHKSN GILFSLIIRM SYTIRAMEGA NPSAHVEKMA TDESYKSYFD DMSQLQYYLT
KCAKLGTLGI SKLGNRYLYA WRISKSNLFI LRAISSDEFY