Gene PICST_82123 details


Gene Information

Locus tag: PICST_82123
Symbol: FST2
ID: 4837511
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 1895243
End bp: 1899374
Gene Length: 4132 bp
Protein Length: 1080 aa
Translation table: 12
GC content: 39%
IMG OID: 640388826
Product: Fungal transcriptional regulatory protein
Protein accession: XP_001383135
Protein GI: 150864357
COG category:
COG ID:
TIGRFAM ID:
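
The gene length is consistent with the start and end coordinates if both are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it, and the helper name `span_length` is purely illustrative. A minimal sketch of the arithmetic:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end are both 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


# Coordinates taken from the Start bp and End bp fields of this record.
gene_length = span_length(1895243, 1899374)
print(gene_length)  # 4132, matching the Gene Length field
```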


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.600944
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ACGAAAACGA AGAACAAGAA AGCTGAAGCA CTCATTGAAA GCTTTAGCAA GTAAGAGAAA 
AGAATCTCGC ACTTCATTTC AAATCAAAAT CAATTTTTCA ATTCAAATAA TTCAAGCTGT
ACTATTCAGA TTCATTTCCA AAATCAAGTT CTATTACTAA TAATTTAAAT CTTACTTTTC
TAGCATTCCA ACATATGCAT ACACACGATG TCTGTCCAGT TCCAAAACCC GTCACCAGCA
GATGGAACCA TCAAGAAAAG AAAAAGACAA AGTCTTGTCT GCGAGAATTG CAAAAAGAAG
AAGATCCGTT GTGACAAAGG AGAACCATGT AGCCAATGTG TTCGGTCTAA ACTCACGGAC
TCGTGCCACT ATTCAGCACC TGTAAGTTCA CGCTTGGCTG CTAATAGAGC TGCCGAAACA
TCTTTGCCGT ATATATCTAG CGCTAAGACT CCTCTTGCTT TCGGAAAGAA AGTCGAGGAC
ATTGTAGTTC AACCTTTGAA GAAGAAGAAA CTTGAAACTG AGTCCGAAGG TTTAGCCTCA
ATTTCGCTTG AACCAGTCCA AACTAACCAG TCATCCACAT CTACCTTTTC AGCTTCAATT
CCAGCATCAA CTTCTATAGG TACTTCTATA CCTAATTCAT CCATTCAATC GTCAATACAA
TCGTCAGCAC ATTTACCTTT ACTGAGCCAT CAAGTACAAG TGCTGTCACA ACTTTTGCCA
CAAACTCAAC TTCAAAATCA AACACAGAAT TCGCAACAAT ATCAATACCA GCCACCTCCT
GCGGCTCCTC CTGCTCCAAT ACCTGATGAG CTAGCTGTTT CAGAAGACTA CAAGGTTACA
GTATCATTGT CTGAGTTGAA TATGCTTAAG CAACGTCTTC AACAGATTGA GTCTTCTATC
GTAGCTTCAT CATCAGCTAC TCCTCCAATA GGAACCCAAC AGAATTCCCA CCAACATGCT
CAACAACCAC AGTCACAACA ATTACAGCCA CAGCAGTATC AGCAATCACT ACAGTCACAA
CAGTATAATT CGGATCTTCA ACCGCAACGA GAATTTCAAG AGCAGAAGCA GCAGCAGCAC
CATCAACACC AACGACAATA TCAGCACCAA CAGAAACAAG AAAAGCCTAC GAGACTGAAT
GGAGACAACC ATCTTTCCCA ACATGGTTCA TCTTCTGGTA TCGCAACATC AACAAGATCA
AGCGAATCTT TCTCAAGCGT AAGCTCAATT TCAGTTAATA CTCCTCAGGT ACCTACTCGT
AATGAGCCTA TACAGTTACC TCCGATTCAA TTTAGAAACA GTTTACCACC CTCAACAACC
CCAAATGGTC CTCCTATCTT ACCACCACCT CCTGGTTTAT CTTACTCTAA TGGTAGAGGT
TATGAGTCTA ATACCAGTGG TAGCACGCGT TCACATCCGA CTATGAGTAA TGAGAGTACA
GGTGCAGCAC CAACAACAGC AAATTCCAGT CCGGATACGT CTATTAAGCA AGGATTGGTA
GATTCGCCAG CTTCTATATT ATCATCTGTG ACGACTCCTT CATCAAAAAA ATCTCTATTT
GGTTTGCATC CGTATGCTGA TGAAAATGAA AGGATCAACT TTTATGCCGA CTACACTTCC
ATTCATATAA AAGAACCAGA AAGAAGAATC AACTTTGGTC CCTTTGCTTG GTCGTCATTG
ATGAAGAAGG ATAAGGGATT AAGTCTCTTA TGGGACTACA TCATTAAGAA GAAGGAAGAA
AAGTCCCAGA CAGCCATGAT TTTCACTCAG GTATCCCACG AATTGACCCA GGAAAATACC
AATGTAGCCA CCTCCCAAGA TGTAGGGGAA AGCGAAAAGT CGTTCAAGAG AAGAGCTCTA
GAAGTTGACG GATATACTGA TATGATCCCT TACAACAGCA TCTTAAAAGC TAAAGTAGGA
AAGAATATCC AAAAGACAAA GTTGAATGAA AACACTTTGC CACTTGGATT GACTTATTAT
GATGGGCAAT TGAATCGTGA ATTGCAACTC ATTGAAAAGA TACATATGAT TTTACCAACA
AAGCTCGTTT TATGGAAGTT GGTTAGACGG TTCTTTACCT GGTTGTATCC TTTTATGCCG
TTTATTGACG AGGAGTGTTT CAAGGATAGC ATAACTAGTA TTATCGGTCC AGTGTCGTAT
GAAGATGTTA AGATCACAGA ACTTAAGGTT GAAAAGAGAT TGGATTTAGC CCATATTGGA
CTATTGTTAG TTGTGTTAAG ATTGGCATAT TTATCGTTGT TCTGTAACAG TAGTGAGGTC
AACGAAGCTA ATTTGAAGAC AACTGATCCA TCACCCAAAA AACAAGAAAT GAAGTACTTA
CTAAGCAGCC CAATTGATAT TAACACGGTT GATATAGCTC AAGAATGCTT GGACCAATTC
AATTTGCTAC GTAAAACATC TTTCATTATC TTTCAGTTGG GTTTCTATAT TAGATTGTAT
CATATTTATG CTCCCGAAGA TGGGGATGGT GCCGATGGTG GTGATTCTCA AGTGTTGAAC
GCAATTTTGA TTCAAATGGC ATATTCTCTA GGTTTGAACA GGGAGCCTGA TAACTTCAAG
GATATCTTGA ACAATCCAAA AACAAATAAT CTTGGAAGAA AAATGTGGAA TTATTTGTTA
CTTGGTGATC TCCATATCTC TTACTCTAAT GGTATGCCAA TGGTTACTGA TCCTATCTAT
TCTGATACGA GAGCCCCAGT GTACGAGCCA GGAAGTGAAA ATATATCTGA TATAACTTTG
GATAGATATG TCAGCGATTG TTACTTCGAA TGTGGCCAAA TGAGTGGCTC GTTAAGAAAG
GTGTTAAGAT TAGTTTTGAA TGTCCAAGAG GGAGCCAATA TGGCAGACTT GTGCAAACTG
TTGACTGAGT TTGAGTATTT GATATCTGAG CATTATGGTA CATTGGAAGA ATGTTTGAAA
CCACTCGAAG AAAATGAACA TTGGTTTGTA TTTACGAGAA ATTTTAGAAC AAAGTTCTAT
CTTGCCTTGA AGGCTTTCTA CGTTTCGATA TACTATCATT TTTATCTTTA CTATGAAACA
AAAGACATCA ATCTTTCGTT TTTTTATTTG AAAAAGATGC TAATCATAGC AGCTTACGAT
ATCATGCCAC ACTACTTCGA TTTGTTGGGG GGTAGTGAAA TTATTTGTGA TATGATCGTA
AATCCCACCC TTGAACAGAT CATACACAAA TGTAACCAAC TTACACTAGC ATTGATTATT
CGTATTAATT TTGTCATTTA TCATATGAAA AATCAGACAG ATCACAATAG CAAATGTTTA
TATGATAGCA ATTATGCAAC TTACTATCGT GGCCTTTGCA GATTTTCCTC TTGTCTTACG
AGGAGTGCTG AAGTTTCGAT TGCAGCCATT TCCAAAATTA GTAATAGGTA CTACTATGCC
TGGAGGATTA CTAAGGGTCA TACTTATTTG TTGAAGAATA TCACGACCAT GGAATTCTAC
GACGAGAACT ACTCTCAGGC TGGTTCACTT TGTCTTCCTC GGTTTACTCT TGATCAAATT
GAAGACATGA TAAATATCTG CGAAGCATCA CTCAGTAAGC TTGGAAAGGT TGATCTAGTT
GGAAATGAAT TCTGCAATGA CACCTCTATT TCTTCGATGA CTAACAATAC GAGGCCAGTT
GTTTCAAACG AGAGGACTTC TGCAGCAACA ACATCGGATG TTCTACCTGA TTCAAAAGCA
AACGCTGACA CCAAAACAAA TGATATAAAT TTGGAATTTG TGAATAACAA AGAAATTGAT
AACTTATGGT TCCAGATGCT TTCGATGAAA TACGACAACC AAGAAACATC CCAGTTTTCC
ACCCAAGGCA ATGTCCCGGA GAATAGGAAG CCTAGTTTCA GCTTTGGAGG GTTTTCTCCC
GGCCCAATGG GTGGCTTCAA AACCCCTAAC CCAGATAGTC CCATGCGAAA CATGTACGGT
TCTCCTAAAC CTGATGTTGC GCTTGGTGGT GATATCGACC GCTATGGGTT CGACATGGAG
CAGGCTGCGC AATTTGATAT ATTCAGTGAG TTGCCCTTTG ATCAAGTGTT CAAGTAGAAG
ACAGTTAAAT AGTTTCACAT GACTGCAACA AATATATTAA GTTTTAGAAA AG
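
The gene sequence is printed in space-separated groups of 10 bases, so the grouping has to be stripped before the length or GC content can be recomputed. The sketch below shows one way to do that. The function names `clean_sequence` and `gc_fraction` are hypothetical, and only the first sequence line is used as input, so the printed GC fraction describes that line alone and not the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a sequence printed in groups of 10."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as printed in the record.
first_line = "ACGAAAACGA AGAACAAGAA AGCTGAAGCA CTCATTGAAA GCTTTAGCAA GTAAGAGAAA"
seq = clean_sequence(first_line)
print(len(seq))  # 60 bases per full line
print(round(gc_fraction(seq), 2))
```

Applying the same two functions to the full sequence block would allow the Gene Length and GC content fields to be checked against the sequence itself.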
 
Protein sequence
MSVQFQNPSP ADGTIKKRKR QSLVCENCKK KKIRCDKGEP CSQCVRSKLT DSCHYSAPVS 
SRLAANRAAE TSLPYISSAK TPLAFGKKVE DIVVQPLKKK KLETESEEDY KVTVSLSELN
MLKQRLQQIE SSIKQEKPTR SNGDNHLSQH GSSSGIATST RSSESFSSVS SISVNTPQVP
TRNEPIQLPP IQFRNSYESN TSGSTRSHPT MSNESTGAAP TTANSTSILS SVTTPSSKKS
LFGLHPYADE NERINFYADY TSIHIKEPER RINFGPFAWS SLMKKDKGLS LLWDYIIKKK
EEKSQTAMIF TQVSHELTQE NTNVATSQDV GESEKSFKRR ALEVDGYTDM IPYNSILKAK
VGKNIQKTKL NENTLPLGLT YYDGQLNREL QLIEKIHMIL PTKLVLWKLV RRFFTWLYPF
MPFIDEECFK DSITSIIGPV SYEDVKITEL KVEKRLDLAH IGLLLVVLRL AYLSLFCNSS
EVNEANLKTT DPSPKKQEMK YLLSSPIDIN TVDIAQECLD QFNLLRKTSF IIFQLGFYIR
LYHIYAPEDG DGADGGDSQV LNAILIQMAY SLGLNREPDN FKDILNNPKT NNLGRKMWNY
LLLGDLHISY SNGMPMVTDP IYSDTRAPVY EPGSENISDI TLDRYVSDCY FECGQMSGSL
RKVLRLVLNV QEGANMADLC KSLTEFEYLI SEHYGTLEEC LKPLEENEHW FVFTRNFRTK
FYLALKAFYV SIYYHFYLYY ETKDINLSFF YLKKMLIIAA YDIMPHYFDL LGGSEIICDM
IVNPTLEQII HKCNQLTLAL IIRINFVIYH MKNQTDHNSK CLYDSNYATY YRGLCRFSSC
LTRSAEVSIA AISKISNRYY YAWRITKGHT YLLKNITTME FYDENYSQAG SLCLPRFTLD
QIEDMINICE ASLSKLGKVD LVGNEFCNDT SISSMTNNTR PVVSNERTSA ATTSDVLPDS
KANADTKTND INLEFVNNKE IDNLWFQMLS MKYDNQETSQ FSTQGNVPEN RKPSFSFGGF
SPGPMGGFKT PNPDSPMRNM YGSPKPDVAL GGDIDRYGFD MEQAAQFDIF SELPFDQVFK
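
The record lists translation table 12, the alternative yeast nuclear code, which differs from the standard code in reading CTG as serine rather than leucine. The sketch below builds that table and translates the first 11 codons of the coding sequence. The helper `translate` is hypothetical, and the start position (the ATG in the fourth line of the gene sequence) is inferred here by matching against the protein sequence, not stated in the record. Because the gene is spliced, translating the full gene sequence directly would not reproduce the protein.

```python
from itertools import product

BASES = "TCAG"
# NCBI translation table 12: same as the standard code except that
# CTG encodes serine (S) instead of leucine (L).
AMINO_ACIDS_TABLE_12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS_TABLE_12)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon, ignoring whitespace
    and any trailing partial codon."""
    cds = "".join(cds.split()).upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 11 codons, copied from the fourth line of the gene sequence (assumed start).
cds_start = "ATG TCTGTCCAGT TCCAAAACCC GTCACCAGCA"
print(translate(cds_start))  # MSVQFQNPSPA, the first 11 residues of the protein
print(translate("CTG"))      # S under table 12 (L under the standard code)
```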