Gene PICST_81861 details


Gene Information

Locus tag: PICST_81861
Symbol: FST3
ID: 4837202
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 1899935
End bp: 1903547
Gene Length: 3613 bp
Protein Length: 1095 aa
Translation table: 12
GC content: 40%
IMG OID: 640388517
Product: Fungal transcriptional regulatory protein
Protein accession: XP_001383136
Protein GI: 150864358
COG category:
COG ID:
TIGRFAM ID:
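
The listed coordinates appear to be 1-based and inclusive, since End bp minus Start bp plus one equals the listed Gene Length. A minimal sketch of that arithmetic follows; the function name `gene_length` is illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a span given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
length = gene_length(1899935, 1903547)
print(length)  # 3613
```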


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.518071
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGAGC ATACAATCAG AAAGAGAAAC AGACCAAGTT TGGTCTGTAA ACCATGCAAG 
AAGAGAAAGA TCAAGTGTGA CAAAGGAAAG CCATGTACGT CTTGTGTCAA AAACAAGATG
GACCATCTTT GTGTCTATGA CGAAAAATGG ATTGGCAGCA AGAAATCCAA AAGATCAAAA
ACTACTGTTG CCAAAGTGCT ACCATTGGCT TCTTCTGTTC CAGGTGTAAA TGCTATTACC
AATAACGGAA TATCCATCCA CACTGGAGTC ATTAGCCCCA CCAGTATAAA ACACGAGCTC
CTCTCAGATT CAGGCGAGAA ACCTCTCCAA CAGACTAAGG ACCCGAATAA GGAAAAATCT
GTAGTTGTGA TAGTGCCGAA GTCAGAACTT GACGAACTAC GGGCAAAAGT CCAGAGGTAT
GAGCATGGTA TGATATCTCC TTCGAGATCT GATAACTCGG AGGAACAGAA CAGTACTCTG
TCATCGACAA GTTCGAGCTA TCGCAACATA AACAAAGTCT ACCCTCCTCC AGAAGTTAGT
ATCCATTTCT ATAACCGATC CATAAATCCT ACCAAGCGTT CATTGTATGC ATATCATGAT
CCAGATAGAC CTGGAGAGAT CCCCGTTGAC GATCTTTTTG TGAACAACCA GTGTTTTATC
ACCAGAGATA TGCCACCGCC TAAAGCCAAG CGTATGACGT TGGATGACGT GCCCAAACAC
GATTACTACG CAGTAGGTAT CAACCCGTAC GAATCACCTA CAGACACTAT TAACTTCTAC
GAAAACTATA ACCCTGTACA AGTGAAGGAC TGTTCCAAAC GTATAAGCTT TGGACCTCTT
GCATGGGCTA CGATTCTGAA GAAGGACAGA ACTCTCTCGC TTATCAAGAA ATTCACGTCA
TCACAGAAAG CCCAGTCGGT TTTGGAAAGG ATCAATAGGT CCAGAACTGC TACGGCAGAA
CCCAACGCTG TCTTAGTCGG CTGTATGGTA GAACCACCAC AAGAAGAACT CAAGTCCACA
TTGAATGTAG ATGGCAGCCA AATCAAACAT TTTGAGCAGA AAGTATTGGA AGAGGAAGGT
TACAATGACG TTAAGCCATA TAACGAAACG AAGACAAGTA GCAAGAAGCA AAGTGCAAAA
AAAGAAGTAT CACACAACGT GTTCCGTGAC ATGGACAAGA AGTCTCTCAT GTTGGGGCTT
ACTATTTTGG AGAAGCTGAT CGATAGAGAG TTGAAACTTA TCGAACAGAT TCAAGTATCT
CTTCCCAAAC AGAGAGTCAT TTGGCTCTTG ATAGACCGTT TCTTTAAAGT AGTATACCCA
TTTGTACCAT ATCTCGATGA AGAGGACTTT AGATACCAAG TTGCTAGAAT TATTGGTCCT
GAAGGTTACG AAGAAACTAA GGTCGAAGTC AATGTAGAAA AAAGATTAGA CTTGTCGTAC
TTGGGAGTGT TACTAGTTGT GGGAAGATTA TCCTATTTGT CGCTTTTTTC CAATCGTGTT
TCGATAAATG AAAAGATCTT ACGTAGCAAA GATCTCACCC ATGAACAGGA AGTAATAAAG
TATTTATTGA TGAATGCCGT CAATATTGGT GCCATTGATT TGGCTCGTGA ATGCCTTCAC
CAGTTTGATT TGTTAAGAAA GTTGAGCTTC CCAGTATTGC AATGTGCTCT TTATATCAGA
TTGTACAATC TGCTTGCACC AGAAGACGGC GATGGTTTGG ATGGTGGCGA TGGTCAAGTG
TTTAACGGAA TGTTGGTTCA GATGTGCTAC TCTATTGGCT TGAACAGAGA GCCTGACAAT
TACAGAGAAA CATTCACAGA CGAGAAAACA AATAACTTGG GAAGAAAGAT CTGGTACTAT
GTTGTAGTCA ATGATTATAT TCAAGCGTAT ACTTTCGGCA ATCCCTTGAC TACTAATTCG
ACGTACTACG ACACCAAAAT GCCATTCATC ACGAGTGATA ATACCAATCT CAGAGACGTA
GAGCTTGATA AATCTATTGT GAGTTGCTAC ATGTTTGATG AAGCACTTCT CAGGGGGCCA
ATACGTGAAA TCCTCAACCA AATTTTAAAC ATCAACGTGC GAGTGAAGGT CCCCCCATTT
ACCCAGTATG TTAACCATTT GGAACTTGGT ACGATTAGTA TCTTTGGCAG AATTCGTGAT
TATTCACACC GTTTAGAGTC TGAGGATGTT TCTTATCGTT TTAACAAGAT AAGAAAGGCA
TATTCCCTTA TCTCATTCAA TGCTTTTTTC GTATCGGTAT ACTACTATTT CTATGATTAC
TATGAAAGCA GGGGAAACCA GGTGTTATCA GTATTTTATT TGAAGAAGTT GTTGGCAATC
ACTGTAGGGG AAATGATACC GTACTTCTTC CCTCTTATCA TCAGATCTGC TGAGATCTTT
GGTGATGGTT CAGATTTGAT GATCAACCCT CCTATCATTC AGGGAATACA ACGTACAAGT
GATCTGGTAA TGATTATCAT GGTTAAACTC AATTTTTGGC TTATTAATAC TGTCAATAAT
CCTCAGCATA ATTTCATGAT GAAGAATGAT GCCAAATACA GAAACTACTT CGAATTGGTG
TCTACTTTGG TTGTTTGTTT GGAGAAATGT GCTAAGATTT GTCTCATGGC CTCATCAATA
ATGAGTAGCA GATACTACTA TGCCTGGGGG ATCTACAAGA GCCACAATTT CTTATTCAAA
GTTATCAGCG ATCGGGAATT CTATACTCAG AACCTAGACT TTGAACTGCA CTACATACCT
CCTACCAAAG ATCAACTTCA GGACATAACC AACTTGGTAC AGAACTCATT AAATATCTTG
AACCGCACTG TTGGTGAGCA TTGCGACAAA GTGGATCTTA GGACCTTGTT CAAGGTGTAT
GGTCAAGTTC CTAGTGTGTT GAAAGACTTG GAAACGGCTA ATCCAGAAAG TATGAACAAG
CCTACTTCTT TCAATATGCC TATCGGCGAC TTTCCTGGTC TTTCTAGTCC GGACACACCA
TTGTCAGACA CATCCAGTGC ACAGGGTGTT CTACTTGGTA GCTTTGACGA CTTAAAGTAT
GACGGTAGTG CTGAAATTGA CTCGTTGTGG TTGCAAATGC TTTCACAAAA GACTAACAAA
CAGAATACTT TTGAAGAGGC ATTCAATCTC GTCGACAATA ATGTAGCTAA TTCTAACAAT
ATTCCCTCCG GATCTGATAG TTCTACTTTC ACTGGTAGTA CTGCGAATAC TGGAGTCTCT
GGAAGTGGAA GTTCTGCTGC AACTTCTACG CCGGCCAATG GACAGCAGCC TTCTACAGAT
TACAATTTCA CTCTGACAAA CTATCAGCTC TTTGAGAGTT TAGGGTCTGT ATTCGATGGA
GGAGCAAACG GTCACTTTGA TTTCTTCAAC GACTTGCCAT TGGATAAGGT GTTCAACTAG
TTGAAGTTAA GACGTATCCA CAAAAAGTAC TAACATCACT ATTTTAATCT TTTCTACGAG
TCTGGTTTTC TCATTTTTTC AACCATCTTT TATAACTTTT CTTCGACTGG TCTAATCTAT
TGTCATTGTG GTGTCGTGTG TTTCTTTTCA TTCTTATCTT GTATTTAATT TATTGGAGGT
TAATCAAAGT TAG
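
The gene sequence above is displayed in blocks of ten bases, six blocks per line. As a rough sketch, the blocks can be joined and the G+C fraction computed as shown here. The helper names `clean_sequence` and `gc_fraction` are assumptions made for illustration, and the sample input is only the first displayed line, so its result is not expected to match the whole-gene GC content listed in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence only (not the full gene).
first_line = "ATGGACGAGC ATACAATCAG AAAGAGAAAC AGACCAAGTT TGGTCTGTAA ACCATGCAAG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

To process the full gene, the same two helpers can be applied to all sequence lines pasted into one string, since `clean_sequence` discards every kind of whitespace.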
 
Protein sequence
MDEHTIRKRN RPSLVCKPCK KRKIKCDKGK PCTSCVKNKM DHLCVYDEKW IGSKKSKRSK 
TTVAKVLPLA SSVPGVNAIT NNGISIHTGT KDPNKEKSVV VIVPKSELDE LRAKVQRYEH
GMISPSRSDN SEEQNSTSSS TISIHFYNRS INPTKRSLYA YHDPDRPGEI PVDDLFVNNQ
CFITRDMPPP KAKRMTLDDV PKHDYYAVGI NPYESPTDTI NFYENYNPVQ VKDCSKRISF
GPLAWATISK KDRTLSLIKK FTSSQKAQSV LERINRSRTA TAEPNAVLVG CMVEPPQEEL
KSTLNVDGSQ IKHFEQKVLE EEGYNDVKPY NETKTSSKKQ SAKKEVSHNK SLMLGLTILE
KSIDRELKLI EQIQVSLPKQ RVIWLLIDRF FKVVYPFVPY LDEEDFRYQV ARIIGPEGYE
ETKVEVNVEK RLDLSYLGVL LVVGRLSYLS LFSNRVSINE KILRSKDLTH EQEVIKYLLM
NAVNIGAIDL ARECLHQFDL LRKLSFPVLQ CALYIRLYNS LAPEDGDGLD GGDGQVFNGM
LVQMCYSIGL NREPDNYRET FTDEKTNNLG RKIWYYVVVN DYIQAYTFGN PLTTNSTYYD
TKMPFITSDN TNLRDVELDK SIVSCYMFDE ALLRGPIREI LNQILNINVR VKVPPFTQYV
NHLELGTISI FGRIRDYSHR LESEDVSYRF NKIRKAYSLI SFNAFFVSVY YYFYDYYESR
GNQVLSVFYL KKLLAITVGE MIPYFFPLII RSAEIFGDGS DLMINPPIIQ GIQRTSDSVM
IIMVKLNFWL INTVNNPQHN FMMKNDAKYR NYFELVSTLV VCLEKCAKIC LMASSIMSSR
YYYAWGIYKS HNFLFKVISD REFYTQNLDF ESHYIPPTKD QLQDITNLVQ NSLNILNRTV
GEHCDKVDLR TLFKVYGQVP SVLKDLETAN PESMNKPTSF NMPIGDFPGL SSPDTPLSDT
SSAQGVLLGS FDDLKYDGSA EIDSLWLQML SQKTNKQNTF EEAFNLVDNN VANSNNIPSG
SDSSTFTGST ANTGVSGSGS SAATSTPANG QQPSTDYNFT STNYQLFESL GSVFDGGANG
HFDFFNDLPL DKVFN