Gene PICST_68146 details

Gene Information

Locus tag: PICST_68146
Symbol: SFL1
ID: 4839822
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 1176157
End bp: 1179570
Gene Length: 3414 bp
Protein Length: 921 aa
Translation table: 12
GC content: 44%
IMG OID: 640391137
Product: putative transcription factor
Protein accession: XP_001385578
Protein GI: 150866095
COG category: [K] Transcription
COG ID: [COG5169] Heat shock transcription factor
TIGRFAM ID:
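
The Gene Length value follows from the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. That reading is an assumption, although it reproduces the 3414 bp listed in this record. A minimal sketch (the function name is illustrative and not part of any database API):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both end points belong to the feature, hence the +1.
    return end_bp - start_bp + 1


print(gene_length(1176157, 1179570))  # 3414
```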


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.800099
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TGTGGATTGT TTCGTTTCTT CTTGTTTATT CGCTACAGTT TCAAGATTTT CCCATTTTAC 
CACTGGTTTG TTCTGCCGTA CTTGCTTGTA CCTGATACTC AACGTTGCAC AATACAGCCT
GAAAACCCCT GTACGATTTT TGAGGAAACA CTTGCCCGGT TTTTTATCTG AAATTTCACC
AGAATAAGCA CAGCTGAAGT CATCGGTCAG CCCTATAAAA CACTGTGGCA ACATAAAGAA
AGACCCTGTC GACCGTTCCT CTTCGCCGTC TCCGTCTTTT CAACCTGGTA GACCATATTT
GTGTGATCAT CTGGCTCTCC GAGTTTTTTC TGGCTCGATA ATTTTAAGTG TTGTTCTAGA
ATCAATTCAA TCCAGTAATT TTTCAGAACA TTAAACACAC TTCGCCAAAG TTACGACAAA
TCTACAACTT TCCGTACTAT CCACCACTAG CCATTTTGTA ATTCTAGCCA TTTCGCTAAC
TCATTTCACT GTTGTAACTC ATCTCAGCAC TGTACTATAC TTGTCATCAC TGTTTTATTA
CCTGCCTATC GTATCATGTA CATATAACCA TGTCCGCCGT AATTCCCTCG ACCAAGATTA
AGTTATCACC CGAAGCATTG GAAGCCTCTC ACGAACTGGC TCCGGCAAAT CCTACACTGG
GACAGACGAA TGTTGCGAAT AATTCTGCTG TAAACTCTAC AGCAAATTTA ACGACTGCAA
CGGGATCAAA TGCTGGATCA AATTCTGGGT CCAATTCGGG CTCCAATTCG GGCTCCAATT
CAGGCTCCGT TTCTGGTCCG GCCTCTTCAG CTAACTCTTC TGGAAAGACC CAGACTGTTT
TTATCCATAA ATTGTACGAC ATGCTTCACG ATGAGACGAT TTCACATCTT ATCTGGTGGT
CGCCTTCCAA CGACTCGTTC TGTCTTTTGC CAGGGGAAGA GTTCTCAAAG GTGTTGGCTC
AGTACTTCAA ACACACCAAC ATCGCCAGTT TCATCAGACA GTTGAACATG TACGGCTTCC
ATAAAGTCAA CGATACCTTC CAGAACAATG AAGATGGCTC AGGCTCTAAC GCAAACGCTA
ACTCTGTAAA CTCTTCTGGT ACTTCAACCA ACTCAACCAA CAACAGTAAC CCCAATAAAT
GGGAGTTCCG CCACCTGACC AACCAGTTCC GTAAAGGTGA TATCGAACTG CTAAAGCTTA
TAAAACGTAG ATCTTCTAAA AACATCAACT CGCACAAGGA AATCGTCAAT TTGAAGTCAT
TGCCTCCTAC TTCGAATCCG ATTATGGATC CGAATTCTGG ATATGGTCCA GCTCATGGCC
ATTACTATGG CTATAGTGAC GATGAAACTT CATCCATTGC CAGTGCGAGA AGTCCTCATG
GCTCTTCAGA CAATTTGCAC CAACAATATC ACCAATCGCT CAGAATACAC CAGCAGTCGC
TTTTGATCAA TAACGAAGGT AGAACTACTC CTAACGGCCA GATACAGCAT CCTCCACAAC
AGCAGCAACA AATTCAACAA CCCCAAGCTC AACAATTACA CGGCCTTGGA CTTTCAGCTC
CACAGTCACC AGCACATACT CCAATACACC AACCGTTATC TCCACGATTG CAGCCACCTC
CTCTTGTAAC CAATCCGTCG TTTGAAAACT CCATCAACTT CAAGTTCATA GAGTTGAACA
ACCAGTTGAG TTCGCTCAAA AACGAACTCA ACGTAATGCA CCAGAAGTAT GATCTGGCCC
ATTCGGAATT GGTCAGAAAT CAGTCAGACA CATTGCAGCT TGTAGAAATG CTAGAGAGAT
TTGTCCAGGC AACAGAAGCA GCACCAGAGA AACCTACAAT CAATTCCAAC ATCTTCAACA
ACTCGAATTC AAATGCTATT CCTACTGTAA CGGTAAACTC AAACGTTGCC AGAAAGGATG
TGCCTGATAG AGTGTCTAAT AACAAAACGC CCATCAATAA CACGGGTCGT CTAGCTGATG
AGGAGGGTAA TACTTCTCCT ATGTCTGGTG CCTTGTTCAA ATCCACGTCT TCTAGTTCTA
ATTCTGCCAC TTCTGCTTCT GTAACTTCTA CTTCAGCTAC TTCAGTTTCC GGTACTGGTT
CTTCTACAAC GTCTGCTGCC AGAACTCTTC TGGGAGAAAT CGGCTCATTA AAGTCCATAC
TTCTTCAGAG ACTAAGAGCT TCCACACAAC ACCAACAGCC TCCTCTTCGA GTACAGCAGG
CGAGTACATC TTCTTCTTAC CACTTAAGCA ATGCTCCCCA CACCACCAAC AACTCGCGTA
ATCCTTCAAA CTCTAATATA CAGATAGTGC CTCAACACTA TCCGTTGAAT CCACACTATA
CCATCTACCC ACAGAGTGAG TATAGGGGCC CAGACTTGAA TCAGAACCAG TCCCAGAATC
AAAATCAGCA TTCCAAACTG ATAACAGAAG AGTCTGCCTC TGCCAATCGC CATTTGTCCA
TCTTGATGGA CCCCTTGCAA CCGATGCCTA CTCGTAATCC TATTCTAGAT GAGCAATCTA
CCAACTTTAG ATTGAGAGCC GAGTCGAAAA GCTACTCTCC TCTTACTGTT GGAGGTGCCA
GTGGCCAGTT ACCTACCAGT TCTGGGAATG CTAATCAAGC CCAGCAAGCA CATGTCACAC
AACTACAGCC CCCAGCGCAT CTTCATTCAC AGCCACCTAC TGCAAATGGA CCTGGAGGAT
CACAGCAGAA TTCACAGTCT TCGTCTAGAT CGTCTATCGT GGAGAAGAAG ACAAGCTCTC
GTTCTTCCAG CTTGATCAAC ACTCCTGCTG GTCATGATAG TAGCAAGCCT TCGTATCAGC
AGTATCCATT TCCACATGTA AATAGTCATT CGACTCATTT GCATTCATAC CATAACGGCT
CGACCAGAAC TAATTCTCTT CCAAACCCGC CCATCGACCA TAGCGTGACT ACACCTTTAA
CTCCGCCAGA ACCTCCTTCT ACCAACCCTT CCTACTTCAA CCAAAGAAAC TCCTTCACTT
CCATGTACGA TTCCCATCAA CAGCAGGGTC ACAATTACAG AGTGCCTCCT TATTCTCAAC
AGCCTACGTC CATGCCTAAA CCTACAGGCC CTATCACTTC TTCACCAATA TCTACTACTT
CTCCTCTGAG ACTTCAGGAG TCCTCAGAAT TTGCGACCTC CATGTCTAAG AACCAGTTGC
CAAGTGTAAG TGAACTAGAC AAGTCGATTA AGGGTGCCTC TTCCAATGTT CCGAAGTCGG
CTACTACCTT ACCGCCATTG CATCTGAGCC CTATGTTTGC GTTGTTGAAC AAGGACAATG
ACGATGACAA AATACTCAAG AAGCGTAAAA CCTGATTGGG TAACAAATGT TTCCTGTCCC
ATTAATGATA GTTTATAGAA TCTGTATGAA TACCAATGTA TTTTTATTCT GCAA
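
The gene sequence is printed in space-separated blocks of 10 bases, so the spacing has to be stripped before any computation. The sketch below shows one way to derive a GC percentage from such a listing for comparison with the GC content field. The helper name is hypothetical, and no result for the full sequence is asserted here.

```python
def gc_percent(blocked_sequence: str) -> float:
    """GC percentage of a DNA sequence given in whitespace-separated blocks."""
    # Remove the spaces and line breaks used for display, normalise the case.
    seq = "".join(blocked_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "TGTGGATTGT TTCGTTTCTT CTTGTTTATT CGCTACAGTT TCAAGATTTT CCCATTTTAC"
print(round(gc_percent(first_line), 1))
```

Passing the whole listing (all lines joined) gives the figure for the full gene.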
 
Protein sequence
MSAVIPSTKI KLSPEALEAS HESAPANPTS GQTNVANNSA VNSTANLTTA TGSNAGSNSG 
SNSGSNSGSN SGSVSGPASS ANSSGKTQTV FIHKLYDMLH DETISHLIWW SPSNDSFCLL
PGEEFSKVLA QYFKHTNIAS FIRQLNMYGF HKVNDTFQNN EDGSGSNANA NSVNSSGTST
NSTNNSNPNK WEFRHSTNQF RKGDIESLKL IKRRSSKNIN SHKEIVNLKS LPPTSNPIMD
PNSGYGPAHG HYYGYSDDET SSIASARSPH GSSDNLHQQY HQSLRIHQQS LLINNEGRTT
PNGQIQHPPQ QQQQIQQPQA QQLHGLGLSA PQSPAHTPIH QPLSPRLQPP PLVTNPSFEN
SINFKFIELN NQLSSLKNEL NVMHQKYDSA HSELVRNQSD TLQLVEMLER FVQATEAAPE
KPTINSNIFN NSNSNAIPTV TVNSNVARKD VPDRVSNNKT PINNTGRLAD EEGNTSPMSG
ALFKSTSSSS NSATSASVTS TSATSVSGTG SSTTSAARTL SGEIGSLKSI LLQRLRASTQ
HQQPPLRVQQ ASTSSSYHLS NAPHTTNNSR NPSNSNIQIV PQHYPLNPHY TIYPQSEYRG
PDLNQNQSQN QNQHSKSITE ESASANRHLS ILMDPLQPMP TRNPILDEQS TNFRLRAESK
SYSPLTVGGA SGQLPTSSGN ANQAQQAHVT QLQPPAHLHS QPPTANGPGG SQQNSQSSSR
SSIVEKKTSS RSSSLINTPA GHDSSKPSYQ QYPFPHVNSH STHLHSYHNG STRTNSLPNP
PIDHSVTTPL TPPEPPSTNP SYFNQRNSFT SMYDSHQQQG HNYRVPPYSQ QPTSMPKPTG
PITSSPISTT SPSRLQESSE FATSMSKNQL PSVSELDKSI KGASSNVPKS ATTLPPLHSS
PMFALLNKDN DDDKILKKRK T
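
Translation table 12 is the NCBI alternative yeast nuclear code, which reads CTG as serine instead of leucine. The sketch below translates the first 90 bases of the open reading frame, copied from the gene sequence starting at the ATG that matches the start of the protein sequence, under both codes. The helper names are illustrative, and the codon tables are built by hand rather than taken from a library.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order, standard code (table 1).
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def codon_table(aas: str) -> dict:
    """Map each codon (TCAG order) to its one-letter amino acid."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, aas))


TABLE_1 = codon_table(STANDARD_AAS)
# Table 12 differs from the standard code only in CTG -> Ser.
TABLE_12 = dict(TABLE_1, CTG="S")


def translate(dna: str, table: dict) -> str:
    """Translate in frame 0 and stop at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 codons of the ORF, copied from the gene sequence in this record.
ORF_START = (
    "ATGTCCGCCGTAATTCCCTCGACCAAGATT"
    "AAGTTATCACCCGAAGCATTGGAAGCCTCT"
    "CACGAACTGGCTCCGGCAAATCCTACACTG"
)

print(translate(ORF_START, TABLE_12))  # MSAVIPSTKIKLSPEALEASHESAPANPTS
print(translate(ORF_START, TABLE_1))   # MSAVIPSTKIKLSPEALEASHELAPANPTL
```

Only the table 12 result agrees with the first 30 residues of the protein sequence listed above. The two CTG codons in this stretch appear as S rather than L.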