Gene Information |
Locus tag | PICST_68146 |
Symbol | SFL1 |
ID | 4839822 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 1176157 |
End bp | 1179570 |
Gene Length | 3414 bp |
Protein Length | 921 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640391137 |
Product | putative transcription factor |
Protein accession | XP_001385578 |
Protein GI | 150866095 |
COG category | [K] Transcription |
COG ID | [COG5169] Heat shock transcription factor |
TIGRFAM ID | |
|
|
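The coordinates and lengths above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the coding region ends with a single stop codon. The function names `gene_length` and `cds_length` are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def cds_length(protein_aa: int, stop_codons: int = 1) -> int:
    """Nucleotides needed to encode a protein, assuming one stop codon."""
    return 3 * (protein_aa + stop_codons)


# Values taken from the record above.
span = gene_length(1176157, 1179570)   # reported Gene Length: 3414 bp
coding = cds_length(921)               # reported Protein Length: 921 aa
print(span, coding, span - coding)
```

Under these assumptions the span is 3414 bp, matching the reported gene length, while the 921 aa protein accounts for 2766 bp of it. The remaining 648 bp would then be sequence flanking the coding region within the reported gene span.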
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.800099 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TGTGGATTGT TTCGTTTCTT CTTGTTTATT CGCTACAGTT TCAAGATTTT CCCATTTTAC CACTGGTTTG TTCTGCCGTA CTTGCTTGTA CCTGATACTC AACGTTGCAC AATACAGCCT GAAAACCCCT GTACGATTTT TGAGGAAACA CTTGCCCGGT TTTTTATCTG AAATTTCACC AGAATAAGCA CAGCTGAAGT CATCGGTCAG CCCTATAAAA CACTGTGGCA ACATAAAGAA AGACCCTGTC GACCGTTCCT CTTCGCCGTC TCCGTCTTTT CAACCTGGTA GACCATATTT GTGTGATCAT CTGGCTCTCC GAGTTTTTTC TGGCTCGATA ATTTTAAGTG TTGTTCTAGA ATCAATTCAA TCCAGTAATT TTTCAGAACA TTAAACACAC TTCGCCAAAG TTACGACAAA TCTACAACTT TCCGTACTAT CCACCACTAG CCATTTTGTA ATTCTAGCCA TTTCGCTAAC TCATTTCACT GTTGTAACTC ATCTCAGCAC TGTACTATAC TTGTCATCAC TGTTTTATTA CCTGCCTATC GTATCATGTA CATATAACCA TGTCCGCCGT AATTCCCTCG ACCAAGATTA AGTTATCACC CGAAGCATTG GAAGCCTCTC ACGAACTGGC TCCGGCAAAT CCTACACTGG GACAGACGAA TGTTGCGAAT AATTCTGCTG TAAACTCTAC AGCAAATTTA ACGACTGCAA CGGGATCAAA TGCTGGATCA AATTCTGGGT CCAATTCGGG CTCCAATTCG GGCTCCAATT CAGGCTCCGT TTCTGGTCCG GCCTCTTCAG CTAACTCTTC TGGAAAGACC CAGACTGTTT TTATCCATAA ATTGTACGAC ATGCTTCACG ATGAGACGAT TTCACATCTT ATCTGGTGGT CGCCTTCCAA CGACTCGTTC TGTCTTTTGC CAGGGGAAGA GTTCTCAAAG GTGTTGGCTC AGTACTTCAA ACACACCAAC ATCGCCAGTT TCATCAGACA GTTGAACATG TACGGCTTCC ATAAAGTCAA CGATACCTTC CAGAACAATG AAGATGGCTC AGGCTCTAAC GCAAACGCTA ACTCTGTAAA CTCTTCTGGT ACTTCAACCA ACTCAACCAA CAACAGTAAC CCCAATAAAT GGGAGTTCCG CCACCTGACC AACCAGTTCC GTAAAGGTGA TATCGAACTG CTAAAGCTTA TAAAACGTAG ATCTTCTAAA AACATCAACT CGCACAAGGA AATCGTCAAT TTGAAGTCAT TGCCTCCTAC TTCGAATCCG ATTATGGATC CGAATTCTGG ATATGGTCCA GCTCATGGCC ATTACTATGG CTATAGTGAC GATGAAACTT CATCCATTGC CAGTGCGAGA AGTCCTCATG GCTCTTCAGA CAATTTGCAC CAACAATATC ACCAATCGCT CAGAATACAC CAGCAGTCGC TTTTGATCAA TAACGAAGGT AGAACTACTC CTAACGGCCA GATACAGCAT CCTCCACAAC AGCAGCAACA AATTCAACAA CCCCAAGCTC AACAATTACA CGGCCTTGGA CTTTCAGCTC CACAGTCACC AGCACATACT CCAATACACC AACCGTTATC TCCACGATTG CAGCCACCTC CTCTTGTAAC CAATCCGTCG TTTGAAAACT CCATCAACTT CAAGTTCATA GAGTTGAACA ACCAGTTGAG TTCGCTCAAA AACGAACTCA ACGTAATGCA CCAGAAGTAT GATCTGGCCC ATTCGGAATT GGTCAGAAAT CAGTCAGACA CATTGCAGCT TGTAGAAATG CTAGAGAGAT 
TTGTCCAGGC AACAGAAGCA GCACCAGAGA AACCTACAAT CAATTCCAAC ATCTTCAACA ACTCGAATTC AAATGCTATT CCTACTGTAA CGGTAAACTC AAACGTTGCC AGAAAGGATG TGCCTGATAG AGTGTCTAAT AACAAAACGC CCATCAATAA CACGGGTCGT CTAGCTGATG AGGAGGGTAA TACTTCTCCT ATGTCTGGTG CCTTGTTCAA ATCCACGTCT TCTAGTTCTA ATTCTGCCAC TTCTGCTTCT GTAACTTCTA CTTCAGCTAC TTCAGTTTCC GGTACTGGTT CTTCTACAAC GTCTGCTGCC AGAACTCTTC TGGGAGAAAT CGGCTCATTA AAGTCCATAC TTCTTCAGAG ACTAAGAGCT TCCACACAAC ACCAACAGCC TCCTCTTCGA GTACAGCAGG CGAGTACATC TTCTTCTTAC CACTTAAGCA ATGCTCCCCA CACCACCAAC AACTCGCGTA ATCCTTCAAA CTCTAATATA CAGATAGTGC CTCAACACTA TCCGTTGAAT CCACACTATA CCATCTACCC ACAGAGTGAG TATAGGGGCC CAGACTTGAA TCAGAACCAG TCCCAGAATC AAAATCAGCA TTCCAAACTG ATAACAGAAG AGTCTGCCTC TGCCAATCGC CATTTGTCCA TCTTGATGGA CCCCTTGCAA CCGATGCCTA CTCGTAATCC TATTCTAGAT GAGCAATCTA CCAACTTTAG ATTGAGAGCC GAGTCGAAAA GCTACTCTCC TCTTACTGTT GGAGGTGCCA GTGGCCAGTT ACCTACCAGT TCTGGGAATG CTAATCAAGC CCAGCAAGCA CATGTCACAC AACTACAGCC CCCAGCGCAT CTTCATTCAC AGCCACCTAC TGCAAATGGA CCTGGAGGAT CACAGCAGAA TTCACAGTCT TCGTCTAGAT CGTCTATCGT GGAGAAGAAG ACAAGCTCTC GTTCTTCCAG CTTGATCAAC ACTCCTGCTG GTCATGATAG TAGCAAGCCT TCGTATCAGC AGTATCCATT TCCACATGTA AATAGTCATT CGACTCATTT GCATTCATAC CATAACGGCT CGACCAGAAC TAATTCTCTT CCAAACCCGC CCATCGACCA TAGCGTGACT ACACCTTTAA CTCCGCCAGA ACCTCCTTCT ACCAACCCTT CCTACTTCAA CCAAAGAAAC TCCTTCACTT CCATGTACGA TTCCCATCAA CAGCAGGGTC ACAATTACAG AGTGCCTCCT TATTCTCAAC AGCCTACGTC CATGCCTAAA CCTACAGGCC CTATCACTTC TTCACCAATA TCTACTACTT CTCCTCTGAG ACTTCAGGAG TCCTCAGAAT TTGCGACCTC CATGTCTAAG AACCAGTTGC CAAGTGTAAG TGAACTAGAC AAGTCGATTA AGGGTGCCTC TTCCAATGTT CCGAAGTCGG CTACTACCTT ACCGCCATTG CATCTGAGCC CTATGTTTGC GTTGTTGAAC AAGGACAATG ACGATGACAA AATACTCAAG AAGCGTAAAA CCTGATTGGG TAACAAATGT TTCCTGTCCC ATTAATGATA GTTTATAGAA TCTGTATGAA TACCAATGTA TTTTTATTCT GCAA
|
Protein sequence | MSAVIPSTKI KLSPEALEAS HESAPANPTS GQTNVANNSA VNSTANLTTA TGSNAGSNSG SNSGSNSGSN SGSVSGPASS ANSSGKTQTV FIHKLYDMLH DETISHLIWW SPSNDSFCLL PGEEFSKVLA QYFKHTNIAS FIRQLNMYGF HKVNDTFQNN EDGSGSNANA NSVNSSGTST NSTNNSNPNK WEFRHSTNQF RKGDIESLKL IKRRSSKNIN SHKEIVNLKS LPPTSNPIMD PNSGYGPAHG HYYGYSDDET SSIASARSPH GSSDNLHQQY HQSLRIHQQS LLINNEGRTT PNGQIQHPPQ QQQQIQQPQA QQLHGLGLSA PQSPAHTPIH QPLSPRLQPP PLVTNPSFEN SINFKFIELN NQLSSLKNEL NVMHQKYDSA HSELVRNQSD TLQLVEMLER FVQATEAAPE KPTINSNIFN NSNSNAIPTV TVNSNVARKD VPDRVSNNKT PINNTGRLAD EEGNTSPMSG ALFKSTSSSS NSATSASVTS TSATSVSGTG SSTTSAARTL SGEIGSLKSI LLQRLRASTQ HQQPPLRVQQ ASTSSSYHLS NAPHTTNNSR NPSNSNIQIV PQHYPLNPHY TIYPQSEYRG PDLNQNQSQN QNQHSKSITE ESASANRHLS ILMDPLQPMP TRNPILDEQS TNFRLRAESK SYSPLTVGGA SGQLPTSSGN ANQAQQAHVT QLQPPAHLHS QPPTANGPGG SQQNSQSSSR SSIVEKKTSS RSSSLINTPA GHDSSKPSYQ QYPFPHVNSH STHLHSYHNG STRTNSLPNP PIDHSVTTPL TPPEPPSTNP SYFNQRNSFT SMYDSHQQQG HNYRVPPYSQ QPTSMPKPTG PITSSPISTT SPSRLQESSE FATSMSKNQL PSVSELDKSI KGASSNVPKS ATTLPPLHSS PMFALLNKDN DDDKILKKRK T
|
| |
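The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below shows how that affects translation of short fragments copied from the gene sequence above. It builds the codon table by hand instead of relying on a library, and assumes that CTG is the only amino acid assignment that differs from the standard code. The names `build_table`, `translate`, `STANDARD` and `TABLE_12` are hypothetical.

```python
# Codons in the conventional T, C, A, G ordering used by NCBI genetic code tables.
BASES = "TCAG"
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(amino_acids: str) -> dict:
    """Map each of the 64 codons to its one-letter amino acid ('*' = stop)."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, amino_acids))


STANDARD = build_table(STANDARD_AA)
# Assumption: table 12 differs from the standard code only at CTG (Ser, not Leu).
TABLE_12 = dict(STANDARD, CTG="S")


def translate(dna: str, table: dict) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start of the coding region, copied from the gene sequence (spaces are ignored).
print(translate("ATGTCCGCCGT AATTCCCTCG ACCAAGATTA AG", TABLE_12))
# A fragment containing CTG, translated under both codes.
print(translate("CGCCACCTGACC", TABLE_12), translate("CGCCACCTGACC", STANDARD))
```

The first fragment gives MSAVIPSTKIK, matching the start of the protein sequence. The second gives RHST under table 12 but RHLT under the standard code. The protein sequence above contains RHST, which is consistent with table 12.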