Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_32563 |
Symbol | PUT3 |
ID | 4839746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | - |
Start bp | 101707 |
End bp | 104892 |
Gene Length | 3186 bp |
Protein Length | 1061 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640391061 |
Product | transcription activator involved in proline utilization potential fungal Zn(2)-Cys(6) binuclear cluster domain |
Protein accession | XP_001385712 |
Protein GI | 150866202 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.165172 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGACT GCGCTGCGGG CGACGAACAT CGCCCTGCCG TTTCGCTCTC TGCTTCCGTC AAAGATGAGG AAGTAGCTGA CTCCATGATA GCTGGAGGAG TGCTCGAGTT GCACCCAGCA AACAGCCCTG CTCTGGGGCA AACTACAAAC ATAACGTCCA CATTAACAGA CATATCATCT AATTCACCAT CATCGTCATC AAATGCGTCG TCACATGCTT CATCAAATTT ATCGAATATA ACATCGTCTC ACCAATCAAA CAAGCAGAAG CGTACAGCGC TAGCCTGTAT TCGTTGTCGT GCTAGACACA TCCGATGTCC CGGAGGAGAT CCATGTAAAA AGTGTCAGAT TGCAAAGACG AAGTGCGAGT ATGTAGAGGC TGATAAGAAG ATTGTCGTAT CGATGAAGTA CTTGTCGAAG TTGCACGACG ATATAGCCCG TTTGAAGAAG GATAATGCCG TTCTCAGAAA CAACTTGAAG GAAGAGGAAA CAAAACGCAT CCGAGCAAAC CCTGTGCTTC TGGCCCTGAC ACTACAACAG CAAAACAATT TTACCAACAA CATCAAGTAC TCCATGCCAA GTGTACTGCA ATCGGTGCAG CAACCACAAC AGCCTGCTCT GACGACAGGT GTTACGGCTA ATAGCAATTT CTCCTCCACT GAGGTGATCC AGCCATCTTT GGATAAACAC GGCAGACTTA TACAGTCGAG AACGGGGGAA AAGGTCTATG TAGGCTCGTC ATCCATGACG CTTTTTGGAC TCGAAATCCA GAACATGGTA CCTTCGTTTG TGTCTTCAAG TTTGTTGCCT AACAATTCTA CAGATACTTC ACCCACTGCT TCTCCCAATA GTGTAGGTGG CTCAACACCT CAATCGAACC AGTCTGGAGA ACCCGGATCA TTCAAGCGTA ACAAACGAGA GACGGAAATA CTCGAGAAGG AGGGAAACGC CTACCGAATC ACTCTTGCTA AGACCAACAC CAGACCAGGA CTCTCCATTA ACTTTACGTT ACCATCGTAT TCATACGCCA TGCTCTTGGT AGATACGTTT ATCAACTATA ACGATGGGTG TTTTTACTTC TTCAACGAAG GGCTTGTCAA GAAGTTCCTT ATGAATTTAT ATTCCGGAAA GGCAGCTGAG AACAAGAGAA TACTCAAGAG AAACATCACC GAAGCTAAAG GTGGACCTGA CGAAGATGAA AATGCGATAA AAAAAGACAC AGATGATGAT ACAATTCTTG AAACCATATG GTTCTGTAAG ATTCTACTTA TATTTGCTAT TGGCGAAATG TATCTTGGAA CTGAATCAAA CTCACACATC ATAAAGCTGA AGGAAAAGTT GGAATCCAAG AAGGCGAGAA ATAGAACTAA AGAAAGAAAA GAGAAAGACA CGCTACCAGG ATCTGGATTC TTCTACGAGG CTTCTGAGTT GTTTACCGGC TTGTTTGCCT CAGGTGCCAT AGATAATATT ACTAAAGATG GTGGTATTGA AGTAATGCTT CTTTATGCTT TCTACTTACA AGTGGCTGAT TGTACCATTG CTTCGTATTT CTATTTTGGA TTGGCTCTCA GGTCGACTTT GATCTTAGGT TGGCATGTGG ATGCAGACAA AGAGAACTTG AATAGGTTCG AGCTAGAGCA CCGAAGAAGA ATATGGTGGA CGGTGTATAT GTACGAGAGA ATGTTGTCTT CTAAAGCTGG TTTGCCTTTA AGTTTTGCAG ACGACAGTGT TTCCACTGAA TTGCCGGTTG ATTTCAACAT TGATCTCACC GATTTCAGAA AGGATGAAAA TGATGTCAGA GGATACTATA TCTTCCCACC GGCAGACTAC ATAAACAACT GCGTTACAAT CACACAAATC AATGCTATCA TCTTATCTTC TTTATACACC AAGCAGCCCA CTGTCAATAT TCTACCAGTT GTTTCAGATT TGGTGCATAA GTTGATGACA TGGAAGAACT TGTTGCCAGA CTTCCTCAAG ATAGATTTTT CAGAAGAAAA CTTGCGCATC ACAAGACTTA TTGTCAATTT GATGACGGAA TACTTCCAAG GTTTGAACTT GGCTGTTCGT CCATTACTTT TCCACTTTGC TACCAAGAAA CTAAAGGAAC TCCAGGCCAA AAATACAGTC AACAAATATG TTGACTTGTC AAAATATTCA AAGAATGTAT TATTCTTATT GAATGCCTCG TTCCAAGCGT CCATCAATAC CATAAAATCA ATATGGGCCC TTCTTCCAGA AAACATGGTA GCCCTTTTTG GATGGATGGA TAGAGAGTAT TTATTCACGT CGGCTTCGAC ATTAATCTTA TTCAATGCCT CGTTTGGCGT ACATGAAGCC ACGAAAGAAC ACTTGGATCA TGCTTTAATA ATCTTCACCA AGATGAAGAA ACTTGGAAAC TATCCAGCTG CACTCAGAAG AGCTCAATTA TTGAAGCTTA TCAAAGTTCT TGACTTTAAT GGAGTCATGA AAGATCTTTT GTTGAAGCAC GATGACGATT TAAAAGAAAT CAATATTTCC AATACAAATT TGTCATCGGA GGAAATTCAA AATCACATCG TCGAGGTTAA CCAAATTTCA AACTCCAAGA TTAATAGCGA TCACTTAAAT GTTCTCGACA CGGAGCTAAG TGAAAGTATT GCTGTAGCTG CAGTTGCGCC CGATAAACAG GCTCCTCCAT CTGAGCCATA TCTAGAATTT CCAGACAGAC AAACACCAAT TCACCCATAT ATTCATACCA CTTCATACCC TCTAGGAACA ATGAGTGGTG ATACTTTTTC TTACACCATC CCAACTCCAA TGAATGGAAA CAATACTGGT GGTGATGTAT ACACCAATTC AGACTTGGCA GGTATCGAAG GTTTGACGTA TTTGGATGAA GAACAGAAGT TGTGGAATGA AATCACCAAT GATGCCGGTT GGTTGAATGT TGCCGGAGGT AATCCAAACC AACAACATGG TCTGAGTGGT GACCTCTTTC TAAGAAATCA CGCCGTTGAA TCGCATTCGG CCACAGAAAG TTCCACTCCT CATAATACTG CCGGGCACAT TCCTACCAAC TATGGACCAG GCTCTGACAG CCGTGGAGAT ATCTACGGTC ATCCCAGCTT TGGTACGAGC ATGGCATCTG GAGGCTACAG TGACATCATC AACCTGGAGT TCCATGACAT AATGGACCAA TCCTAA
|
Protein sequence | MTDCAAGDEH RPAVSLSASV KDEEVADSMI AGGVLELHPA NSPASGQTTN ITSTLTDISS NSPSSSSNAS SHASSNLSNI TSSHQSNKQK RTALACIRCR ARHIRCPGGD PCKKCQIAKT KCEYVEADKK IVVSMKYLSK LHDDIARLKK DNAVLRNNLK EEETKRIRAN PVLSASTLQQ QNNFTNNIKY SMPSVSQSVQ QPQQPASTTG VTANSNFSST EVIQPSLDKH GRLIQSRTGE KVYVGSSSMT LFGLEIQNMV PSFVSSSLLP NNSTDTSPTA SPNSVGGSTP QSNQSGEPGS FKRNKRETEI LEKEGNAYRI TLAKTNTRPG LSINFTLPSY SYAMLLVDTF INYNDGCFYF FNEGLVKKFL MNLYSGKAAE NKRILKRNIT EAKGGPDEDE NAIKKDTDDD TILETIWFCK ILLIFAIGEM YLGTESNSHI IKSKEKLESK KARNRTKERK EKDTLPGSGF FYEASELFTG LFASGAIDNI TKDGGIEVML LYAFYLQVAD CTIASYFYFG LALRSTLILG WHVDADKENL NRFELEHRRR IWWTVYMYER MLSSKAGLPL SFADDSVSTE LPVDFNIDLT DFRKDENDVR GYYIFPPADY INNCVTITQI NAIILSSLYT KQPTVNILPV VSDLVHKLMT WKNLLPDFLK IDFSEENLRI TRLIVNLMTE YFQGLNLAVR PLLFHFATKK LKELQAKNTV NKYVDLSKYS KNVLFLLNAS FQASINTIKS IWALLPENMV ALFGWMDREY LFTSASTLIL FNASFGVHEA TKEHLDHALI IFTKMKKLGN YPAALRRAQL LKLIKVLDFN GVMKDLLLKH DDDLKEINIS NTNLSSEEIQ NHIVEVNQIS NSKINSDHLN VLDTELSESI AVAAVAPDKQ APPSEPYLEF PDRQTPIHPY IHTTSYPLGT MSGDTFSYTI PTPMNGNNTG GDVYTNSDLA GIEGLTYLDE EQKLWNEITN DAGWLNVAGG NPNQQHGSSG DLFLRNHAVE SHSATESSTP HNTAGHIPTN YGPGSDSRGD IYGHPSFGTS MASGGYSDII NSEFHDIMDQ S
|
| |