Gene PICST_32563 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_32563 
SymbolPUT3 
ID4839746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp101707 
End bp104892 
Gene Length3186 bp 
Protein Length1061 aa 
Translation table12 
GC content42% 
IMG OID640391061 
Producttranscription activator involved in proline utilization potential fungal Zn(2)-Cys(6) binuclear cluster domain 
Protein accessionXP_001385712 
Protein GI150866202 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.165172 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGACT GCGCTGCGGG CGACGAACAT CGCCCTGCCG TTTCGCTCTC TGCTTCCGTC 
AAAGATGAGG AAGTAGCTGA CTCCATGATA GCTGGAGGAG TGCTCGAGTT GCACCCAGCA
AACAGCCCTG CTCTGGGGCA AACTACAAAC ATAACGTCCA CATTAACAGA CATATCATCT
AATTCACCAT CATCGTCATC AAATGCGTCG TCACATGCTT CATCAAATTT ATCGAATATA
ACATCGTCTC ACCAATCAAA CAAGCAGAAG CGTACAGCGC TAGCCTGTAT TCGTTGTCGT
GCTAGACACA TCCGATGTCC CGGAGGAGAT CCATGTAAAA AGTGTCAGAT TGCAAAGACG
AAGTGCGAGT ATGTAGAGGC TGATAAGAAG ATTGTCGTAT CGATGAAGTA CTTGTCGAAG
TTGCACGACG ATATAGCCCG TTTGAAGAAG GATAATGCCG TTCTCAGAAA CAACTTGAAG
GAAGAGGAAA CAAAACGCAT CCGAGCAAAC CCTGTGCTTC TGGCCCTGAC ACTACAACAG
CAAAACAATT TTACCAACAA CATCAAGTAC TCCATGCCAA GTGTACTGCA ATCGGTGCAG
CAACCACAAC AGCCTGCTCT GACGACAGGT GTTACGGCTA ATAGCAATTT CTCCTCCACT
GAGGTGATCC AGCCATCTTT GGATAAACAC GGCAGACTTA TACAGTCGAG AACGGGGGAA
AAGGTCTATG TAGGCTCGTC ATCCATGACG CTTTTTGGAC TCGAAATCCA GAACATGGTA
CCTTCGTTTG TGTCTTCAAG TTTGTTGCCT AACAATTCTA CAGATACTTC ACCCACTGCT
TCTCCCAATA GTGTAGGTGG CTCAACACCT CAATCGAACC AGTCTGGAGA ACCCGGATCA
TTCAAGCGTA ACAAACGAGA GACGGAAATA CTCGAGAAGG AGGGAAACGC CTACCGAATC
ACTCTTGCTA AGACCAACAC CAGACCAGGA CTCTCCATTA ACTTTACGTT ACCATCGTAT
TCATACGCCA TGCTCTTGGT AGATACGTTT ATCAACTATA ACGATGGGTG TTTTTACTTC
TTCAACGAAG GGCTTGTCAA GAAGTTCCTT ATGAATTTAT ATTCCGGAAA GGCAGCTGAG
AACAAGAGAA TACTCAAGAG AAACATCACC GAAGCTAAAG GTGGACCTGA CGAAGATGAA
AATGCGATAA AAAAAGACAC AGATGATGAT ACAATTCTTG AAACCATATG GTTCTGTAAG
ATTCTACTTA TATTTGCTAT TGGCGAAATG TATCTTGGAA CTGAATCAAA CTCACACATC
ATAAAGCTGA AGGAAAAGTT GGAATCCAAG AAGGCGAGAA ATAGAACTAA AGAAAGAAAA
GAGAAAGACA CGCTACCAGG ATCTGGATTC TTCTACGAGG CTTCTGAGTT GTTTACCGGC
TTGTTTGCCT CAGGTGCCAT AGATAATATT ACTAAAGATG GTGGTATTGA AGTAATGCTT
CTTTATGCTT TCTACTTACA AGTGGCTGAT TGTACCATTG CTTCGTATTT CTATTTTGGA
TTGGCTCTCA GGTCGACTTT GATCTTAGGT TGGCATGTGG ATGCAGACAA AGAGAACTTG
AATAGGTTCG AGCTAGAGCA CCGAAGAAGA ATATGGTGGA CGGTGTATAT GTACGAGAGA
ATGTTGTCTT CTAAAGCTGG TTTGCCTTTA AGTTTTGCAG ACGACAGTGT TTCCACTGAA
TTGCCGGTTG ATTTCAACAT TGATCTCACC GATTTCAGAA AGGATGAAAA TGATGTCAGA
GGATACTATA TCTTCCCACC GGCAGACTAC ATAAACAACT GCGTTACAAT CACACAAATC
AATGCTATCA TCTTATCTTC TTTATACACC AAGCAGCCCA CTGTCAATAT TCTACCAGTT
GTTTCAGATT TGGTGCATAA GTTGATGACA TGGAAGAACT TGTTGCCAGA CTTCCTCAAG
ATAGATTTTT CAGAAGAAAA CTTGCGCATC ACAAGACTTA TTGTCAATTT GATGACGGAA
TACTTCCAAG GTTTGAACTT GGCTGTTCGT CCATTACTTT TCCACTTTGC TACCAAGAAA
CTAAAGGAAC TCCAGGCCAA AAATACAGTC AACAAATATG TTGACTTGTC AAAATATTCA
AAGAATGTAT TATTCTTATT GAATGCCTCG TTCCAAGCGT CCATCAATAC CATAAAATCA
ATATGGGCCC TTCTTCCAGA AAACATGGTA GCCCTTTTTG GATGGATGGA TAGAGAGTAT
TTATTCACGT CGGCTTCGAC ATTAATCTTA TTCAATGCCT CGTTTGGCGT ACATGAAGCC
ACGAAAGAAC ACTTGGATCA TGCTTTAATA ATCTTCACCA AGATGAAGAA ACTTGGAAAC
TATCCAGCTG CACTCAGAAG AGCTCAATTA TTGAAGCTTA TCAAAGTTCT TGACTTTAAT
GGAGTCATGA AAGATCTTTT GTTGAAGCAC GATGACGATT TAAAAGAAAT CAATATTTCC
AATACAAATT TGTCATCGGA GGAAATTCAA AATCACATCG TCGAGGTTAA CCAAATTTCA
AACTCCAAGA TTAATAGCGA TCACTTAAAT GTTCTCGACA CGGAGCTAAG TGAAAGTATT
GCTGTAGCTG CAGTTGCGCC CGATAAACAG GCTCCTCCAT CTGAGCCATA TCTAGAATTT
CCAGACAGAC AAACACCAAT TCACCCATAT ATTCATACCA CTTCATACCC TCTAGGAACA
ATGAGTGGTG ATACTTTTTC TTACACCATC CCAACTCCAA TGAATGGAAA CAATACTGGT
GGTGATGTAT ACACCAATTC AGACTTGGCA GGTATCGAAG GTTTGACGTA TTTGGATGAA
GAACAGAAGT TGTGGAATGA AATCACCAAT GATGCCGGTT GGTTGAATGT TGCCGGAGGT
AATCCAAACC AACAACATGG TCTGAGTGGT GACCTCTTTC TAAGAAATCA CGCCGTTGAA
TCGCATTCGG CCACAGAAAG TTCCACTCCT CATAATACTG CCGGGCACAT TCCTACCAAC
TATGGACCAG GCTCTGACAG CCGTGGAGAT ATCTACGGTC ATCCCAGCTT TGGTACGAGC
ATGGCATCTG GAGGCTACAG TGACATCATC AACCTGGAGT TCCATGACAT AATGGACCAA
TCCTAA
 
Protein sequence
MTDCAAGDEH RPAVSLSASV KDEEVADSMI AGGVLELHPA NSPASGQTTN ITSTLTDISS 
NSPSSSSNAS SHASSNLSNI TSSHQSNKQK RTALACIRCR ARHIRCPGGD PCKKCQIAKT
KCEYVEADKK IVVSMKYLSK LHDDIARLKK DNAVLRNNLK EEETKRIRAN PVLSASTLQQ
QNNFTNNIKY SMPSVSQSVQ QPQQPASTTG VTANSNFSST EVIQPSLDKH GRLIQSRTGE
KVYVGSSSMT LFGLEIQNMV PSFVSSSLLP NNSTDTSPTA SPNSVGGSTP QSNQSGEPGS
FKRNKRETEI LEKEGNAYRI TLAKTNTRPG LSINFTLPSY SYAMLLVDTF INYNDGCFYF
FNEGLVKKFL MNLYSGKAAE NKRILKRNIT EAKGGPDEDE NAIKKDTDDD TILETIWFCK
ILLIFAIGEM YLGTESNSHI IKSKEKLESK KARNRTKERK EKDTLPGSGF FYEASELFTG
LFASGAIDNI TKDGGIEVML LYAFYLQVAD CTIASYFYFG LALRSTLILG WHVDADKENL
NRFELEHRRR IWWTVYMYER MLSSKAGLPL SFADDSVSTE LPVDFNIDLT DFRKDENDVR
GYYIFPPADY INNCVTITQI NAIILSSLYT KQPTVNILPV VSDLVHKLMT WKNLLPDFLK
IDFSEENLRI TRLIVNLMTE YFQGLNLAVR PLLFHFATKK LKELQAKNTV NKYVDLSKYS
KNVLFLLNAS FQASINTIKS IWALLPENMV ALFGWMDREY LFTSASTLIL FNASFGVHEA
TKEHLDHALI IFTKMKKLGN YPAALRRAQL LKLIKVLDFN GVMKDLLLKH DDDLKEINIS
NTNLSSEEIQ NHIVEVNQIS NSKINSDHLN VLDTELSESI AVAAVAPDKQ APPSEPYLEF
PDRQTPIHPY IHTTSYPLGT MSGDTFSYTI PTPMNGNNTG GDVYTNSDLA GIEGLTYLDE
EQKLWNEITN DAGWLNVAGG NPNQQHGSSG DLFLRNHAVE SHSATESSTP HNTAGHIPTN
YGPGSDSRGD IYGHPSFGTS MASGGYSDII NSEFHDIMDQ S