Gene PICST_66881 details


Gene Information

Locus tag: PICST_66881
Symbol: HSR1
ID: 4837260
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 839943
End bp: 843073
Gene Length: 3131 bp
Protein Length: 666 aa
Translation table: 12
GC content: 47%
IMG OID: 640388575
Product: heat-shock related protein
Protein accession: XP_001382397
Protein GI: 150863800
COG category: [K] Transcription
COG ID: [COG5169] Heat shock transcription factor
TIGRFAM ID:
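
The Start bp, End bp, and Gene Length fields agree with each other if the coordinates are read as 1-based and inclusive. The page does not state that convention, so it is an assumption here. A minimal sketch of the arithmetic, with `feature_length` as a hypothetical helper name:

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Coordinates copied from the record above.
gene_length = feature_length(839943, 843073)
print(gene_length)  # 3131, matching the listed Gene Length

# A 666 aa protein needs 666 codons plus one stop codon (assumed),
# so the coding part is shorter than the 3131 bp listed for the gene.
cds_nt = (666 + 1) * 3
print(cds_nt)  # 2001
```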


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.531798
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0646148
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CACCTTCAGG CCTCGAAAAG ATAGACGAAG TAGGACTGTT GATTCTGCTG CCACCCCGTG 
AGTTTGACTG ACAGGCATCC CTCCGCCATT GCTGCCGCCG TCTGTTGGAA GATCCGCTAC
TACCAAAACT TAGCGAATAG CCATCGACAA AAAATAGAAC TCTGACAGTT TCACATCGGA
AAAGAGAGCC GTTTCTGCCT GATTGACAGA AAAATCTGAA GACATAGCTT TGCTATCGAG
ACAAACATCT ACTAATAAGG AGCTCCTCGC ACAAAGGGAA TCATCTGCAA TTGTTGCTTT
GCATCCCCTA GTTTGTCTGA TCAAAGTTTT ACAGCGCTTC CCCATATTTT TCAAGCGCTA
ATTTTTTATC CATCATCATA GGCAGCGCCA TACTTGCCTC AGAACCGGTC TGAAGTTTCT
AGCTAGTTTG TAGATACCGT CCATATAGTT TGGACCATCT AGCTTTAACC CTGCCGTTCT
TTGTTCTACG TTAAATTTCC ATCGTTTAGC CGTCGTGAAT AATAGAATTA GCTATTTAGC
CAATTTGGCA TTTTGAACAA TTTGAGAAAT TTGTCAATTC TGCACCACAC CCAAAAATTT
GCTCAATATT GTTGATTAAA GCTGATTTAA GCTGAATACG TTACACTTTC ACAGCCATGA
GTAAAAAGTC ACTGGATCCA CCAGTCAAAA AGAATGCCTT TGTGCACAAG CTCTATCTGA
TGCTTAACGA TCCTAAGCTC ACCCACTTGA TCTGGTGGTC TGAGAACAAC GCCAATCTGT
TTGCGCTTTA TCCGGGCAAG GAGTTTGCAA ACTCGCTTAC CCGTTACTTC AAACACGGGA
ACGTGGCGTC GTTTGTTCGT CAGCTCCACA TGTATGGATT CCATAAAGTA TCTGATCCTA
CCCAATCCTC CACCTCTATA GCTTCATCTG GTGAAAACGC TGACGCAGAT TCAGACCAGC
CTCCTCCCGT GTGGGAGTTC AAGCATCTGA GCGGGAAGTT TAAAAAGGGT GAAGAAGCAC
TGCTTATTTA CATCAAAAGA AGACCTTCTT CGAATAGCTC ACGAAATAGC AACTTTAACG
GAGTTCCCAA CCATCCCCAC CCTCAGCACA TGCACCCGCA CCATCTTCAT CATCAGATGG
TTCATCCTCA TCTTCATACT GGTCATATCC ATCCTGGAAT TCCCCACGAG TCTTACTCCA
TTCAAGGTCA AATACAAGGT CCCTACGATT CTCCTTATGC TACTCTACCT GCTGGTACTC
CCATGGTTCA TGGAAGTCCC ATGGTAGCCT ACTCTGGACA GTTCTATCCT CGTCCAGGTG
GGCTTCCTCT TCCTCAGCCC AAACTGCTCC AAGAACAGAA CCAACCACTT TCGGCGCCAC
CGCAGCCAGT TTTCATGTCG CAGTCGTATC CCCATCCTGT TCACTATGGA TATAGCTATG
CTGAACGCCA AAACAATCCT CTGTTTAGAC ACTCCGAAAG TTCTGCCACG TACTCTGTAG
CGCATGATGA GTCACATTCG TCTCCTTCTG TGGTCTCTAG GCACCAGCTG GACCCCAACG
TCCGTCCTGC TCTCGCCAAA ATTGGTTCAG GCTCGATTCC CGTCCAGGCT CAGCATTATG
CTCCCAACTT GCAGTTCAGA AAGATCTGGG AAACCAACCC TACCAACCAA CCAACCACTG
CTATAACCGC CCAAAACGGC ACATCTTCTC CTTCAGCATC TACTCCAGTC TTGGACAATA
GACCTCGTAA CCCTTCGCTT CTTTACGATC CCTTAGTCCC TGTATCGCCT CCTACTGACC
ATCACAGATC GCCATCATAC TTACCGCTTT CTCGAAACAA CTCCAACACC GCGAGTTTCC
TGAGGGACTC TGTGGACTCT AGACTGTCGA TCAAGCTTCC ACCGCCCTCT TCGTTGCATT
CGGTTTCGGG TTCTGTGTCA AATTCTGTGT CGGTATCAAA ATACCCACAG TCACCAGAAC
CACCAAGACA ACAGCTGCCA GCACCAGCAC CTCTACCATT TCACCAGCAT CTTCCTCGTA
GCATACCGTC CAACTTCGAA GCCTCTCCTC AGCAGGTGTC AGTTCCCCAG CAGCATTCAT
CGCTTCCCGG CTCTCCTGTT GTTGCCAACG GAGTCAAGAA GCCTTCGCTT ATACCTGTGA
GCAATGGAAT CCATGAACGC TTGAGACCTT CTCTTCTTGA GCTTCATTTT GGTTCTTCGA
ACGGCTCAAA CTCAGCCAAG GGATCATCTT CTTCTGCACC AAGACACCCT CAGGATTCAA
TAGGATCGTC AACGTCGTCG CACAATTCGG TATTCCTGAC GCAGCTGCTG TTACTGTCTG
TATCTTCTTC TGTTGTACAG CGTGCTTCGT CGTTCGGCTC GATTTCTCAC AACCCGCTCA
TTCACAAGAA CTCTTTCTCC ATCAGTCCTC ACGACACTCC ACCCATCTCT AGCTCAGGCG
CAGTTGAAGC TTCTGTCTCT AGCTACAATG CGACAGCCCT GAGCGACCAC CGCTCTTCTA
CAAGTCTGCC ATTGTCGAAG TCTGTAGAAG ATGTTAACAA AAAGGTCTCT GTAAGGTCAC
TTTTGGGTGA TTCTTCAGCA ACCTCGGTAG CCAGCGAGTC TGAAGACTCC GAGAGCAAAA
GACGCCGTTT GAGATGATTG CCCAATGTTT GAGGTGATCG CTCTCTGTTT AACTAAAACC
CTCAACCCTC AAAAAGTCCC TGCAATACGT CACCTCCAAA ATGCACTTTC CGTTCACTTC
TACTGAATAT TCTTGTAACT CAACTTCATA TCCAAGGCTC AAGTGAGTGA CTTACTTCAG
GTTTATTTCT TCTCTTGATG CCCTTCGGGT GCATCCTGTT TTATAACGAG GCACTCTGCT
GTTGCCGAAG AAAGGTTGGA ATAGCCCACC CTGCCGTTTC TCTCCTCTGT ATTATTAGAC
AACATTCACG GCTATCTTCT CTTTTTGTCA CATAATAGTT GAATATTCCT GCTGGTAAAT
CTTGGTTGAT ACTCTATTAG TACATCAAAA ACATAAGTGC TACGGTAATT GATTGACTAT
GATTTCTGCG AAACCTCTAG TTAGTCTACA TATCCCGGTT AATGAAAACA TTAATACACT
ACAACTGAAT G
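
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before its length or GC content can be checked. The following is a sketch only: `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example uses just the first line of the listing rather than the full 3131 bp.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing, copied as printed.
first_line = "CACCTTCAGG CCTCGAAAAG ATAGACGAAG TAGGACTGTT GATTCTGCTG CCACCCCGTG "
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

Applying the same two functions to the whole listing would give values to compare with the Gene Length and GC content fields of the record.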
 
Protein sequence
MSKKSSDPPV KKNAFVHKLY SMLNDPKLTH LIWWSENNAN SFALYPGKEF ANSLTRYFKH 
GNVASFVRQL HMYGFHKVSD PTQSSTSIAS SGENADADSD QPPPVWEFKH SSGKFKKGEE
ASLIYIKRRP SSNSSRNSNF NGVPNHPHPQ HMHPHHLHHQ MVHPHLHTGH IHPGIPHESY
SIQGQIQGPY DSPYATLPAG TPMVHGSPMV AYSGQFYPRP GGLPLPQPKS LQEQNQPLSA
PPQPVFMSQS YPHPVHYGYS YAERQNNPSF RHSESSATYS VAHDESHSSP SVVSRHQSDP
NVRPALAKIG SGSIPVQAQH YAPNLQFRKI WETNPTNQPT TAITAQNGTS SPSASTPVLD
NRPRNPSLLY DPLVPVSPPT DHHRSPSYLP LSRNNSNTAS FSRDSVDSRS SIKLPPPSSL
HSVSGSVSNS VSVSKYPQSP EPPRQQSPAP APLPFHQHLP RSIPSNFEAS PQQVSVPQQH
SSLPGSPVVA NGVKKPSLIP VSNGIHERLR PSLLELHFGS SNGSNSAKGS SSSAPRHPQD
SIGSSTSSHN SVFSTQSSLS SVSSSVVQRA SSFGSISHNP LIHKNSFSIS PHDTPPISSS
GAVEASVSSY NATASSDHRS STSSPLSKSV EDVNKKVSVR SLLGDSSATS VASESEDSES
KRRRLR
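
The record lists translation table 12, the alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The sketch below illustrates the effect on the first ten codons that follow the ATG in the gene sequence. The codon table is deliberately partial (only the codons in this excerpt), and `translate` is a hypothetical helper, not part of any library.

```python
# Partial codon table covering only the codons in the excerpt below.
# Under translation table 12, CTG encodes serine (S).
TABLE_12_PARTIAL = {
    "ATG": "M", "AGT": "S", "AAA": "K", "AAG": "K", "TCA": "S",
    "CTG": "S", "GAT": "D", "CCA": "P", "GTC": "V",
}

# Same codons under the standard code, where CTG encodes leucine (L).
STANDARD_PARTIAL = dict(TABLE_12_PARTIAL, CTG="L")


def translate(dna: str, table: dict) -> str:
    """Translate complete codons of dna using the given codon table."""
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# Thirty bases starting at the ATG in the gene sequence listing.
excerpt = "ATGAGTAAAAAGTCACTGGATCCACCAGTC"

print(translate(excerpt, TABLE_12_PARTIAL))  # MSKKSSDPPV
print(translate(excerpt, STANDARD_PARTIAL))  # MSKKSLDPPV
```

Only the table 12 reading reproduces the first ten residues of the protein sequence listed above.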