Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_66881 |
Symbol | HSR1 |
ID | 4837260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 839943 |
End bp | 843073 |
Gene Length | 3131 bp |
Protein Length | 666 aa |
Translation table | 12 |
GC content | 47% |
IMG OID | 640388575 |
Product | heat-shock related protein |
Protein accession | XP_001382397 |
Protein GI | 150863800 |
COG category | [K] Transcription |
COG ID | [COG5169] Heat shock transcription factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.531798 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0646148 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CACCTTCAGG CCTCGAAAAG ATAGACGAAG TAGGACTGTT GATTCTGCTG CCACCCCGTG AGTTTGACTG ACAGGCATCC CTCCGCCATT GCTGCCGCCG TCTGTTGGAA GATCCGCTAC TACCAAAACT TAGCGAATAG CCATCGACAA AAAATAGAAC TCTGACAGTT TCACATCGGA AAAGAGAGCC GTTTCTGCCT GATTGACAGA AAAATCTGAA GACATAGCTT TGCTATCGAG ACAAACATCT ACTAATAAGG AGCTCCTCGC ACAAAGGGAA TCATCTGCAA TTGTTGCTTT GCATCCCCTA GTTTGTCTGA TCAAAGTTTT ACAGCGCTTC CCCATATTTT TCAAGCGCTA ATTTTTTATC CATCATCATA GGCAGCGCCA TACTTGCCTC AGAACCGGTC TGAAGTTTCT AGCTAGTTTG TAGATACCGT CCATATAGTT TGGACCATCT AGCTTTAACC CTGCCGTTCT TTGTTCTACG TTAAATTTCC ATCGTTTAGC CGTCGTGAAT AATAGAATTA GCTATTTAGC CAATTTGGCA TTTTGAACAA TTTGAGAAAT TTGTCAATTC TGCACCACAC CCAAAAATTT GCTCAATATT GTTGATTAAA GCTGATTTAA GCTGAATACG TTACACTTTC ACAGCCATGA GTAAAAAGTC ACTGGATCCA CCAGTCAAAA AGAATGCCTT TGTGCACAAG CTCTATCTGA TGCTTAACGA TCCTAAGCTC ACCCACTTGA TCTGGTGGTC TGAGAACAAC GCCAATCTGT TTGCGCTTTA TCCGGGCAAG GAGTTTGCAA ACTCGCTTAC CCGTTACTTC AAACACGGGA ACGTGGCGTC GTTTGTTCGT CAGCTCCACA TGTATGGATT CCATAAAGTA TCTGATCCTA CCCAATCCTC CACCTCTATA GCTTCATCTG GTGAAAACGC TGACGCAGAT TCAGACCAGC CTCCTCCCGT GTGGGAGTTC AAGCATCTGA GCGGGAAGTT TAAAAAGGGT GAAGAAGCAC TGCTTATTTA CATCAAAAGA AGACCTTCTT CGAATAGCTC ACGAAATAGC AACTTTAACG GAGTTCCCAA CCATCCCCAC CCTCAGCACA TGCACCCGCA CCATCTTCAT CATCAGATGG TTCATCCTCA TCTTCATACT GGTCATATCC ATCCTGGAAT TCCCCACGAG TCTTACTCCA TTCAAGGTCA AATACAAGGT CCCTACGATT CTCCTTATGC TACTCTACCT GCTGGTACTC CCATGGTTCA TGGAAGTCCC ATGGTAGCCT ACTCTGGACA GTTCTATCCT CGTCCAGGTG GGCTTCCTCT TCCTCAGCCC AAACTGCTCC AAGAACAGAA CCAACCACTT TCGGCGCCAC CGCAGCCAGT TTTCATGTCG CAGTCGTATC CCCATCCTGT TCACTATGGA TATAGCTATG CTGAACGCCA AAACAATCCT CTGTTTAGAC ACTCCGAAAG TTCTGCCACG TACTCTGTAG CGCATGATGA GTCACATTCG TCTCCTTCTG TGGTCTCTAG GCACCAGCTG GACCCCAACG TCCGTCCTGC TCTCGCCAAA ATTGGTTCAG GCTCGATTCC CGTCCAGGCT CAGCATTATG CTCCCAACTT GCAGTTCAGA AAGATCTGGG AAACCAACCC TACCAACCAA CCAACCACTG CTATAACCGC CCAAAACGGC ACATCTTCTC CTTCAGCATC TACTCCAGTC TTGGACAATA GACCTCGTAA CCCTTCGCTT CTTTACGATC CCTTAGTCCC TGTATCGCCT CCTACTGACC ATCACAGATC GCCATCATAC TTACCGCTTT CTCGAAACAA CTCCAACACC GCGAGTTTCC TGAGGGACTC TGTGGACTCT AGACTGTCGA TCAAGCTTCC ACCGCCCTCT TCGTTGCATT CGGTTTCGGG TTCTGTGTCA AATTCTGTGT CGGTATCAAA ATACCCACAG TCACCAGAAC CACCAAGACA ACAGCTGCCA GCACCAGCAC CTCTACCATT TCACCAGCAT CTTCCTCGTA GCATACCGTC CAACTTCGAA GCCTCTCCTC AGCAGGTGTC AGTTCCCCAG CAGCATTCAT CGCTTCCCGG CTCTCCTGTT GTTGCCAACG GAGTCAAGAA GCCTTCGCTT ATACCTGTGA GCAATGGAAT CCATGAACGC TTGAGACCTT CTCTTCTTGA GCTTCATTTT GGTTCTTCGA ACGGCTCAAA CTCAGCCAAG GGATCATCTT CTTCTGCACC AAGACACCCT CAGGATTCAA TAGGATCGTC AACGTCGTCG CACAATTCGG TATTCCTGAC GCAGCTGCTG TTACTGTCTG TATCTTCTTC TGTTGTACAG CGTGCTTCGT CGTTCGGCTC GATTTCTCAC AACCCGCTCA TTCACAAGAA CTCTTTCTCC ATCAGTCCTC ACGACACTCC ACCCATCTCT AGCTCAGGCG CAGTTGAAGC TTCTGTCTCT AGCTACAATG CGACAGCCCT GAGCGACCAC CGCTCTTCTA CAAGTCTGCC ATTGTCGAAG TCTGTAGAAG ATGTTAACAA AAAGGTCTCT GTAAGGTCAC TTTTGGGTGA TTCTTCAGCA ACCTCGGTAG CCAGCGAGTC TGAAGACTCC GAGAGCAAAA GACGCCGTTT GAGATGATTG CCCAATGTTT GAGGTGATCG CTCTCTGTTT AACTAAAACC CTCAACCCTC AAAAAGTCCC TGCAATACGT CACCTCCAAA ATGCACTTTC CGTTCACTTC TACTGAATAT TCTTGTAACT CAACTTCATA TCCAAGGCTC AAGTGAGTGA CTTACTTCAG GTTTATTTCT TCTCTTGATG CCCTTCGGGT GCATCCTGTT TTATAACGAG GCACTCTGCT GTTGCCGAAG AAAGGTTGGA ATAGCCCACC CTGCCGTTTC TCTCCTCTGT ATTATTAGAC AACATTCACG GCTATCTTCT CTTTTTGTCA CATAATAGTT GAATATTCCT GCTGGTAAAT CTTGGTTGAT ACTCTATTAG TACATCAAAA ACATAAGTGC TACGGTAATT GATTGACTAT GATTTCTGCG AAACCTCTAG TTAGTCTACA TATCCCGGTT AATGAAAACA TTAATACACT ACAACTGAAT G
|
Protein sequence | MSKKSSDPPV KKNAFVHKLY SMLNDPKLTH LIWWSENNAN SFALYPGKEF ANSLTRYFKH GNVASFVRQL HMYGFHKVSD PTQSSTSIAS SGENADADSD QPPPVWEFKH SSGKFKKGEE ASLIYIKRRP SSNSSRNSNF NGVPNHPHPQ HMHPHHLHHQ MVHPHLHTGH IHPGIPHESY SIQGQIQGPY DSPYATLPAG TPMVHGSPMV AYSGQFYPRP GGLPLPQPKS LQEQNQPLSA PPQPVFMSQS YPHPVHYGYS YAERQNNPSF RHSESSATYS VAHDESHSSP SVVSRHQSDP NVRPALAKIG SGSIPVQAQH YAPNLQFRKI WETNPTNQPT TAITAQNGTS SPSASTPVLD NRPRNPSLLY DPLVPVSPPT DHHRSPSYLP LSRNNSNTAS FSRDSVDSRS SIKLPPPSSL HSVSGSVSNS VSVSKYPQSP EPPRQQSPAP APLPFHQHLP RSIPSNFEAS PQQVSVPQQH SSLPGSPVVA NGVKKPSLIP VSNGIHERLR PSLLELHFGS SNGSNSAKGS SSSAPRHPQD SIGSSTSSHN SVFSTQSSLS SVSSSVVQRA SSFGSISHNP LIHKNSFSIS PHDTPPISSS GAVEASVSSY NATASSDHRS STSSPLSKSV EDVNKKVSVR SLLGDSSATS VASESEDSES KRRRLR
|
| |