Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_28757 |
Symbol | HSP70.4 |
ID | 4851507 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 2011937 |
End bp | 2015079 |
Gene Length | 3143 bp |
Protein Length | 946 aa |
Translation table | |
GC content | 37% |
IMG OID | 640393215 |
Product | heat shock protein 70 |
Protein accession | XP_001387620 |
Protein GI | 126274689 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0443] Molecular chaperone |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0849325 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.596472 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAAC CAGTAATAGG AATTGATCTC GGGACAACCA ACTCGTGCGT CGCAGTCTTT AATAACAAAG TGGAAGTCAT TGCCAACGTA TTAGGTAGCA GAATAACACC TTCCTGTGTT TCTTTTGATG ATAATGAAAC TATTATTGGA GAGGGGGCAA AAAATCAATT AGGAAAGAAT CCTGAGAATA CTGTCTATGG AACAAAGAGG TTAATTGGAA GAGACTTTGA TGATCCCGAA GTTCAACATG ATATAACTCA TTTCCTATTC AAGGTTGTGA ATAGAAATGG AAAGCCATTC ATTCAAGTAC AATACAAGAA AGAGATCAGA ACTCTTCCCC CTGAAGAAAT TTCTGCAATG GTATTAGAAA GTGTGAAGTG CACAGCAGAA GAATATCTAG GTGTAAAAGT CGAGGATGTG GTTATTACAG TTCCAGCATA CTTCAACGAT TCCCAAAGAA AAGCAACCAA AGCAGCTGGT GAAATTGCTG GTCTCAATGT CCTAGGGATC ATCAATGAAC CAACTGCTGC AGCACTTGCC TATGGGCAGT CCAACAACAA AGATTGCAAA GAAAGAAACT TATTGGTATA TGACCTTGGA GGTGGAACCT TTGATGTTTC TTTGGTCACC CATTGCAAAG ACGTCTACGA AGTGAGAGCA AGTGATGGCG ATAGCCATCT AGGTGGAGAA GATTTCGACA ACATTCTTGT TGACTATTTT GCTAGTGAAT TCATAGAAAG TTACCCTTGT AACCTCAAAT CCGACAAAAC GTCAATGGCT AAACTCAGAA AAGAATGCGA GTCTGCGAAG AGAAGATTAT GTGCTTCTCC TTCAACTGAC ATAGAAATTT CGTCCCTCTA TGATGGAAAG GCATTTAAAA GCAAACTCAG CAGGGCAAAA TTTGATGAAT TGTGTGGAGA TCTCATAATG AAAACAATGA ATACTGTTAA AGCAGTTATC GAGGCAGGTG GAATTATTAA AAGTGATGTG GACGAGGTTC TTCTTGTGGG AGGGTCTACA AGAATTCCAA TGGTTCAGAA AGAGGTTGCA AAGTTTTTTG AAGGCACTAA GATTAGCAAA AAAGCGAATG CGGACGAAGT TATAGCAGAA GGAGCTGCAA TTCAGGCTCA TATACTATCG ACAGAGCCTA GAGTAGTGCT TCTAGATGTT GCTCCTAAAT CCTTAGGATT AAAGGCAATT GGTGATCGAA TGGTGAAAAT GATTCCTAAG AACCTGGCTA TACCTTCTAC TAATTCCAAA GATTTCACAA CAGTTTCAGA TTATCAAACT AATTTGGTAG TCAAAGTATT TGAAGGAGAG AATGAAGTAT GTTCAAAAAA CCGTCTACTT GGGGAGTTTG AACTAAGTGG AATCCAAAGG GCCACAAAGG GAGAAGCTAG AATAATCGTG GTGTTCAATG TCGACGACAA TGGTATATTG AAAGTATCTG CTACAGAATC AAAAACCAAC AAGACAAAGT CTCTCACTAT TACCAAATTA AAGGAGAGTT GGACAGAAAA GGAAAAACAA CGAGTTTCAA CTGAAATTGA AGAGTTTAAA AGAGATCGAG AGCTTCACTC AGCAATGAGT AAAGAAATAG AACAAGAAAT CTATAAAATT GAACACCTTC TCAGTGCAAT GAACAATGTT CTCACAGATC CTAGAATTAA AAGCAAGATA CCTGCTAAGA TTAAAAGAGA TATCATCATA CAAATCATTG ATGCCAACAA GTGGTTACTG AATGCAGATA AGGAGTCTTT ACAAACTTGT AAAGATAAAT TTTCTACTCT TAAATTCTTT TCACAGGCGT GTATTCCAAA TGGGTATGTT AACTATTTTG AGACAAGAAA ATCAAATCTT TAGAATGAAG TCACGAAAGA TGAACAATCT GTCTGTCATT AATTGTCGAT TAATAATATG TAATGTGTTT GTAATTACTT TAGTTTGCAA AAAAAATCTT TACCCTAATC CCGGTAGGGA AGGTAGGCTG TACCAATGTA AAATTAGACA CCACACAAAA GCAACTTGAA GACAAGAAGG TTCTCTTGGT TAAAACAAAT GAAATTTCCC GACTTTGAAA GTTAACACAA CTTGTGGTGA GTATACCATT TTCCATGAAA CTCTTCGTAG GAATTGATAT TGGATCTACA CAAAGTCGGC TCTTTGCCAT CAATGACGAT AAGGAATTCG CCAGTCCGTC CAAACCAGAA GCCGAAGTCC AACAAAATGA ACTTGAACTA TGCGTGCTAA CTGTACCCAG TATTTTAAGT TCGAAACTAT TAGCGAACCC TGATATAAGT GGGAGAAACC AAATGGTTAT CCAAAGGGTA ATAGATGATA TTAAGAAAAA GACAAAGCTG AAATATGATA TTGAATTCCA ACAGTGCTAC ATTTCACTTC CCAGGTGGTA TATCGAGAAA AATAAGTTAC CGAGTGAGTT CAAGATACAA GACAGATGTA AATGTTTAAG AAATTCTTAT ATATCCAAAG TTCCCAATGG AAAAGTTCTA TTCATTGATG TTGGAAGTCA ACTTGAAATT TCTTGCTTCG CAGGAAATAA GATTATTTTT AGTCATTATT TCTCCAAATT GGGTGGTAAC AAGTTCGATA AGGACCTAGT AGCATCATGC GTTGAATCAT TTGAGCCAAG ACATATATCT CTCTACAGAA AAGATCCAGT AGCACAAAAA AGACTTGAGT TTGCCTGTAG ACTAGTGAAG GAGAGCTTTG GAGATAAAGA CGAAGTGACG GCCAATATTT ACGACTTTTT GGATAGCACT GATTTTGTAA GGCTGATCTC CAAAGATAAG TTGATTGCAA TATTAAAGAA CAGCTTCAAT GAATTAAGAA GAGCAATTTC AGAGAAACTT AAAATCGAGA GAGTGGACGA AATAGATGAA ATTGTGTTAG CTGGAAGAGC TGCAAATGTT CCAGGTCTCC AAAATTTTAT AACAAGAGCA TTCAAGGGTG CTAGCATTGA CACCTTAGAT AATTTTTCCA TTGCTAAAGG TGCTGCTCTT GAGATGAAGA AGAAGGAAAG ACTGGAGAAG ATTGACAAGT CCTATTTCAC TACACCGAAG AAGTTCTTCA ATGTAATCTC CAAACATGCG GCTACTACTC TTCACCAAGG GGATGGAAAT TGA
|
Protein sequence | MNKPVIGIDL GTTNSCVAVF NNKVEVIANV LGSRITPSCV SFDDNETIIG EGAKNQLGKN PENTVYGTKR LIGRDFDDPE VQHDITHFLF KVVNRNGKPF IQVQYKKEIR TLPPEEISAM VLESVKCTAE EYLGVKVEDV VITVPAYFND SQRKATKAAG EIAGLNVLGI INEPTAAALA YGQSNNKDCK ERNLLVYDLG GGTFDVSLVT HCKDVYEVRA SDGDSHLGGE DFDNILVDYF ASEFIESYPC NLKSDKTSMA KLRKECESAK RRLCASPSTD IEISSLYDGK AFKSKLSRAK FDELCGDLIM KTMNTVKAVI EAGGIIKSDV DEVLLVGGST RIPMVQKEVA KFFEGTKISK KANADEVIAE GAAIQAHILS TEPRVVLLDV APKSLGLKAI GDRMVKMIPK NLAIPSTNSK DFTTVSDYQT NLVVKVFEGE NEVCSKNRLL GEFELSGIQR ATKGEARIIV VFNVDDNGIL KVSATESKTN KTKSLTITKL KESWTEKEKQ RVSTEIEEFK RDRELHSAMS KEIEQEIYKI EHLLSAMNNV LTDPRIKSKI PAKIKRDIII QIIDANKWLL NADKESLQTC KDKFSTLKFF SQACIPNGEG IDIGSTQSRL FAINDDKEFA SPSKPEAEVQ QNELELCVLT VPSILSSKLL ANPDISGRNQ MVIQRVIDDI KKKTKLKYDI EFQQCYISLP RWYIEKNKLP SEFKIQDRCK CLRNSYISKV PNGKVLFIDV GSQLEISCFA GNKIIFSHYF SKLGGNKFDK DLVASCVESF EPRHISLYRK DPVAQKRLEF ACRLVKESFG DKDEVTANIY DFLDSTDFVR LISKDKLIAI LKNSFNELRR AISEKLKIER VDEIDEIVLA GRAANVPGLQ NFITRAFKGA SIDTLDNFSI AKGAALEMKK KERLEKIDKS YFTTPKKFFN VISKHAATTL HQGDGN
|
| |