Gene PICST_28757 details


Gene Information

Locus tag: PICST_28757
Symbol: HSP70.4
ID: 4851507
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 2011937
End bp: 2015079
Gene Length: 3143 bp
Protein Length: 946 aa
Translation table:
GC content: 37%
IMG OID: 640393215
Product: heat shock protein 70
Protein accession: XP_001387620
Protein GI: 126274689
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0443] Molecular chaperone
TIGRFAM ID:
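
The gene length listed above matches the start and end coordinates if both are read as inclusive, 1-based positions. The record does not state that convention, so treat it as an assumption. The function name `span_length` is hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive, 1-based start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Start bp and End bp as listed in this record.
print(span_length(2011937, 2015079))  # 3143
```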


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0849325
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.596472
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAAAC CAGTAATAGG AATTGATCTC GGGACAACCA ACTCGTGCGT CGCAGTCTTT 
AATAACAAAG TGGAAGTCAT TGCCAACGTA TTAGGTAGCA GAATAACACC TTCCTGTGTT
TCTTTTGATG ATAATGAAAC TATTATTGGA GAGGGGGCAA AAAATCAATT AGGAAAGAAT
CCTGAGAATA CTGTCTATGG AACAAAGAGG TTAATTGGAA GAGACTTTGA TGATCCCGAA
GTTCAACATG ATATAACTCA TTTCCTATTC AAGGTTGTGA ATAGAAATGG AAAGCCATTC
ATTCAAGTAC AATACAAGAA AGAGATCAGA ACTCTTCCCC CTGAAGAAAT TTCTGCAATG
GTATTAGAAA GTGTGAAGTG CACAGCAGAA GAATATCTAG GTGTAAAAGT CGAGGATGTG
GTTATTACAG TTCCAGCATA CTTCAACGAT TCCCAAAGAA AAGCAACCAA AGCAGCTGGT
GAAATTGCTG GTCTCAATGT CCTAGGGATC ATCAATGAAC CAACTGCTGC AGCACTTGCC
TATGGGCAGT CCAACAACAA AGATTGCAAA GAAAGAAACT TATTGGTATA TGACCTTGGA
GGTGGAACCT TTGATGTTTC TTTGGTCACC CATTGCAAAG ACGTCTACGA AGTGAGAGCA
AGTGATGGCG ATAGCCATCT AGGTGGAGAA GATTTCGACA ACATTCTTGT TGACTATTTT
GCTAGTGAAT TCATAGAAAG TTACCCTTGT AACCTCAAAT CCGACAAAAC GTCAATGGCT
AAACTCAGAA AAGAATGCGA GTCTGCGAAG AGAAGATTAT GTGCTTCTCC TTCAACTGAC
ATAGAAATTT CGTCCCTCTA TGATGGAAAG GCATTTAAAA GCAAACTCAG CAGGGCAAAA
TTTGATGAAT TGTGTGGAGA TCTCATAATG AAAACAATGA ATACTGTTAA AGCAGTTATC
GAGGCAGGTG GAATTATTAA AAGTGATGTG GACGAGGTTC TTCTTGTGGG AGGGTCTACA
AGAATTCCAA TGGTTCAGAA AGAGGTTGCA AAGTTTTTTG AAGGCACTAA GATTAGCAAA
AAAGCGAATG CGGACGAAGT TATAGCAGAA GGAGCTGCAA TTCAGGCTCA TATACTATCG
ACAGAGCCTA GAGTAGTGCT TCTAGATGTT GCTCCTAAAT CCTTAGGATT AAAGGCAATT
GGTGATCGAA TGGTGAAAAT GATTCCTAAG AACCTGGCTA TACCTTCTAC TAATTCCAAA
GATTTCACAA CAGTTTCAGA TTATCAAACT AATTTGGTAG TCAAAGTATT TGAAGGAGAG
AATGAAGTAT GTTCAAAAAA CCGTCTACTT GGGGAGTTTG AACTAAGTGG AATCCAAAGG
GCCACAAAGG GAGAAGCTAG AATAATCGTG GTGTTCAATG TCGACGACAA TGGTATATTG
AAAGTATCTG CTACAGAATC AAAAACCAAC AAGACAAAGT CTCTCACTAT TACCAAATTA
AAGGAGAGTT GGACAGAAAA GGAAAAACAA CGAGTTTCAA CTGAAATTGA AGAGTTTAAA
AGAGATCGAG AGCTTCACTC AGCAATGAGT AAAGAAATAG AACAAGAAAT CTATAAAATT
GAACACCTTC TCAGTGCAAT GAACAATGTT CTCACAGATC CTAGAATTAA AAGCAAGATA
CCTGCTAAGA TTAAAAGAGA TATCATCATA CAAATCATTG ATGCCAACAA GTGGTTACTG
AATGCAGATA AGGAGTCTTT ACAAACTTGT AAAGATAAAT TTTCTACTCT TAAATTCTTT
TCACAGGCGT GTATTCCAAA TGGGTATGTT AACTATTTTG AGACAAGAAA ATCAAATCTT
TAGAATGAAG TCACGAAAGA TGAACAATCT GTCTGTCATT AATTGTCGAT TAATAATATG
TAATGTGTTT GTAATTACTT TAGTTTGCAA AAAAAATCTT TACCCTAATC CCGGTAGGGA
AGGTAGGCTG TACCAATGTA AAATTAGACA CCACACAAAA GCAACTTGAA GACAAGAAGG
TTCTCTTGGT TAAAACAAAT GAAATTTCCC GACTTTGAAA GTTAACACAA CTTGTGGTGA
GTATACCATT TTCCATGAAA CTCTTCGTAG GAATTGATAT TGGATCTACA CAAAGTCGGC
TCTTTGCCAT CAATGACGAT AAGGAATTCG CCAGTCCGTC CAAACCAGAA GCCGAAGTCC
AACAAAATGA ACTTGAACTA TGCGTGCTAA CTGTACCCAG TATTTTAAGT TCGAAACTAT
TAGCGAACCC TGATATAAGT GGGAGAAACC AAATGGTTAT CCAAAGGGTA ATAGATGATA
TTAAGAAAAA GACAAAGCTG AAATATGATA TTGAATTCCA ACAGTGCTAC ATTTCACTTC
CCAGGTGGTA TATCGAGAAA AATAAGTTAC CGAGTGAGTT CAAGATACAA GACAGATGTA
AATGTTTAAG AAATTCTTAT ATATCCAAAG TTCCCAATGG AAAAGTTCTA TTCATTGATG
TTGGAAGTCA ACTTGAAATT TCTTGCTTCG CAGGAAATAA GATTATTTTT AGTCATTATT
TCTCCAAATT GGGTGGTAAC AAGTTCGATA AGGACCTAGT AGCATCATGC GTTGAATCAT
TTGAGCCAAG ACATATATCT CTCTACAGAA AAGATCCAGT AGCACAAAAA AGACTTGAGT
TTGCCTGTAG ACTAGTGAAG GAGAGCTTTG GAGATAAAGA CGAAGTGACG GCCAATATTT
ACGACTTTTT GGATAGCACT GATTTTGTAA GGCTGATCTC CAAAGATAAG TTGATTGCAA
TATTAAAGAA CAGCTTCAAT GAATTAAGAA GAGCAATTTC AGAGAAACTT AAAATCGAGA
GAGTGGACGA AATAGATGAA ATTGTGTTAG CTGGAAGAGC TGCAAATGTT CCAGGTCTCC
AAAATTTTAT AACAAGAGCA TTCAAGGGTG CTAGCATTGA CACCTTAGAT AATTTTTCCA
TTGCTAAAGG TGCTGCTCTT GAGATGAAGA AGAAGGAAAG ACTGGAGAAG ATTGACAAGT
CCTATTTCAC TACACCGAAG AAGTTCTTCA ATGTAATCTC CAAACATGCG GCTACTACTC
TTCACCAAGG GGATGGAAAT TGA
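
The sequence is printed in blocks of 10 bases, 60 per line, so the whitespace has to be removed before the sequence can be measured or analysed. The following is a minimal sketch of that step and of a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence listing and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used here as a short sample.
first_line = "ATGAATAAAC CAGTAATAGG AATTGATCTC GGGACAACCA ACTCGTGCGT CGCAGTCTTT"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.1%}")  # GC fraction of the sample line only
```

Running the same two functions over the full listing gives values that can be compared with the Gene Length and GC content fields of the record.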
 
Protein sequence
MNKPVIGIDL GTTNSCVAVF NNKVEVIANV LGSRITPSCV SFDDNETIIG EGAKNQLGKN 
PENTVYGTKR LIGRDFDDPE VQHDITHFLF KVVNRNGKPF IQVQYKKEIR TLPPEEISAM
VLESVKCTAE EYLGVKVEDV VITVPAYFND SQRKATKAAG EIAGLNVLGI INEPTAAALA
YGQSNNKDCK ERNLLVYDLG GGTFDVSLVT HCKDVYEVRA SDGDSHLGGE DFDNILVDYF
ASEFIESYPC NLKSDKTSMA KLRKECESAK RRLCASPSTD IEISSLYDGK AFKSKLSRAK
FDELCGDLIM KTMNTVKAVI EAGGIIKSDV DEVLLVGGST RIPMVQKEVA KFFEGTKISK
KANADEVIAE GAAIQAHILS TEPRVVLLDV APKSLGLKAI GDRMVKMIPK NLAIPSTNSK
DFTTVSDYQT NLVVKVFEGE NEVCSKNRLL GEFELSGIQR ATKGEARIIV VFNVDDNGIL
KVSATESKTN KTKSLTITKL KESWTEKEKQ RVSTEIEEFK RDRELHSAMS KEIEQEIYKI
EHLLSAMNNV LTDPRIKSKI PAKIKRDIII QIIDANKWLL NADKESLQTC KDKFSTLKFF
SQACIPNGEG IDIGSTQSRL FAINDDKEFA SPSKPEAEVQ QNELELCVLT VPSILSSKLL
ANPDISGRNQ MVIQRVIDDI KKKTKLKYDI EFQQCYISLP RWYIEKNKLP SEFKIQDRCK
CLRNSYISKV PNGKVLFIDV GSQLEISCFA GNKIIFSHYF SKLGGNKFDK DLVASCVESF
EPRHISLYRK DPVAQKRLEF ACRLVKESFG DKDEVTANIY DFLDSTDFVR LISKDKLIAI
LKNSFNELRR AISEKLKIER VDEIDEIVLA GRAANVPGLQ NFITRAFKGA SIDTLDNFSI
AKGAALEMKK KERLEKIDKS YFTTPKKFFN VISKHAATTL HQGDGN