Gene PICST_78689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_78689 
SymbolSSC1.2 
ID4840318 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp1634183 
End bp1637118 
Gene Length2936 bp 
Protein Length647 aa 
Translation table12 
GC content45% 
IMG OID640391633 
Productmitochondrial heat shock protein of the HSP70 family upregulated 15 fold under aerobic conditions 
Protein accessionXP_001386012 
Protein GI126138978 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR02350] chaperone protein DnaK 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AACATCTACA ACTCCGTTTC ACTCCAAACA ACAACGATGT TAGCCGCTAG AAACTCGTTC 
AAGTCTGCTG CCGCACGTGC TCCAGCAGCT GCTGTCCGTT TCAACTCTTC TGCTGCTCCT
TCCGGTCCCG TTATCGGCAT TGATTTGGGT ACCACCAACT CTGCTGTAGC CATCATGGAA
GGTAAGGTTC CAAAGATCAT TGAGAACGCC GAAGGTGGCA GAACCACTCC TTCCATCGTT
GCCTTCACCA AGGAAGGCGA AAGATTGGTT GGTATCCCAG CAAAGCGTCA AGCTGTTGTC
AACCCAGAGA ACACTTTATT TGCCACGAAG CGTTTGATTG GTCGTCGTTT CGAAGACCAG
GAAGTGCAAA GAGACTTGAA TCAGGTTCCT TACAAAATTG TCAAGCACGA CAATGGTGAT
GCCTGGATTG AAGCCCGTGG CGAGAAATAC TCTCCACAAC AAATTGGTGG TTTCATCTTG
AACAAGATGA AGGAAACCGC TGAAGCAAAC CTCGGTAAGC CAGTCAAGAA CGCCGTCGTT
ACCTGTCCTG CTTATTTCAA CGACGCCCAG AGACAAGCTA CCAAAGATGC CGGTAAGATT
GTCGGTTTGA ATGTGATGAG AGTCGTCAAC GAACCTACTG CTGCTGCCTT GGCCTACGGT
TTGGAAAAGA ACGACGGTCA AGTTGTCGCC GTCTTTGACT TGGGTGGTGG TACTTTCGAT
ATCTCGATCT TGGACATTGG TGCTGGTGTT TTTGAAGTCA AATCGACCAA CGGTGACACC
CACTTGGGTG GTGAAGATTT CGACATTGCT GTAGTGAGAA ACATTGTTGA CAACTTCAAG
AAGGAGTCTG GTATCGACTT AGAAAAGGAC AGAATGGCTA TCCAGAGAAT CAGAGAAGCC
GCCGAAAAGG CCAAGATCGA ATTGTCTTCG ACTGTCTCGA CCGAAATTAA CTTGCCTTTC
ATCACCGCTG ATGCTTCAGG TCCAAAGCAC ATCAACCAGA AGATCTCCAG ATCCCAATTC
GAGACCTTGG TTGAACCTTT GGTTAAGAAG ACTGTTGATC CATGTAAGAA GGCCTTGAAG
GATGCCGGTT TGTCCACTTC CGACATTTCG GAAGTCATCT TGGTTGGTGG TATGTCGAGA
ATGCCTAAGG TCATTGAAAC CGTGAAGTCT ATCTTTGGTA GAGAGCCATC TAAGGCTGTC
AACCCTGATG AAGCTGTGGC TATGGGTGCT GCCATTCAAG GTGGTATCTT GGCTGGTGAA
GTGACAGATG TTGTATTGTT GGATGTCACC CCATTGTCAT TGGGTATTGA GACCATGGGA
GGTGTTTTCG CCAGATTGAT TTCTAGAAAC ACCACCATTC CAGCCAAGAA GTCGCAGATC
TTCTCCACTG CTTCGGCCGG TCAAACTTCG GTCGAAATTA GAGTTTTCCA GGGTGAAAGA
GAATTGACCA GAGACAACAA GTTGATTGGT AACTTCACAT TGTCTGGTAT CCCACCTGCT
CCAAAGGGTG TTCCACAGAT CGAAGTCACC TTCGACATTG ACACTGATGG TATCATCAAG
GTCTCTGCAC GTGACAAGGC CTCTAACAAG GACGCCTCCA TTACCGTTGC TGGTTCCTCC
GGTTTGTCTG ACTCTGAAAT CGAAAAGATG GTCAACGATG CCGAGAAGTT TGCCGAATCC
GACAAGGCTA GAAGAGAAGC CATCGAATCC GCTAACAGAG CCGACCAATT GTGTAACGAC
ACCGAAAACT CCTTGAACGA ATTCAAAGAC AAGTTGGAAA CTGCTGATGC CGACAAGGTC
AGAGAACAGG TTGCCGCTTT GAGAGAAATT GTCCTCAAAG CTCAAGCCGG TGAAGAAGTT
GATGCCGCTG AATTGAAGAC TAAGACCGAG GAATTGCAAA ACGAGTCCTT GAAGGTCTTT
GAAAAGTTGT ACAAGAACAA CGATTCCTCT TCAGAAAACT CTTCTGAACC AAAGAACTAA
GCAAGTGGGT GGTGACATGA AAATTCTTCT ATTATTAACT TGTATCATGA TTCATGACCC
TGACTAATTA TTTATTTGTA CATAATCTAG TTTGGGTTTT TTTCTTGGTG AATGCACCTA
CTTCTCTCTT GTAAATATGT TTATAACACT ACATAATGAA CACCTTTAAG ATGTACCTAT
TTATTATCAT TAATAGAAAG GAAATGTAAC TAGTCTCCTT GTAGTAAATC TAGACAAATC
TATCTTTCAA GTCGTTAGTG AGTTAAGCCT ATAATTGTGA AGAAGGCTAT TGCTCTATAC
CTTCAACTCA GTTAGACTGT GTTGAATGCC ATGCGATCGC CTCATAGGTC CCCTACCAGA
AGATTCTGGT TTCAACGTAT GACGTCTGAT TATGGTACCG TAGGTTCTGG TGAGTGGTTC
GGACAGATTG CCATAGGTGC TAGCAGATCG TTTCGATAAA CTTCTTGAGG CTATAGATTC
ACTAATAGAA GACAAAGAAT AAACGGAGGA TGAAGCGACC AGCTCTCCTT CAGATTCATA
TTCGAGCATT TCACCCAAGA TAGATCTTCT GTTCAACTGG TTGTGGAATT CCTCTTCTTC
CATACTAGGA TTAGAATCCC GACCCTGTGA TGCTATGATG GTTGGAGTAT CCGATCTTTC
TTCAGAAGAG CTGCTCAAGT TGGTGTTGGT TTGTGAGACC ATGTCGCTCA AAGAAATCTC
TTTGAATGAA GCCTCCTCTG ATGCCAAAGG CAATCTACTT TCTGTTTCGC TGATTTCTGT
TTCTGGTTCT ACACCTTCTT CGGCTGCAAA AACGGGTTCG GAATTAATAG ATTTGTTTTC
AAGAAGGTTT CTGAAGATAT TGATTTCTTG ACGAAGCATT ATACGCTTAG AAAGAACGTG
AGCCAAATAT CTTCTCCAAA TTTCGAGTGC AACGGCATTG TCATCTACTA TGGCCG
 
Protein sequence
MLAARNSFKS AAARAPAAAV RFNSSAAPSG PVIGIDLGTT NSAVAIMEGK VPKIIENAEG 
GRTTPSIVAF TKEGERLVGI PAKRQAVVNP ENTLFATKRL IGRRFEDQEV QRDLNQVPYK
IVKHDNGDAW IEARGEKYSP QQIGGFILNK MKETAEANLG KPVKNAVVTC PAYFNDAQRQ
ATKDAGKIVG LNVMRVVNEP TAAALAYGLE KNDGQVVAVF DLGGGTFDIS ILDIGAGVFE
VKSTNGDTHL GGEDFDIAVV RNIVDNFKKE SGIDLEKDRM AIQRIREAAE KAKIELSSTV
STEINLPFIT ADASGPKHIN QKISRSQFET LVEPLVKKTV DPCKKALKDA GLSTSDISEV
ILVGGMSRMP KVIETVKSIF GREPSKAVNP DEAVAMGAAI QGGILAGEVT DVVLLDVTPL
SLGIETMGGV FARLISRNTT IPAKKSQIFS TASAGQTSVE IRVFQGEREL TRDNKLIGNF
TLSGIPPAPK GVPQIEVTFD IDTDGIIKVS ARDKASNKDA SITVAGSSGL SDSEIEKMVN
DAEKFAESDK ARREAIESAN RADQLCNDTE NSLNEFKDKL ETADADKVRE QVAALREIVL
KAQAGEEVDA AELKTKTEEL QNESLKVFEK LYKNNDSSSE NSSEPKN