Gene PICST_80235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_80235 
SymbolSSE1 
ID4851119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp975430 
End bp977685 
Gene Length2256 bp 
Protein Length696 aa 
Translation table 
GC content46% 
IMG OID640392827 
Productheat shock protein of HSP70 family 
Protein accessionXP_001387826 
Protein GI126274105 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.153292 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.529839 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCCGATATCA TCCCATTTCT GAAAATTTTC AGTGATTGAG TTTTCAGTTT TTACCACCAC 
CACTTCTATA CCACTACCAA TACAATGTCC GCTCCGTTCG GTGTCGATTT AGGAAACAGC
AGCTCTGTCA TTGCCTGCGC CAGAAACAGA GGAATTGACA TCATTGTCAA CGAGGTTTCC
AACAGAAACA CTCCCTCGCT TGTAGGATTT GGCCCTAAGA ACAGATTCAT CGGCGAGTCC
GGTAAGAACC AGCAGACTTC GAACTTGAAG AACACCGTCG ACAACATCAA GCGTATCCTC
GGTGCCAACT TCAATGACCC AGATTTTGAA ATTGAGAAGA AGTACTTCAC ATGTCCACTT
GTTGAATCAA AGGATGGAGG CATCTCTGCT AAGGTCAGAT TCTTGGGCGA GCAGCAGGAA
TTCACTGCTA CCCAGTTAGC AGCCATGTAT ATCGATAAAA TCAAGGATAT CACAATCAAA
GAGACCAAGG CAAACATCAC TGATATTTCT TTGTCTGTTC CAGTATGGTA CACCGAGAAA
CAGCGTCGCG CCGCCGCCGA TGCTTGTAGA ATCGCTGGCT TGAACCCTGT CAGAATCGTC
AACGAAGTCA CAGCTGCTGC TGTTGGCTAC GGTGTCTTCA AGGCTAACGA CTTGCCAGAA
GATGAACCTA AGAAGGTGGC GTTTGTGGAC ATTGGTCACT CCTCGTACCA AGTCTCTATT
GCTGCTGTCA AGAAGGGTGA ATTGAAGATT CTTGCTTCTG CTTACGACAA GCACTTTGGT
GGTAGAGATT TCGACTACGC CATCGCCAGC CACTTTGCCG ACGAATTTGT TGGCAAGTAC
AAGATCGACG TCAGAGAAAA CCCTAAGGCT TTCTACCGTA TTTTGACTGC TGCTGAGAAG
TTGAAGAAGG TTCTCTCTGC TAACACTCAG GCTCCATTCA ATATCGAGTC CGTGATGAAC
GACGTTGACG TTTCTTCGTC TTTGACTCGT GAAGAGTTGG AAGAGTTCGT ACAGCCTCTT
TTAGCCAGAG TTCATGTTCC AATCGAGTCA GCTCTTAAGG AAGCTGGTTT GACTACTGAC
GACATCGACT CCATCGAAGT CATTGGTGGC TGTACCAGAG TTCCTTCTTT GAAGAACAAG
TTGAAGGACA TCTTTGGCAA AGAGTTGTCT TTCACTTTGA ACCAAGATGA AGCCATTGCC
AGAGGTAACG CTTTCATCTG TGCTACCCAC TCTCCAACCG TTAGAGTGAG ACCTTTCAAG
TTCGAAGACT ACAACCCATA CTCAGTCTCT TACTTCTGGG CCAAGGAAGA AGAAGACGAA
GACCACATGG AAGTTTTCCC AAGAGGTGGC TCGTTCCCTT CGACCAAGAT CATCACCTTG
TTCAGAAAGG GTGACTTCGA AGTCGAAGCC AAGTACACCA AGCCTGAAGA GTTGCCAGTT
GGCACTTCTC CACTCGTAGC CAAGTGGGAA ATCAAGGGTG TTGTTCCATC TGAAGGTGAG
ACTTCTATTG CCACCAAGAT CAAGTTGAGA AACGATCCTT CTGGATTCTA CACTATCGAG
GCTGCTTACA CCGTCGAAGA AAAGATCGTC AAGGAATTAG TGGAAAAGGA ACCAAAGGAA
GGTGAAGAAC AAGACGACGA AGATTCTGAA CCAGAATACC GTGAAGTTAA GAAGTTGGTC
AAGAAGGCTG ACTTGGAAGT TATTACTCAC TCTGCTTCTC TTGAACCAAG CGTCAGAGAA
GAGTTTATTG AAAAGGAAAA CGCTTTAGTC ATGGGTGACA AGTTGGTTGC TGATACTGAA
GACAGAAAGA ACGCTTTGGA AGAGTACATC TACGACTTGA GAGGTAAGTT GGACGACAAG
TACAAGGACT TCGCCTCCGA TGCCGAAAAG GAACAATTGA CTGCCTTGTT GAGCAAGACT
GAAGACTGGT TGTACGATGA AGGTTACGAT TCGACCAAGG CTAAGTACAT TGCTAAATAC
GAAGAGTTGG CTTCCAAGGG AAACCTCATC AAGGGCCGTT ACTTGCAAAA GGAAGAAGAA
AAGAAGCAAG CCTACAGACA GAAGCAAGAA GCTGCTCAAG CTGCTGCCAT GGCTGAGAAG
TTGGCTGCTG CCAGAGATGC TTCAAAGGCT GAACAGAAGC CAGCTCCTGA AGAAGCCGAT
GTTGACATGG ATTAGATTAG AGAGGTGCTC TTAAATATAT AGTTCTTAAT AATTTCCGGT
TACATATACA TAATTGAAGG AGTATTGTTG ATTGAG
 
Protein sequence
MSAPFGVDLG NSSSVIACAR NRGIDIIVNE VSNRNTPSLV GFGPKNRFIG ESGKNQQTSN 
LKNTVDNIKR ILGANFNDPD FEIEKKYFTC PLVESKDGGI SAKVRFLGEQ QEFTATQLAA
MYIDKIKDIT IKETKANITD ISLSVPVWYT EKQRRAAADA CRIAGLNPVR IVNEVTAAAV
GYGVFKANDL PEDEPKKVAF VDIGHSSYQV SIAAVKKGEL KILASAYDKH FGGRDFDYAI
ASHFADEFVG KYKIDVRENP KAFYRILTAA EKLKKVLSAN TQAPFNIESV MNDVDVSSSL
TREELEEFVQ PLLARVHVPI ESALKEAGLT TDDIDSIEVI GGCTRVPSLK NKLKDIFGKE
LSFTLNQDEA IARGNAFICA THSPTVRVRP FKFEDYNPYS VSYFWAKEEE DEDHMEVFPR
GGSFPSTKII TLFRKGDFEV EAKYTKPEEL PVGTSPLVAK WEIKGVVPSE GETSIATKIK
LRNDPSGFYT IEAAYTVEEK IVKELVEKEP KEGEEQDDED SEPEYREVKK LVKKADLEVI
THSASLEPSV REEFIEKENA LVMGDKLVAD TEDRKNALEE YIYDLRGKLD DKYKDFASDA
EKEQLTALLS KTEDWLYDEG YDSTKAKYIA KYEELASKGN LIKGRYLQKE EEKKQAYRQK
QEAAQAAAMA EKLAAARDAS KAEQKPAPEE ADVDMD