Gene PICST_84519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_84519 
SymbolSIS1 
ID4839815 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp1152177 
End bp1153446 
Gene Length1270 bp 
Protein Length344 aa 
Translation table12 
GC content44% 
IMG OID640391130 
ProductMolecular chaperone (DnaJ superfamily) 
Protein accessionXP_001385916 
Protein GI150866349 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0484] DnaJ-class molecular chaperone with C-terminal Zn finger domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.35649 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AGATTTCACA ACATGGTCAA GGAAACAAAA TTGTACGACT TATTGGAAGT TTCACCTTCT 
GCCTCTGAGA CGGAAATCAA GAAAGCCTAC AGAAAGGCAG CCTTGAAGTA CCATCCAGAT
AAACCTACTG GGGATACTGA AAAATTCAAG GAGGTCTCTG AAGCATTTGA TATTCTTTCC
AACGGAGACA AAAGGCAAGT CTATGACGAC TACGGCTTAG AAGCAGCCAG AGGAAATGCA
CCAGCTGGTG GAAATCCATT CGCTGGTGCC GGCTCTGGCA ATCCTTTTGG CGGTGCCGGA
GGTTATGGCG GAGGTCACCA CGGCTTTTCT CAGGCTGATG CCTTCAACAT TTTCTCACAG
ATGGGAGGAT TTGGAATGGG AGACGATGGA TTCAGCTTCA GCAGTAGTGG CCCTGGAGGT
TTTGGAGGTG GCCATCCTTT TGGAGGAGGT GCTGGTGGTA TGCCTGGAGG CTTTGGTGGC
CAGGGATTTG GCGGCCGTTC TGCTCGTCGT CCAGAGCCTG ATACCGTTTC TATGCCCTTA
CCAGTCTCTT TAGAAGATTT GTTCCATGGT GGTGTCAAGA AGATGAAGTT GAACAGAAAG
GGACTTCATG GAGAAAGAGA GAGTAAGGTG TTGGAAGTCA ACATCAAACC AGGCTGGAAG
GCCGGAACGA AGATCAACTT CACCAATGAA GGAGACTATC AGCCAGAATG TCAAGCCAGA
CAGACCCTTC AATTCGTGTT GGAAGAAAAG CCTCATCCTG TGTTCAAAAG AGACGGTACC
AGTAACAACT TGATTGTGAA CCTTCCAATA ACCTTCAAAG AATCCTTGTG TGGGTTCGAT
AAGGATATAA CCACTATTGA TGGAAAGAGA CTTCCATTCT CCAAGACTCA GCCAGTCCAA
CCTAACTCTT CAGCACTATA CCCAGGCTTA GGTATGCCAA TCAGCAAGCT GCCAGGCCAA
AGAGGTGATA TGGAAGTGAT TTTCAAAGTT GACTATCCTA TCAGTTTGAC TCCTCAACAA
AAACAAGCAA TACAGACCAA TTTCTAGGAT CAAGAAACAT AAAACACAAA CACAAATACA
AAACATAACG ACTACACAAT TCATCGCTGC GATTTGTCAC GATTAATAGA CGCATTACAT
AGAATGTTCC TCGGCTGGAC ATATCCATGA CATTCCTTCA CTTCTTGATA AACTTACTGC
TTTGCCGATT TGCATGCTAA TATAAATAGT ACTACTACAA TCAATATAAT CATTAGAACA
GAAATAGTAG
 
Protein sequence
MVKETKLYDL LEVSPSASET EIKKAYRKAA LKYHPDKPTG DTEKFKEVSE AFDILSNGDK 
RQVYDDYGLE AARGNAPAGG NPFAGAGSGN PFGGAGGYGG GHHGFSQADA FNIFSQMGGF
GMGDDGFSFS SSGPGGFGGG HPFGGGAGGM PGGFGGQGFG GRSARRPEPD TVSMPLPVSL
EDLFHGGVKK MKLNRKGLHG ERESKVLEVN IKPGWKAGTK INFTNEGDYQ PECQARQTLQ
FVLEEKPHPV FKRDGTSNNL IVNLPITFKE SLCGFDKDIT TIDGKRLPFS KTQPVQPNSS
ALYPGLGMPI SKSPGQRGDM EVIFKVDYPI SLTPQQKQAI QTNF