Gene PICST_31961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31961 
SymbolSCJ1 
ID4839396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp320035 
End bp321159 
Gene Length1125 bp 
Protein Length374 aa 
Translation table12 
GC content46% 
IMG OID640390711 
ProductdnaJ homolog in endoplasmic reticulum 
Protein accessionXP_001385080 
Protein GI150865743 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0484] DnaJ-class molecular chaperone with C-terminal Zn finger domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.108543 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGTTG TCACGATACT AGCAGTGCTT TTCGTAAACT TCGCACTTCT TATTGCGGCT 
GCGCAAAAGG ACTATTATCA GATTCTTGGA GTGAACAAGG ATGCTGGCGA AAAAGAGATC
AAGCTGGCCT ATAGACAGTT GAGTTTGAAA TACCATCCAG ATAAGAATCC TGGCAGTGAA
GAAGCACACG AGAAGTTTTT GGAAGTGGGT GAAGCTTACG ACGTTTTGAG CAATTCCGAA
AAGAGATCAA ACTACGACAA ATTTGGTGAC GCCAACGGAG GACCGTCTAA CCAGGAGTTC
CAGTTTGATT TTGGAGATAT GTTTGGACAA TTCTTTGGAG GTCACGGTGG TGGTGGTCAA
GGCGGCCAGA GAGTACGTAA GGGTGACAGC ACACAAGTAA ACTTGCATGT AGCACTTGGC
GATTTCTATA ATGGAAAGTT GTTGGAGTTT GATGTTGAGA TGATGAACAT CTGTGAGAAA
TGTGAGGGTA CTGGATCAAA GGATAGACAA ACCCATACAT GTGACAAGTG CAAGGGTGCC
GGAGTAGTGA CAGTTCGTCA TCAGCTTGCT CCCGGTATGG TTCAACAGGT CAGAATGCAA
TGTGACCAGT GCGGAGGTAA GGGTAAGACT ATAGCCCATA AATGTGGCTC CTGCTCAGGA
AAGGGTGTCC ATGCTGGACC CAGACATTAT GAAGTATACA TCAAACCGGG CCAGCCGCGC
GATTCCAACA TTGTTTTGCA TGGCGAAGGT GACAGGAATC CAGACTGGGT TCCCGGCGAC
TTGATTATTA ATGTCCGCGA GGAGTTCGTC AAGAGCTGGG GCTACAGACG GATCCATAGT
AATTTGTACA GAACAGAAGT CTTGACTTTG AACGAATCCA TCGAAGGAGG CTGGGAGAGA
AAGATTGCAT TTTTGGATGC CGAAGATAAC GTTCTTACGT TGAAGAGAGA AAAAGGTGTC
AGGGTTACAG ACGGAGAAGT AGAGATCATC AAGGGCAAGG GGATGCCATT GTTGGATGAG
CACCAGGACC ATAATGATGA TTACGGAGAT TTATTCATCC AGTACAAGAT CCTTGTAGCT
GGGGGTAAGG CACAGAAGTT GCTGCACGAG AAAGATGAAT TATAG
 
Protein sequence
MRVVTILAVL FVNFALLIAA AQKDYYQILG VNKDAGEKEI KSAYRQLSLK YHPDKNPGSE 
EAHEKFLEVG EAYDVLSNSE KRSNYDKFGD ANGGPSNQEF QFDFGDMFGQ FFGGHGGGGQ
GGQRVRKGDS TQVNLHVALG DFYNGKLLEF DVEMMNICEK CEGTGSKDRQ THTCDKCKGA
GVVTVRHQLA PGMVQQVRMQ CDQCGGKGKT IAHKCGSCSG KGVHAGPRHY EVYIKPGQPR
DSNIVLHGEG DRNPDWVPGD LIINVREEFV KSWGYRRIHS NLYRTEVLTL NESIEGGWER
KIAFLDAEDN VLTLKREKGV RVTDGEVEII KGKGMPLLDE HQDHNDDYGD LFIQYKILVA
GGKAQKLSHE KDEL