Gene PICST_89414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_89414 
Symbol 
ID4839023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp897093 
End bp898498 
Gene Length1406 bp 
Protein Length422 aa 
Translation table12 
GC content41% 
IMG OID640390338 
Productpredicted protein 
Protein accessionXP_001384124 
Protein GI150865066 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5159] 26S proteasome regulatory complex component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.607018 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCAAC CCTCGGCTCT TCTTGCCGAA GCCAGAGAGT ACGCCACGGC CAAAAACTAC 
GAAGCTGCTG AACAGAAATA CAAAGAGCTT ATCTCGAGCG ACAGCGAGAC AGAGTCACTC
ACCAAAAAAT TACAAGAGCA AGAAGCTGCC ATCATCGAAT TGGGTAAGGT TTACGAGACC
AACCACGACG CCACGAAGTT GACGGAATTG ATCGCCGAAT CAAGAAATGT GTTGGGCAAA
TTTGCCAAAT CGAAAGTGGC CAAAATTGTC AAGTCATTGA TTGAAGATTT TGATACGATT
CCTGACTCCT TGGACTTGCA GATTTCTGCC AGCAGAGAAT GTATCGATTG GGCAGTAGAG
AGCAAGCTTT CCTTTTTAAG ACAATCGTTG CAGTTGAAGT TGGCGGAGTT GCTCTACAAG
AAAACTTTGT ATCAGGAAGC TATCAAATAT ATCAATGACT TGTTGAGAGA GTGCAAGAAG
TTGGACGATA AGTCGCTGAT GGTAGAAGTT CAGTTGTTGG AATCCAAGAT TTACCATGCA
TTGCGCAACA TTCCCAAGTC GCGTGCAGCT TTGACTGGCG CTAGGACGTC GGCAAACTCA
ATCTATTGTC CAACATTGTT GCAGGCTGAA TTGGACTGCC AGAGCGGTAT TTTGAATGCC
GAGGACAAGG ACTACAAGAC TGCGTTCTCG TACTTTTACG AGTCGTTTGA AGGGTTCAAT
TCGCAAGACG ACGAGCGTTC TATCGTAGTT TTGAAGTACA TGTTGCTAAC CAAGATCATG
TTGAACTTGA TTGATGATGT CAACAAAATC TTGAACAACA AGAATGTCAT CAAGTACCAG
TCCAAAGATA TTGATGCAGT GAAGTCAATT GCGACTGCGT ACTCCAACAG ATCTTTGAAG
GAGTTCGAGA GCTCCTTGTT GACATACTCG TCAGAATTGA AGTCTGATCC TATCATCAAA
AACCACTTCA ACGCCTTGTA TGACAACTTG CTCGAACAAA ACTTGTTGAA GATCATAGAA
TCTTATAGTT GTGTAGAATT GTCGCATATT TCCAAGACTA TTGGCTTGAA TTTGCAACAG
GTAGAAGGCA AATTGTCTCA GATGATCTTG GACAAAGTAT TCTATGGGGT TTTGGACCAA
GGAAATGGCT GGTTGATCTT GTATGATGAA CCTAGAAGAG ACGCTGCATA CGATGCTTCG
TTGGATCTTA TCAAGAACTT GTCCAATGTA GTTGAATTAC TTTATGAGAA GGCATCATCA
TTGAATTAGA TAAATGCGAA TGAAATGGCA ATAGTACAAA GAATTGTAAG AGGTGATTTA
AGAATAAAAA GTGGATTACC TCCACAGTCT ATATTTATAG TACATATGTA TAGTGAAATA
TTCATCTAAT AGAACAATCA GATTTG
 
Protein sequence
MSQPSALLAE AREYATAKNY EAAEQKYKEL ISSDSETESL TKKLQEQEAA IIELGKVYET 
NHDATKLTEL IAESRNVLGK FAKSKVAKIV KSLIEDFDTI PDSLDLQISA SRECIDWAVE
SKLSFLRQSL QLKLAELLYK KTLYQEAIKY INDLLRECKK LDDKSSMVEV QLLESKIYHA
LRNIPKSRAA LTGARTSANS IYCPTLLQAE LDCQSGILNA EDKDYKTAFS YFYESFEGFN
SQDDERSIVV LKYMLLTKIM LNLIDDVNKI LNNKNVIKYQ SKDIDAVKSI ATAYSNRSLK
EFESSLLTYS SELKSDPIIK NHFNALYDNL LEQNLLKIIE SYSCVELSHI SKTIGLNLQQ
VEGKLSQMIL DKVFYGVLDQ GNGWLILYDE PRRDAAYDAS LDLIKNLSNV VELLYEKASS
LN