Gene PICST_78867 details

Gene Information

Locus tag: PICST_78867
Symbol:
ID: 4840265
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 1446589
End bp: 1447812
Gene Length: 1224 bp
Protein Length: 399 aa
Translation table: 12
GC content: 45%
IMG OID: 640391580
Product: predicted protein
Protein accession: XP_001385975
Protein GI: 150866394
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG5187] 26S proteasome regulatory complex component, contains PCI domain
TIGRFAM ID:
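
The gene length and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming (the record does not say so explicitly) that Start bp and End bp are inclusive 1-based coordinates; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1446589
end_bp = 1447812
protein_length_aa = 399

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A 399-residue protein needs 399 codons plus one stop codon.
cds_with_stop = (protein_length_aa + 1) * 3

# Bases in the listed gene sequence that lie beyond the stop codon.
trailing_bases = gene_length - cds_with_stop

print(gene_length, cds_with_stop, trailing_bases)
```

Under that assumption the span matches the listed gene length of 1224 bp, and the difference from the 1200 bp coding region agrees with the bases that follow the TAG stop codon in the gene sequence.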


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.505457
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACCTCG ATTCAGATGT GCCCCACATT CCAGACTACC GGTTGTCGGA AAAAGAGTTT 
CTTCTTTCCC AGACTGTTGA TGCTGGCGTG AGAGCTTCCA TTTTCCACGA TTTAACAGAG
GCCATAACCA AAGATGATTT GGCACCATAC TATCTCCATT TACATACTGA GTATGAAGAT
TTCCCGTATG ATGAAAAGGT TTACCAAGAA TTGGCAGCAA AAAACGAAGC CATTGCTGGC
GACTTGAAGC TGAAGCTCAA GGAAGTTGAA GGTGAAGACG AAACAGAGTT GGACATCTTG
GCTACGACGA TCCAATTGGC TGAACACTAC ACGCGTATTG TAGATAGAAC GAACGCATCT
GAGACGTTGA AGAAGGCATT GGACTTGTCG CAAGGTACTG GCTCGAAGAT CGATCTTCTC
TTGACTTTGA CGCGACTCGA ATTTTTCTTC AATGACTATG TGTTAGTCTC AAAGTATTTG
GACCAGATCA AGACGTTGAT CGATAAGGGA GGAGACTGGG AACGTCGTAA CAGATTCAAG
ACGTACCAGG GTATCTACTT ATTGGCGACA CGTAACTTTG CCGAAGCTGC CAAGTTGTTG
ATCGACTCGT TGGCAACCTT TACTTCTACC GAGCTCTGTA GTTACGAGCA AGTAGCACAA
TATGCGATTA CGGCTGGTGT TTTGTCGTTG GACAGAGTAG ACTTGAAGGA AAAGATCATC
GATTCGCCCG AGATTCTTCT GATCTACTCG TCGGCTCCAG AGACGGAACC ATTGCTCAAC
TTGACCAACT CCTTGTACAC ATGTCAGTAC AACTACTTTT TCCAGTACCT CTTGGAATCG
TACGACAAGC TCCTTGTACC CAACAAGTAT TTGCACAAGC ACGCCAGCTA CTTCTTGCGT
GAAATGCGGT GTAAAGCATA CGGCCAGTTA TTGGAGAGCT ACAAGTCTTT GTCGCTTAAG
TCTATGGCCC AAAACTTCAA CATCTCGGAA GACTTCTTGG ACCAAGACTT GTGCAAGTTC
ATTCCCAACA AGAAATTGAA TTGTACTATT GACAAGGTGA ACGGCATTAT TGAGACAAAT
AGACCGGACA ATAAGAACAA CCAGTACCAT TTGTTGATCA AGCAGGGTGA TGGCTTGTTG
ACGAAATTGC AGAAGTACGG TACAGCTGTG AAGTTGAGTG GGGCCGAGAG AGTAGCCTAG
ATGTAATATA TGTTGATAAA TGGT
 
Protein sequence
MDLDSDVPHI PDYRLSEKEF LLSQTVDAGV RASIFHDLTE AITKDDLAPY YLHLHTEYED 
FPYDEKVYQE LAAKNEAIAG DLKSKLKEVE GEDETELDIL ATTIQLAEHY TRIVDRTNAS
ETLKKALDLS QGTGSKIDLL LTLTRLEFFF NDYVLVSKYL DQIKTLIDKG GDWERRNRFK
TYQGIYLLAT RNFAEAAKLL IDSLATFTST ELCSYEQVAQ YAITAGVLSL DRVDLKEKII
DSPEILSIYS SAPETEPLLN LTNSLYTCQY NYFFQYLLES YDKLLVPNKY LHKHASYFLR
EMRCKAYGQL LESYKSLSLK SMAQNFNISE DFLDQDLCKF IPNKKLNCTI DKVNGIIETN
RPDNKNNQYH LLIKQGDGLL TKLQKYGTAV KLSGAERVA
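
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on a short fragment of the gene sequence. It is a hand-rolled illustration rather than the database's own pipeline: the `translate` helper and the table names are hypothetical, and the only change applied to the standard code is CTG to Ser.

```python
# Standard genetic code in TCAG order, then the single amino-acid change
# that distinguishes translation table 12: CTG -> Ser instead of Leu.
BASES = "TCAG"
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]

STANDARD_TABLE = dict(zip(CODONS, STANDARD_AA))
TABLE_12 = dict(STANDARD_TABLE, CTG="S")


def translate(dna, table):
    """Translate an in-frame DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = table[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the fifth row of the gene sequence (bases 241-270, in frame).
fragment = "GACTTGAAGC TGAAGCTCAA GGAAGTTGAA"

print(translate(fragment, TABLE_12))        # table 12 reading
print(translate(fragment, STANDARD_TABLE))  # standard-code reading, for comparison
```

With table 12 the fragment reads DLKSKLKEVE, matching residues 81-90 of the protein sequence; the standard code would give DLKLKLKEVE instead.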