Gene PICST_54531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_54531 
Symbol 
ID4837003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1923039 
End bp1924712 
Gene Length1674 bp 
Protein Length558 aa 
Translation table12 
GC content43% 
IMG OID640388318 
Productpredicted protein 
Protein accessionXP_001383141 
Protein GI150864363 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5560] Ubiquitin C-terminal hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.737555 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTCGA ATTACAAATT TGCTAACCGA TCTCGGTTCA CAACCAGGAA GTCCAAGTCC 
CACCGTTTCG CAATCAAAAA TGGTGGCGAG ATTGGCGGCC TCAGTAACGA CGGTAACACT
TGTTTTATGA ACTCGGTGCT TCAGTCGTTG GCATCATCCC GTGAGTTGCT TAAGTTTATC
GACTCCTACC TTTTCTCGGA AACAGCACTT CAGACTCCAC AAGGAAACGT GATCTCCATG
AAATCCAACT CTCCTAGACC AGAATTAATC TTCACCAATG CATTAAAAAC GTTGTTGGAT
AATCTCAATG GTAAATATGG AGCTCGTGGC AAGGAGTTCA GTACGAAAGC TCTCTTGAAC
AAGATGCCCA ATGGCCCCAA ACAGAACTTT TTCTCAGGCT ACAATCAGGA AGACGCCCAA
GAGTTCTACC AGCTCGTAAT GAGTTTGTTG GAACGGGAAT ACAAGAAAGT TTCCCAATCC
AGATTACCCA CCCCAGAACC AGAAGAAAAG GCCGGAAATC AGGAGAAGCA AGTGAGATTC
CTCGATATCG AAAATGTGCC GAACGTCGTC TTCGGCTGCG AAAAATTAGG AAAATTGGGT
AAAGTGTACG TTCCAGCCAA CCAGGTTGAC CCCAACTTGG TAGACTGTGA TCACAAGGTG
TTCCCACTTG AGTTGATTAC TCCTGTAGAT GGAATCTCAG CCGAAAGAAT CGGTTGTTTG
TCCTGCGGAG AAGTAGGTGG CATCCGTTAT TCTGTCAATT CAGGCTTGAG TTTGAATTTG
CCAAACAATT CTTCCTACTA CTCCAGCTTT GACTTGCTTC TGTTAATGAA CGACTGGATC
ACACCAGAAA TCATTGAAGA TGTCAACTGT AACAGATGTG GTTTGAATCA AACCAAAGAA
TTCCTTTTAG AGACTTTGCA GGACCTTCAA GCAAAGCCAA ACGGAGACAA GTTGAGGGAA
CAATTTCAAA TTCGTCTCGA TGCTATCGAT TCGGAATTGT TGAAACCTCA TATCACAGAC
GAAGTGTTCG AAAAGCTCAC CATAAAAAAG CATATTAGAA AAAGTAGAAA ATCCAAACAG
ATACTCTTGA GTAGACCGCC TCCGTTGTTG TCCATCCATA TAAATAGATC GGTATTTGAC
CCTAGAACAT ACATGATAGT CAAGAACTCC AGCAGTGTGA CTTTCCCGTC AAAGTTAAAC
TTGGCTCCAT ACATTGCCGA ACCAAGGGAT ATTAACATGG ACGCTAGATT GCCTTTCAGA
AAACAGGAAG AGAGAACAGT TTTGCAGCAG GAACAGACTA AAAATTCAGA AACACTTCCT
TCAACTCCGT CAGACTCGTC GTTGTCTTCC ACTCCTGAAG AAACGAAGTC AGACTCTACT
ACTAGCACCG ATATAGACAA CTTGGAACCA CTTCCAGTAG ATCCCAAGTT GTTGTACAAC
TTGAAGGCAG TAATCTCCCA CTATGGTACT CACAACTACG GCCACTACAT TTGCTACCGT
CAGTTGAGGG GTACATGGTG GAGAATTAGT GACGAATCTG TATATGTTGT CACTGAAGAA
GAAGTATTGA ACGCTCAGGG TACATTTATG ATCTTCTACG AGTTCGATGA TGGGCATAAG
GAATTTTTGC AAGATGTTTC TGATAGTGAA GAAGAGGAGG AGGAAGAGGA AGAT
 
Protein sequence
MSSNYKFANR SRFTTRKSKS HRFAIKNGGE IGGLSNDGNT CFMNSVLQSL ASSRELLKFI 
DSYLFSETAL QTPQGNVISM KSNSPRPELI FTNALKTLLD NLNGKYGARG KEFSTKALLN
KMPNGPKQNF FSGYNQEDAQ EFYQLVMSLL EREYKKVSQS RLPTPEPEEK AGNQEKQVRF
LDIENVPNVV FGCEKLGKLG KVYVPANQVD PNLVDCDHKV FPLELITPVD GISAERIGCL
SCGEVGGIRY SVNSGLSLNL PNNSSYYSSF DLLSLMNDWI TPEIIEDVNC NRCGLNQTKE
FLLETLQDLQ AKPNGDKLRE QFQIRLDAID SELLKPHITD EVFEKLTIKK HIRKSRKSKQ
ILLSRPPPLL SIHINRSVFD PRTYMIVKNS SSVTFPSKLN LAPYIAEPRD INMDARLPFR
KQEERTVLQQ EQTKNSETLP STPSDSSLSS TPEETKSDST TSTDIDNLEP LPVDPKLLYN
LKAVISHYGT HNYGHYICYR QLRGTWWRIS DESVYVVTEE EVLNAQGTFM IFYEFDDGHK
EFLQDVSDSE EEEEEEED