Gene PICST_61453 details


Gene Information

Locus tag: PICST_61453
Symbol: USP39
ID: 4839739
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 59777
End bp: 61246
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 12
GC content: 40%
IMG OID: 640391054
Product: ubiquitin specific protease
Protein accession: XP_001385345
Protein GI: 150865930
COG category:
COG ID:
TIGRFAM ID:
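The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length counts the terminal stop codon; both are assumptions and are not stated in the record. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 59777
end_bp = 61246

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under these assumptions the computed values agree with the reported Gene Length (1470 bp) and Protein Length (489 aa).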


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.981787
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.483859
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAATGGAA ACGATTCTAC TCCTTTGCTA AAGAAACAAA AAGTAGCAGC TCATGAAGAT 
TCACAAAGTG ATGCCGAATT CGATTTTCCA GAATATATGA CAAGCAATGT CAACCATTCG
GCACAACAAG ATACTCTTTA CCTAGATACC ATCCACAGGC CTCTATTGGA CTTTGATTTC
GAGAAAGTAT GCCTGGTGAC TCTTTCCAAC ACCAATGTTT ATTGCTGTTT GGTTTGCGGA
AAGTATTTCC TGGGAAGAAG CAGAACTTCT CCATCGTACA CCCATTCGCT CGAGAACAAT
CATCATGTCT TCATAAATCT TCACAACGAA AAGTACTATG TTCTTCCAGA AAACTATGAA
ATTACATCAT ATTCGGCCCT AAAATCGCTA AGAGATATCA ACAACTTCTT GAATCCCAAA
TACACCAAAT CAGACATTGA AAATCTTCCT GTTACTGCTC GAGATCTCGA TCATAAATCC
TACGATGTGG GCTATGTAGG ACTAAACAAT ATATCAGCCA ATGACTATTC CAATGTCGTT
ATACAAGCTC TCAGTCATAT AGTTCCAATA AGGAATTGGT ACTTGAGTTT ATCTCATCGA
CAGGACTTAA ACGCAAGATT GGAATCCATG TCTCCGTTGA ATTACAATTT TGGAATATTG
GTAAGAAAGC TTTGGTCCAA GTACTTGTAC AGAAACCATG TATCTCCACA CGAATTTTTA
CAGTACTTGT CGAGCACCAC AAAGAACCAA TTTCTGATAA CTCGGCAATC GACTCCCAAG
AAGTTGTTGG TGTGGTTATT GAATAATTTG CATCTTCAGC TAAGCAAGCT GATCAAGAAG
TCACTGACTG TACTCAGTGA CAATCTTCGA GGTCGGATAA AAGTCACTTC AATGGAAGTT
CATACCAAAG AAGTTAACGA TAAAGTTGAG TTCGAGACCG ATGAAAATTC TGCAAAGGAG
CTGGTACTGT CTTTTTGGGT TTTAACTCTC GATTTGATAT CATCACCGTT GTTCATGGAT
GTTACTAAAA TACAGGAAGT AGCATTAACC AAGTTATTGC AGAAGTATGA TGGTAAACAA
ACCAGTCAAG TTTCCTCCTC AGAGTTGAGG ACATATAAGC TTATTGCGCC GTTGCCACGA
TACCTCATCT TCCATATAGA CAGAGGACTC GAGAAGGACG GATCCACGAG AGGAAACCCT
ACAGTAGTAC AGTTTTCTAC AACTATCGAT ATGGCTCCGT ATACTGAAGG AGCTGATCAG
TCGCTTAACT ATAAGTTGCT CAGTACTATA AAGCGCCAGC TGACTGAAGG TGTCAAATTG
GATCACAGTG ATGACAAAAA CAATTGGGCT ATAAGTCTTC GTCGTACGGA TGATTCCTGG
GTATGTATTA AGGATTTGGA ATTCAACGCC TGTGAAGGAG GATTGCTTTT CCTAGAAGAG
AACTATTTAC AGATATGGGA AAGATGCTAG
 
Protein sequence
MNGNDSTPLL KKQKVAAHED SQSDAEFDFP EYMTSNVNHS AQQDTLYLDT IHRPLLDFDF 
EKVCSVTLSN TNVYCCLVCG KYFSGRSRTS PSYTHSLENN HHVFINLHNE KYYVLPENYE
ITSYSALKSL RDINNFLNPK YTKSDIENLP VTARDLDHKS YDVGYVGLNN ISANDYSNVV
IQALSHIVPI RNWYLSLSHR QDLNARLESM SPLNYNFGIL VRKLWSKYLY RNHVSPHEFL
QYLSSTTKNQ FSITRQSTPK KLLVWLLNNL HLQLSKSIKK SSTVLSDNLR GRIKVTSMEV
HTKEVNDKVE FETDENSAKE SVSSFWVLTL DLISSPLFMD VTKIQEVALT KLLQKYDGKQ
TSQVSSSELR TYKLIAPLPR YLIFHIDRGL EKDGSTRGNP TVVQFSTTID MAPYTEGADQ
SLNYKLLSTI KRQSTEGVKL DHSDDKNNWA ISLRRTDDSW VCIKDLEFNA CEGGLLFLEE
NYLQIWERC
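The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. This explains why, for example, the gene sequence GAG AAA GTA TGC CTG GTG ACT corresponds to EKVCSVT in the protein sequence. The sketch below illustrates this; the helper names `build_table_12` and `translate` are hypothetical, and the table is built by hand from the standard code with the single CTG reassignment rather than taken from any library.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G codon ordering.
BASES = "TCAG"
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table_12():
    """Return a codon -> amino acid map for translation table 12."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AAS))
    table["CTG"] = "S"  # the reassignment that distinguishes table 12
    return table


def translate(dna, table):
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


TABLE_12 = build_table_12()

# First row of the gene sequence above.
first_row = "ATGAATGGAA ACGATTCTAC TCCTTTGCTA AAGAAACAAA AAGTAGCAGC TCATGAAGAT"
print(translate(first_row, TABLE_12))

# Stretch containing a CTG codon (from the fourth row of the gene sequence).
print(translate("GAGAAAGTATGCCTGGTGACT", TABLE_12))
```

The first call reproduces the first 20 residues of the protein sequence, and the second shows the CTG codon read as S.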