Gene Information |
Locus tag | PICST_61453 |
Symbol | USP39 |
ID | 4839739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 59777 |
End bp | 61246 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640391054 |
Product | ubiquitin specific protease |
Protein accession | XP_001385345 |
Protein GI | 150865930 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
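The length fields above follow from the coordinates: the gene spans Start bp through End bp inclusive, and because the gene is not spliced, the protein length is the codon count minus the terminal stop codon. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (the variable names are illustrative and not part of the record):

```python
# Coordinates copied from the record; assumed 1-based and inclusive.
start_bp = 59777
end_bp = 61246

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codons = gene_length // 3             # unspliced CDS, so every base is coding
protein_length = codons - 1           # the final codon is the stop codon

print(gene_length, protein_length)    # 1470 489
```

Both results agree with the Gene Length (1470 bp) and Protein Length (489 aa) fields.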
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.981787 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.483859 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGGAA ACGATTCTAC TCCTTTGCTA AAGAAACAAA AAGTAGCAGC TCATGAAGAT TCACAAAGTG ATGCCGAATT CGATTTTCCA GAATATATGA CAAGCAATGT CAACCATTCG GCACAACAAG ATACTCTTTA CCTAGATACC ATCCACAGGC CTCTATTGGA CTTTGATTTC GAGAAAGTAT GCCTGGTGAC TCTTTCCAAC ACCAATGTTT ATTGCTGTTT GGTTTGCGGA AAGTATTTCC TGGGAAGAAG CAGAACTTCT CCATCGTACA CCCATTCGCT CGAGAACAAT CATCATGTCT TCATAAATCT TCACAACGAA AAGTACTATG TTCTTCCAGA AAACTATGAA ATTACATCAT ATTCGGCCCT AAAATCGCTA AGAGATATCA ACAACTTCTT GAATCCCAAA TACACCAAAT CAGACATTGA AAATCTTCCT GTTACTGCTC GAGATCTCGA TCATAAATCC TACGATGTGG GCTATGTAGG ACTAAACAAT ATATCAGCCA ATGACTATTC CAATGTCGTT ATACAAGCTC TCAGTCATAT AGTTCCAATA AGGAATTGGT ACTTGAGTTT ATCTCATCGA CAGGACTTAA ACGCAAGATT GGAATCCATG TCTCCGTTGA ATTACAATTT TGGAATATTG GTAAGAAAGC TTTGGTCCAA GTACTTGTAC AGAAACCATG TATCTCCACA CGAATTTTTA CAGTACTTGT CGAGCACCAC AAAGAACCAA TTTCTGATAA CTCGGCAATC GACTCCCAAG AAGTTGTTGG TGTGGTTATT GAATAATTTG CATCTTCAGC TAAGCAAGCT GATCAAGAAG TCACTGACTG TACTCAGTGA CAATCTTCGA GGTCGGATAA AAGTCACTTC AATGGAAGTT CATACCAAAG AAGTTAACGA TAAAGTTGAG TTCGAGACCG ATGAAAATTC TGCAAAGGAG CTGGTACTGT CTTTTTGGGT TTTAACTCTC GATTTGATAT CATCACCGTT GTTCATGGAT GTTACTAAAA TACAGGAAGT AGCATTAACC AAGTTATTGC AGAAGTATGA TGGTAAACAA ACCAGTCAAG TTTCCTCCTC AGAGTTGAGG ACATATAAGC TTATTGCGCC GTTGCCACGA TACCTCATCT TCCATATAGA CAGAGGACTC GAGAAGGACG GATCCACGAG AGGAAACCCT ACAGTAGTAC AGTTTTCTAC AACTATCGAT ATGGCTCCGT ATACTGAAGG AGCTGATCAG TCGCTTAACT ATAAGTTGCT CAGTACTATA AAGCGCCAGC TGACTGAAGG TGTCAAATTG GATCACAGTG ATGACAAAAA CAATTGGGCT ATAAGTCTTC GTCGTACGGA TGATTCCTGG GTATGTATTA AGGATTTGGA ATTCAACGCC TGTGAAGGAG GATTGCTTTT CCTAGAAGAG AACTATTTAC AGATATGGGA AAGATGCTAG
|
Protein sequence | MNGNDSTPLL KKQKVAAHED SQSDAEFDFP EYMTSNVNHS AQQDTLYLDT IHRPLLDFDF EKVCSVTLSN TNVYCCLVCG KYFSGRSRTS PSYTHSLENN HHVFINLHNE KYYVLPENYE ITSYSALKSL RDINNFLNPK YTKSDIENLP VTARDLDHKS YDVGYVGLNN ISANDYSNVV IQALSHIVPI RNWYLSLSHR QDLNARLESM SPLNYNFGIL VRKLWSKYLY RNHVSPHEFL QYLSSTTKNQ FSITRQSTPK KLLVWLLNNL HLQLSKSIKK SSTVLSDNLR GRIKVTSMEV HTKEVNDKVE FETDENSAKE SVSSFWVLTL DLISSPLFMD VTKIQEVALT KLLQKYDGKQ TSQVSSSELR TYKLIAPLPR YLIFHIDRGL EKDGSTRGNP TVVQFSTTID MAPYTEGADQ SLNYKLLSTI KRQSTEGVKL DHSDDKNNWA ISLRRTDDSW VCIKDLEFNA CEGGLLFLEE NYLQIWERC
|
| |
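The protein sequence above is read under translation table 12 (the alternative yeast nuclear code), in which CTG encodes serine rather than leucine. The sketch below is a hedged illustration, not the database's own pipeline: it builds the standard code from the NCBI amino-acid string, overrides CTG, and translates bases 181-210 of the gene sequence. The names `STANDARD`, `TABLE_12`, and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

BASES = "TCAG"
# NCBI translation table 1 (standard code); codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
STANDARD = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), STANDARD_AA)}

# In its sense-codon assignments, table 12 differs from table 1 only at CTG (Ser, not Leu).
TABLE_12 = dict(STANDARD, CTG="S")


def translate(dna, table):
    """Translate a DNA string codon by codon; spaces are ignored, '*' marks a stop."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# Bases 181-210 of the gene sequence, which contain one CTG codon.
fragment = "GAGAAAGTAT GCCTGGTGAC TCTTTCCAAC"

print(translate(fragment, TABLE_12))  # EKVCSVTLSN
print(translate(fragment, STANDARD))  # EKVCLVTLSN
```

The table 12 result matches residues 61-70 of the protein sequence (EKVCSVTLSN), while the standard code would place a leucine at the CTG position.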