Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_62908 |
Symbol | |
ID | 4840572 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | - |
Start bp | 110519 |
End bp | 113368 |
Gene Length | 2850 bp |
Protein Length | 879 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640391887 |
Product | predicted protein |
Protein accession | XP_001386236 |
Protein GI | 150866587 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5533] Ubiquitin C-terminal hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAGTG AAGTCAATGA CAAGTCCAAC CAACAGTATT ACACCAAAGT GATCAACCAC GAATTCCGGA ACCTCCAGCT CGATCCTGAA TTAGCGTCAA AATCGCTATT TGATCTCATT GACTACTGTG AACTTTTATA CGAGAAGTCT GCCACGGCGT TCTCTTCCGG TGATCACCAC AACGGATTGA AGAATTACAT CAAGGGATAC TTGATCTTCA ACTATTTCAT TAACAGCTTC ATTATGCTCC ACTTTAAGGG CTTTGACGCC TTTGTCGAGT CCAACGAGCA GGACTTCATC ATCTATCTCA ATGTATTTGC CTTTTATAAC GCTGACGACA TTATACGAAA CAGCTCACAC ACAGTTTCGC TGGCGACGCT TCGCGGCTAC ATCAAAAAGT ACCTTGTCGA CTCAAACTTG CTTAGTTTTA ACGTGGAGGA GCTTTATGCC TGGCTCCATG AGTATATAAA ATACCTCAAG GAAAAAGACC GATCCATGGT GGAAATCAAT ATCTCCAGTG GTGACGAAAT GCGCTCAGAT GGTGATCTTT TTGTCGAAAC TGCAACGTCG TCTTCAACAA GACAGACGTC TAATGGAAAT TCTCTGATCA CAAAGTCAAA GAAAACTTAC AGTTTGAACC CGCTTTTGAA CCCGGAATTG AACTCTAGTG AAGAGTTCCT AACAGTGGCC GATAAAAGTC AAAATACCTA TGGTAGCGAC ACTAGACAAG ACTATAACGA TAACTATGAC CAATACATTC AAGAAAAAGA GGCCCTTCTG CCAAGCGATT CTGTCTCAGA ATTCAAGCAC AGATTTCCGT CCTTAAATGG GAAAAATGAT ACATCGCTCT TTGGTTCTAA GAAGCTTCCT CCACCTTCGC CTCCAAATAT TCCACATCCA ATTCCAAAGC ACTTTTCTCC ATCTTTGCTT CCTCCACATT CTCCGCCACC TCCTCCGCCT CATTCTGCTG CAAATACATT ACAACCAAAT AGAGCTAGTA CGTTTCCAGG ATCACCTACA GCTTCGGCTC CGTATCCTCC TGACGAAATT ACAAGCAGAT CACCAATTCC GTCGCCAATA TCTCCGGGTT TGAGATCTCC GGTTGCTCCA GAATTGGCCA ACGATACAGA CGAGTGGTAT TTCCTGAATC CACAAAAATC ACCTACTGAT GCAAACCCAA ATACAAATTC ATATAGCAAT CTACAGAACA ATAGCGGCAA TATTTCCCAC CGTCCTCCAC AAACGATTCC CATAGCTAGG CCGGTGACGC TGCCTACAAA CGTCAATGTC CATTACAATA GTCACGAAAT CCACCAACAG AATGGTAACA ATAACCACTA CATGAATAAC AATAATGTCT TTCACAATGG CCAACAATCC AACAACCCAT TTCCTAACTA TCCTCAACAG CAAGAGGTAG GCCAGCAAGT AAATCAACAA GGAGCGAATG GTTATTATGA CAATAATGGT CCTAGCAGTG TGCAACATTT CGGAGCACCT CGCTCTTCTG TTCCTCAACA TGTGCAGATG CAACAGTATC AGATCAAAAC TCAGAAGCAG CAATACATGC AAGAGTATTC TGTTTGTGGA TTGAGAAACT TTGGTTCTTC CTGTTATATC AACCTGACAA TACAGTTGAT ATTCGGTGTT CTGTTGTTCA AGTCGTTGTT CATCAACCTG GCGTACCAGA GATATGTCAA GGATCCCAAG TACTTGAGAC TTATCCTGTC GCTGAAGTTG AACAGTCACC ACAAAGATTC TATTCTTCTC TCTGAAGCAA TTTCAGCATT GTTACGAACT TTCTCACAGC ATGGAAGTGT TTCTATAGCT CCTACAAAGT TCATCAGAGT GACGTCCTTG TTGAAGCCAG ACTTCAATAT TCCATACGAA CAGCAGGATG CACAGGAGTT TCTACTCTTT GTTTTGGAGA GACTTCATCT GGAGTTGTCC AATAAGAGCA TTGAAACCAA CTACGAGCTA GAGGACTATA TTCGTAAGTG GGATATTAAT GTCAATATGA AGGATAGAAA CGAATACTTG AAGTGGTACT TGTCCCTTGT GAAACTGGAA GGAACGTCTC CAGTCCATGA TCTTTTCCAG GGCCACTTGC AAAACAAATT GACATGTAAT TCTTGTGGAT ATGAGTCTAT TAGTTATTCT CCATTTACCA TTTTGTCACT ACCGATTCCA AGCAGCCATA GCAGTAAGAA TGTGGTCAAC TTAGCCGATT GTTTACGCTA CTACTCCCAG GATGAAGTTC TCAGTGGAGA AAATGCCTGG AACTGTCCCA AGTGTAGCAA GAACGGAGAC CAAGTTTCTG CTAGCAGTGT TCTTGATAAT CATCCTGTTT TCGTCAACAA GAAGTCAGGA ATCTTCAAAC TAGGCAGAAG AAGCAAATCG CCTTCTTCTC AGACGAGCAC GAAAACAACG ACAAGTTCCA TTCACTCCAA CATTTCAACG AAGAGTTTGA ACTTCATCAA ATTGCCACAG ATTTTGATTA TTCAACTCTC CCGGTTTTCA GTCTTCAACT TGACTGATAA GTTAGACACA TACATTCAAT ATCCGTTGAA ATTGAAGTTC AATAATGACG GTCATGAAAT TGTGTACAAA TTATCCGGTT TGATAAATCA CTTTGGTAAT CTTAAAAGCG GGCACTACAC ATCCATTGTC AACAAATCTA CGGTCAATCA AAATCTAGGA AGTAATTTGG ACAATCTCAA AGTTCCTTAT TGGTGTTTGT TTGATGATGA GAATGTGAAG GTGAACTTGC CTCACGGAAG TATTACGCAG CCTGGTATGG GTGCTATCAA CTCCAAGGAT GTCTATGTGT TGTGCTACGA ACGAGTCTAA
|
Protein sequence | MNSEVNDKSN QQYYTKVINH EFRNLQLDPE LASKSLFDLI DYCELLYEKS ATAFSSGDHH NGLKNYIKGY LIFNYFINSF IMLHFKGFDA FVESNEQDFI IYLNVFAFYN ADDIIRNSSH TVSSATLRGY IKKYLVDSNL LSFNVEELYA WLHEYIKYLK EKDRSMVEIN ISSGDEMRSD GDLFVETATS SSTRQTSNGN SSITNDSVSE FKHRFPSLNG KNDTSLFGSK KLPPPSPPNI PHPIPKHFSP SLLPPHSPPP PPPHSAANTL QPNRASTFPG SPTASAPYPP DEITSRSPIP SPISPGLRSP VAPELANDTD EWYFSNPQKS PTDANPNTNS YSNLQNNSGN ISHRPPQTIP IARPVTSPTN NGNNNHYMNN NNVFHNGQQS NNPFPNYPQQ QEVGQQVNQQ GANGYYDNNG PSSVQHFGAP RSSVPQHVQM QQYQIKTQKQ QYMQEYSVCG LRNFGSSCYI NSTIQLIFGV SLFKSLFINS AYQRYVKDPK YLRLISSSKL NSHHKDSILL SEAISALLRT FSQHGSVSIA PTKFIRVTSL LKPDFNIPYE QQDAQEFLLF VLERLHSELS NKSIETNYEL EDYIRKWDIN VNMKDRNEYL KWYLSLVKSE GTSPVHDLFQ GHLQNKLTCN SCGYESISYS PFTILSLPIP SSHSSKNVVN LADCLRYYSQ DEVLSGENAW NCPKCSKNGD QVSASSVLDN HPVFVNKKSG IFKLGRRSKS PSSQTSTKTT TSSIHSNIST KSLNFIKLPQ ILIIQLSRFS VFNLTDKLDT YIQYPLKLKF NNDGHEIVYK LSGLINHFGN LKSGHYTSIV NKSTVNQNLG SNLDNLKVPY WCLFDDENVK VNLPHGSITQ PGMGAINSKD VYVLCYERV
|
| |