Gene Information |
Locus tag | PICST_66630 |
Symbol | |
ID | 4851749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 2734760 |
End bp | 2737612 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | |
GC content | 40% |
IMG OID | 640393457 |
Product | predicted protein |
Protein accession | XP_001387091 |
Protein GI | 126275482 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5021] Ubiquitin-protein ligase |
TIGRFAM ID | [TIGR01053] zinc finger domain, LSD1 subclass |
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.214435 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTACGAA GAAAAAGTAG CGTCGTTGCT CAGAACCAGA ACCTGCCTCG AAATCAAAAT CATAATCAAA GTCATAACCA GATTAACAAT GCTGAGCAAA ACACCAATAG TAGCAACAGT AGCCAGATTT CTTCTACAAT TGCCTCTGAG CCAAGATCAA CTCCCTGGAG AATCAAGGAT TTCCTCAAGC TCCCTTCAGC TCCTAAGGGT CTGAACTCAA ATACTGTCAG CACTTCTCCT ACTAAGTCAA ATGTAGAGCA GTTGGTTGCT CCGACTACTT CAACTTCAAT CTCAAATACA AGTGCTAATT CAAATTCTGC GGCCACCTCA GATTCCATTC CTACTTCTGT AAGCGGATGT TATTGCTGTG GAACTCTTCT CACATACCCT ACAAAAGCTT CTAAGTTCAG ATGTTCAGTC TGTAACACTA CCAACATTCT CGCTGTAGCT CCCGAAAATT CTGCAGAAGG CAGCAATGAA GCTGTTCACA TAATTTCGTA CGATTACGTA AAAAAGCAAG TAGAGAAATG CTTGAAATAC TTGAACCAGT CCCTGGATAA GTCAATTCAC GAAGTGTTCG AACCATTGTC TGATTACCTA TACGATGCAT TCAAAAACTA TCATATTTTA TCGAAGTCGT TCAAAACTCG TAGATCTAGC CAAAATCAGC ATTACCACAC TTCCAAGATC AACTATGAAG AGATCCACAA TACCTTCTTA CTCTTACTGA AATTGCCTAC TAAAAGACCT TTGTACAATG CCCTTAGAGG AGCTTCTCAC TTGCTAAAGA GAGTCTATGT ATTTCCAAAA GGCAACGATG CCAGTTCCTA TGTCTGGGTA TTGATTTTGA TGGAAATTCC CTTCCTTTCA AGATCGCTTC TATCTCCTGG AGACGATTCT AAAGCTAAGT CCATGATAGA TGTGCCTGAA ATTAAAGGAT TATGCTATGA TATCTTGAAG CGTTGTATAG GGATTTTGGC TTGCATAGAA TCCGTAGGTG CCATCAACTA TATAACCAGT TGGTACGCCA ATCTTCCCAC TGCTGAATTC GCCAAAAAGG TTGATCTTCT CAATCTCTAT ATCACCTTCC ATTTAAAGAA ATACTTCTAT ATTGCAAACA ATCCGCACTT GCTTAGAAGA ACGTCTTCCA CTGCTGCCTA TAATGCCAGC ACTAGCGGGT CTCGGACTGA AGATAGCCAT CCCACGGATC GTGAGTACTC TGAAAATGTT CATATTAAAG AAGAAATCGA TGCCATGAAC CAGGAAACCC CAAATTTCAC AAATCAATTG GCATTACCCA CTTCGTATTT GGGCTCATTT CCAGCTCGTA GAAATTCTAA AAAGAATCAG CTGCAAGAAG CAAAGATCAA GATATACCAG TATGGAAATG ATTGGCATAT TAAGACGGCT GCAATCCTGT TGTCTTTCTA TCTATGTGCA AATACTTACA GAGTAGAAAA GGTTTCTATA GCTTCGTTCT ACAACTCTTT GGTAGACTTT GTCAATATCA AATTGGACTT TGATTCATGG CAGACCAATA AGAAATCGAA ATTCAACTCC TCTTCAGGTG AGAATGACTT GCAACAGGTT ATAGATTATA TTAATGGAAG TACACGTGGC AGTACACTTC ATGAAAATGC TTCATACTAC TTTTGTCAAT ACCCATTTTT GATTACTTTG GGAGGTAAAA TCTCAATTTT GGAGTACGAA GCTCGAAGGC AGATGGAACG GAAAGCAGAA GAAGCCTTTA TCAACTCCTT GGACAAACGT GTTGCCCTTG ATATTTACTT CAAGGTTAAA 
GTCAGAAGAG AAAACATCGT GCAAGATTCG ATTTCCGCTA TCAAGAACAA CAGCAATAAC CTCAAGAAAA GTTTGCGAGT TCAGTTCGTA AATGAACCTG GTGTGGATGT AGGTGGGTTG AAAAAGGAAT GGTTCTTACT TTTGACCAGA GCATTGTTCA ATCCACAGGC AGGGATGGTT TACAATATTG AAGACTCCAA CTACTTGTGG TTCAATCTAG TGCCCATAGA AAACTTTGAA ATGTACTATT TGCTAGGTGC TGTATTGGGT TTAGCCATTT ATAATTCCAC TATTTTAGAT CTTCACTTCC CAATGGCACT CTACAAGATT CTTCTCGATA AGCCTGTTGG CTTGGATGAT TACAAGCAGT TGTTTCCTGT GTCGTATGGT AATTTGATGA AGCTCAAGAA ATACTCAACA GAAGAGCTAT TGGCATTGGA TCTCACTTTT GAAGTTTCGT ATCAGGATTT ATTTGGAAAG ACGTATTCGG CTGAATTGAT CAAGGATGGT AGGAAAATAT TTGTCACTGC TGAGACTCGT AAATCATATA TTGAGAAATA CACTCAGTTC TTTTTGCAAG AAGGAATCAA GAAACAAATA ACAGCTTTTT CCAGTGGATT CAAGAATGTT ATTGGTGGCA ATGGACTTTC TCTTTTCCTC CCAGAAGAGA TTCAGTTGTT GCTATGTGGA AGTGAAGAAG GTGGTATAGA TGTTGACGTC TTGAAATCAG TGACAAAATA CGTTGGTTGG AAAACGCCGG ATGATGGTGC TGATTCAACT GTAGTTCAAT GGTTCTGGGA GTACATGTGC GAAATTAACA CCCAGGAACG GAAACGTCTA CTCATGTTTG TTACCGGCTC CGATAGAGTT CCTGCAACAG GGATTCAAAA CCTAAGCTTC AAAATCAGTA GCCAGGGTAA GGATAGCAAC AGACTTCCTG TTGCACATAC GTGTTTCAAC GAGTTAGGCT TATATAATTA TAGTTCTAAG GAAAAGTTGG TCGACAAACT TGTTACAGCG GTCAATGAGA GCGCAGGGTT TGGGTTAAAA TAG
|
Protein sequence | MLRRKSSVVA QNQNLPRNQN HNQSHNQINN AEQNTNSSNS SQISSTIASE PRSTPWRIKD FLKLPSAPKG LNSNTVSTSP TKSNVEQLVA PTTSTSISNT SANSNSAATS DSIPTSVSGC YCCGTLLTYP TKASKFRCSV CNTTNILAVA PENSAEGSNE AVHIISYDYV KKQVEKCLKY LNQSLDKSIH EVFEPLSDYL YDAFKNYHIL SKSFKTRRSS QNQHYHTSKI NYEEIHNTFL LLLKLPTKRP LYNALRGASH LLKRVYVFPK GNDASSYVWV LILMEIPFLS RSLLSPGDDS KAKSMIDVPE IKGLCYDILK RCIGILACIE SVGAINYITS WYANLPTAEF AKKVDLLNLY ITFHLKKYFY IANNPHLLRR TSSTAAYNAS TSGSRTEDSH PTDREYSENV HIKEEIDAMN QETPNFTNQL ALPTSYLGSF PARRNSKKNQ LQEAKIKIYQ YGNDWHIKTA AILLSFYLCA NTYRVEKVSI ASFYNSLVDF VNIKLDFDSW QTNKKSKFNS SSGENDLQQV IDYINGSTRG STLHENASYY FCQYPFLITL GGKISILEYE ARRQMERKAE EAFINSLDKR VALDIYFKVK VRRENIVQDS ISAIKNNSNN LKKSLRVQFV NEPGVDVGGL KKEWFLLLTR ALFNPQAGMV YNIEDSNYLW FNLVPIENFE MYYLLGAVLG LAIYNSTILD LHFPMALYKI LLDKPVGLDD YKQLFPVSYG NLMKLKKYST EELLALDLTF EVSYQDLFGK TYSAELIKDG RKIFVTAETR KSYIEKYTQF FLQEGIKKQI TAFSSGFKNV IGGNGLSLFL PEEIQLLLCG SEEGGIDVDV LKSVTKYVGW KTPDDGADST VVQWFWEYMC EINTQERKRL LMFVTGSDRV PATGIQNLSF KISSQGKDSN RLPVAHTCFN ELGLYNYSSK EKLVDKLVTA VNESAGFGLK
|
| |
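The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not state the convention) and that the reported gene length includes the terminal stop codon, which is consistent with the gene sequence ending in TAG. The function names `gene_length` and `protein_length` are hypothetical helpers written for this illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above (locus tag PICST_66630).
length_bp = gene_length(2734760, 2737612)
print(length_bp)                  # 2853, matching "Gene Length"
print(protein_length(length_bp))  # 950, matching "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length (2853 bp) and Protein Length (950 aa) fields, as expected for a gene marked as not spliced.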