Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_83187 |
Symbol | HUL5 |
ID | 4838637 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 194535 |
End bp | 197630 |
Gene Length | 3096 bp |
Protein Length | 977 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640389952 |
Product | ubiquitin-protein ligase (E3) |
Protein accession | XP_001384337 |
Protein GI | 150865213 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5021] Ubiquitin-protein ligase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.346459 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGAACT TTACGGGTCA GACGCACAAA CGTGTGGTTA ATTTGGGCAA CCGTGGGCGT AATGGCAGTG GTTCTGGATC CAAGAACTAC TTGGAACAAA CTAGAATACA GAGGCTACAA CGAGAAGAAC AGCGACAAAA GGAGAAGTCC GGATTGCTAT TGCAATCGTA TATTCGTAGA CATCTAGACC TTTCTGAGAG TGGGGAGAAA TTGAAACTGG AATGGCTTCA AAACCGTGGC AATTTTGTAG ATGAAGAGCA ATGGAATTTG TGGATTCTTC AGTTCAACTT TCTTGCCAAA TGGACTTTGC CTCAGCAGCC GATTGCAGAA TTGTCTTTCC AATTGAAAGT TCTATTTGAG GATTTACAAC TTAGTAAATT TGACTTATCT CAAAGACAAT TTAGTCAGAT GGCCAGGGCT CTCACTAGCA CTGTTGAGAG AATAGATAGA CGACGAGATC TTGACCAAAG CCAGAAGCTT CTGGTTGACG AACAGATAAT CCAGATTATC TGCTTCATTT TAGACAAGTT CGGCTCGAAA TACGTAGTGG AGTTGCCTGA TATTATCAAC TATTTGTCGT CATTTGTATT AAATACAAAT GTTTCACAGA ATTTGCAGGC ACAGATTATA GAATTGGTAT TTAAGTTTAG CACTAATAGC TTGTCGATTT GGAAATTGTT AACTTCGCCA GGTTTGTTTA AAGATATTGG AGACACCAAA AGATATTTGG AGAGAATCAG GACAAGTGAT TTTGCTAGTG AAGCTTTTGG AGCTGCCAAT AACTTCGATG GGAATTCACT AGTAAGTGTT TTGGTGAATT TCCTAACAAT TCATACGGAG AATGAGCCTT TTATCAACGA GGATTTGTTC TTTATAGCCA GCGTTTTTGA GCGTATTTCG TTTTCGATAC ATAGGATAGA TGAAGACGTA GATGAAGACG ACTTCGAAGA TTTTCAATCC AGATCAACAG AAGACGTTCT TGTGGTGTCT CAGAATTGCA TTGATACAAT AAGTCTTTTG TACTCTACCA CTTTCATCAG ACAAATCTTT GATCATTTCA CAGCGCGTGA AGATGACAAC AAGCTCTCTT GGGCAGCATT ACAGATCATT TCCACGTTGA TATTCTTCTA CCCTGCAGCC AAAAGTAAAC TTTGTATGCT TATCACTATC ATTCCCAATT CATATAGACG CTTTTTTGCC CAGTTGGAAT CTCACGAAAT CTACCAAAAA CTCATTTCTC AAACAGTAGA AACTGGCAAA GATTTCCTTA TAAAACCACA GATCGAAGAG TTATACACAA ATATGGATGC TCACTCGATT GGATTTTTCT GGAAAACGTT ATATACTTTC CAGGAACTTT ACTCCTACTG GTTGATAGTT TCCAACGACT TGGAGAGTTT CCAAGAGGAC AAACTTAATC TCAAGGATGT ATCTCAGTTT CTCATTTTCC TGAGATCGTT GTGTTTGACA TTGATTTTCT GTAACGACAA ACCGGATTAT TTCCATCAGT ACGAAAAGTT GAAGGATATT TCCATTTCAT TGTTAAACCA GTTGTATATG AAGAATTTGC GCTTGCGATT TTTGCCTGAT GATTTCTGGA AATTGAAGGT GCTCAAGTTC AACATCGACA GTTTGTTACA GATAATAGCT GAGGAAGAAG AGAAACGCAT TGAAGAAATG GACATTGACG AGGATGACTC CTTTCAGTAT CTGCGAAGGA GAAGACTGTC TTTTGACGGA ACGTTCAACT CAATCAAGAG ATCTAAAAAC ATGTCGAGTG CTTCTGCTGA AACTTTGGCA AAGTTGGAGA TGTTGAAAAA AGTACCCTTC TTCATTGAAT TCAAAGATCG TGTTAGGGTA TTCCAATCTT TGATTGAATT GGATAGACAG CGTAATCTCT CTGTTGGTCC ATTCTTCGAT TCAAAGCTTG GAGCGAACAT TAGAAGAGAG TTTTTGCTTG AAGATGCTTA TAACAGCTTC CATAAGGCTG GTTCCAATTT CAAGAATCGT ATTCAGGTCA GTTTCTTCAA TGAATATGGT CCAGAAGCTG GAATTGACGG TGGTGGCATA ACTAAAGAAT TCCTCACCAG TGTAGTCAGA GAAGGATTTG ATCCCTCTAA TGAGCTTGAA TTGTTTAAGG AAACAATCAG TGACAATCAG ATATATCCGA ACGACGACAT CCACAAGTCA ATCACTGTAG GTGATGATCC ACAGCTTCAG CAGAAGAAGC TTCTGTATTT GAAGTTTCTT GGAAGCATTG TAGGCAAATG TCTCTATGAG AACGTCTTGA TTGATGTCTC ATTTGCACCT TTTTTTTTAA ACAAGTGGTG CAATGACAAT ATGAAAAACT CCATCAACGA CTTGAACTAC TTGGACCATG AGTTGTTCAT GGGTTTGATG AAGTTAGTGA AGATGCCGGA ACAGGAGTTG GATAGCTTGG ACTTAAATTT CACTGTTAAT GAAACTTTGA AAGGAAAGAA TTATGTATTT GATTTGCTTC CTCCAAATGG AGAGAATACC AAATTGAATC TGTCGAACAA ATTGAGTTAC ATACATCAAA TTTCCAACTT CAAATTGAAC CAGTCGTTGC ACATCCAGAC GAAGTACTTT ATTGAAGGTC TTTTTGGACT TATCTCGTCT AGCTGGTTGA GCATGTTTGA TTCGTTTGAG CTCCAGATGC TTATATCTGG TGGCCAAAAC GATATCAACA TTTTGGACTG GAAGAATAAT GTAGAATATG GAGGCTATTT GGACAGCGAC ATCACTGTTC GTTACTTCTG GGAAGTGGTA GCGGAAATGA CTCCCGATGA GAGGTTTGCC CTTATCAAAT TTGTCACTTC CGTTAGTAGA GCACCACTCC TTGGATTTGG ATCTTTGAAT CCAAAGTTTG GAATTCGAAA TTCAGGAAGC GACACTTCCA GATTACCTAC TGCTTCAACT TGTGTGAACT TGTTGAAGCT TCCAGACTAT AGGAACAAGG AACTTATAAG GTCCAAGTTG TTGTATGCCA TAGAAGCCGA AGCTGGCTTT GATTTGAGTT AGTCCATAAT CTTACAGTTG GTATGTAGTG AAAACTTAAT GTACAATTGA TATGAT
|
Protein sequence | MLNFTGQTHK RVVNLGNRGR NGSGSGSKNY LEQTRIQRLQ REEQRQKEKS GLLLQSYIRR HLDLSESGEK LKSEWLQNRG NFVDEEQWNL WILQFNFLAK WTLPQQPIAE LSFQLKVLFE DLQLSKFDLS QRQFNLDQSQ KLSVDEQIIQ IICFILDKFG SKYVVELPDI INYLSSFVLN TNVSQNLQAQ IIELVFKFST NSLSIWKLLT SPGLFKDIGD TKRYLERIRT SDFASEAFGA ANNFDGNSLV SVLVNFLTIH TENEPFINED LFFIASVFER ISFSIHRIDE DVDEDDFEDF QSRSTEDVLV VSQNCIDTIS LLYSTTFIRQ IFDHFTARED DNKLSWAALQ IISTLIFFYP AAKSKLCMLI TIIPNSYRRF FAQLESHEIY QKLISQTVET GKDFLIKPQI EELYTNMDAH SIGFFWKTLY TFQELYSYWL IVSNDLESFQ EDKLNLKDVS QFLIFSRSLC LTLIFCNDKP DYFHQYEKLK DISISLLNQL YMKNLRLRFL PDDFWKLKVL KFNIDSLLQI IAEEEEKRIE EMDIDEDDSF QSKNMSSASA ETLAKLEMLK KVPFFIEFKD RVRVFQSLIE LDRQRNLSVG PFFDSKLGAN IRREFLLEDA YNSFHKAGSN FKNRIQVSFF NEYGPEAGID GGGITKEFLT SVVREGFDPS NELELFKETI SDNQIYPNDD IHKSITVGDD PQLQQKKLSY LKFLGSIVGK CLYENVLIDV SFAPFFLNKW CNDNMKNSIN DLNYLDHELF MGLMKLVKMP EQELDSLDLN FTVNETLKGK NYVFDLLPPN GENTKLNSSN KLSYIHQISN FKLNQSLHIQ TKYFIEGLFG LISSSWLSMF DSFELQMLIS GGQNDINILD WKNNVEYGGY LDSDITVRYF WEVVAEMTPD ERFALIKFVT SVSRAPLLGF GSLNPKFGIR NSGSDTSRLP TASTCVNLLK LPDYRNKELI RSKLLYAIEA EAGFDLS
|
| |