Gene PICST_83187 details


Gene Information

Locus tag: PICST_83187
Symbol: HUL5
ID: 4838637
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 194535
End bp: 197630
Gene Length: 3096 bp
Protein Length: 977 aa
Translation table: 12
GC content: 39%
IMG OID: 640389952
Product: ubiquitin-protein ligase (E3)
Protein accession: XP_001384337
Protein GI: 150865213
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG5021] Ubiquitin-protein ligase
TIGRFAM ID:
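
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the stated gene length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 194535
end_bp = 197630
protein_length_aa = 977

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Bases needed to encode the protein residues (three bases per residue).
translated_bp = protein_length_aa * 3

# Bases in the gene span that do not encode a residue
# (for example intron sequence or a stop codon; the record marks the gene as spliced).
untranslated_bp = gene_length_bp - translated_bp

print(gene_length_bp)    # 3096
print(translated_bp)     # 2931
print(untranslated_bp)   # 165
```

The 165 bases left over are in line with the record describing the gene as spliced, although the record itself does not list exon boundaries.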


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.346459
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGAACT TTACGGGTCA GACGCACAAA CGTGTGGTTA ATTTGGGCAA CCGTGGGCGT 
AATGGCAGTG GTTCTGGATC CAAGAACTAC TTGGAACAAA CTAGAATACA GAGGCTACAA
CGAGAAGAAC AGCGACAAAA GGAGAAGTCC GGATTGCTAT TGCAATCGTA TATTCGTAGA
CATCTAGACC TTTCTGAGAG TGGGGAGAAA TTGAAACTGG AATGGCTTCA AAACCGTGGC
AATTTTGTAG ATGAAGAGCA ATGGAATTTG TGGATTCTTC AGTTCAACTT TCTTGCCAAA
TGGACTTTGC CTCAGCAGCC GATTGCAGAA TTGTCTTTCC AATTGAAAGT TCTATTTGAG
GATTTACAAC TTAGTAAATT TGACTTATCT CAAAGACAAT TTAGTCAGAT GGCCAGGGCT
CTCACTAGCA CTGTTGAGAG AATAGATAGA CGACGAGATC TTGACCAAAG CCAGAAGCTT
CTGGTTGACG AACAGATAAT CCAGATTATC TGCTTCATTT TAGACAAGTT CGGCTCGAAA
TACGTAGTGG AGTTGCCTGA TATTATCAAC TATTTGTCGT CATTTGTATT AAATACAAAT
GTTTCACAGA ATTTGCAGGC ACAGATTATA GAATTGGTAT TTAAGTTTAG CACTAATAGC
TTGTCGATTT GGAAATTGTT AACTTCGCCA GGTTTGTTTA AAGATATTGG AGACACCAAA
AGATATTTGG AGAGAATCAG GACAAGTGAT TTTGCTAGTG AAGCTTTTGG AGCTGCCAAT
AACTTCGATG GGAATTCACT AGTAAGTGTT TTGGTGAATT TCCTAACAAT TCATACGGAG
AATGAGCCTT TTATCAACGA GGATTTGTTC TTTATAGCCA GCGTTTTTGA GCGTATTTCG
TTTTCGATAC ATAGGATAGA TGAAGACGTA GATGAAGACG ACTTCGAAGA TTTTCAATCC
AGATCAACAG AAGACGTTCT TGTGGTGTCT CAGAATTGCA TTGATACAAT AAGTCTTTTG
TACTCTACCA CTTTCATCAG ACAAATCTTT GATCATTTCA CAGCGCGTGA AGATGACAAC
AAGCTCTCTT GGGCAGCATT ACAGATCATT TCCACGTTGA TATTCTTCTA CCCTGCAGCC
AAAAGTAAAC TTTGTATGCT TATCACTATC ATTCCCAATT CATATAGACG CTTTTTTGCC
CAGTTGGAAT CTCACGAAAT CTACCAAAAA CTCATTTCTC AAACAGTAGA AACTGGCAAA
GATTTCCTTA TAAAACCACA GATCGAAGAG TTATACACAA ATATGGATGC TCACTCGATT
GGATTTTTCT GGAAAACGTT ATATACTTTC CAGGAACTTT ACTCCTACTG GTTGATAGTT
TCCAACGACT TGGAGAGTTT CCAAGAGGAC AAACTTAATC TCAAGGATGT ATCTCAGTTT
CTCATTTTCC TGAGATCGTT GTGTTTGACA TTGATTTTCT GTAACGACAA ACCGGATTAT
TTCCATCAGT ACGAAAAGTT GAAGGATATT TCCATTTCAT TGTTAAACCA GTTGTATATG
AAGAATTTGC GCTTGCGATT TTTGCCTGAT GATTTCTGGA AATTGAAGGT GCTCAAGTTC
AACATCGACA GTTTGTTACA GATAATAGCT GAGGAAGAAG AGAAACGCAT TGAAGAAATG
GACATTGACG AGGATGACTC CTTTCAGTAT CTGCGAAGGA GAAGACTGTC TTTTGACGGA
ACGTTCAACT CAATCAAGAG ATCTAAAAAC ATGTCGAGTG CTTCTGCTGA AACTTTGGCA
AAGTTGGAGA TGTTGAAAAA AGTACCCTTC TTCATTGAAT TCAAAGATCG TGTTAGGGTA
TTCCAATCTT TGATTGAATT GGATAGACAG CGTAATCTCT CTGTTGGTCC ATTCTTCGAT
TCAAAGCTTG GAGCGAACAT TAGAAGAGAG TTTTTGCTTG AAGATGCTTA TAACAGCTTC
CATAAGGCTG GTTCCAATTT CAAGAATCGT ATTCAGGTCA GTTTCTTCAA TGAATATGGT
CCAGAAGCTG GAATTGACGG TGGTGGCATA ACTAAAGAAT TCCTCACCAG TGTAGTCAGA
GAAGGATTTG ATCCCTCTAA TGAGCTTGAA TTGTTTAAGG AAACAATCAG TGACAATCAG
ATATATCCGA ACGACGACAT CCACAAGTCA ATCACTGTAG GTGATGATCC ACAGCTTCAG
CAGAAGAAGC TTCTGTATTT GAAGTTTCTT GGAAGCATTG TAGGCAAATG TCTCTATGAG
AACGTCTTGA TTGATGTCTC ATTTGCACCT TTTTTTTTAA ACAAGTGGTG CAATGACAAT
ATGAAAAACT CCATCAACGA CTTGAACTAC TTGGACCATG AGTTGTTCAT GGGTTTGATG
AAGTTAGTGA AGATGCCGGA ACAGGAGTTG GATAGCTTGG ACTTAAATTT CACTGTTAAT
GAAACTTTGA AAGGAAAGAA TTATGTATTT GATTTGCTTC CTCCAAATGG AGAGAATACC
AAATTGAATC TGTCGAACAA ATTGAGTTAC ATACATCAAA TTTCCAACTT CAAATTGAAC
CAGTCGTTGC ACATCCAGAC GAAGTACTTT ATTGAAGGTC TTTTTGGACT TATCTCGTCT
AGCTGGTTGA GCATGTTTGA TTCGTTTGAG CTCCAGATGC TTATATCTGG TGGCCAAAAC
GATATCAACA TTTTGGACTG GAAGAATAAT GTAGAATATG GAGGCTATTT GGACAGCGAC
ATCACTGTTC GTTACTTCTG GGAAGTGGTA GCGGAAATGA CTCCCGATGA GAGGTTTGCC
CTTATCAAAT TTGTCACTTC CGTTAGTAGA GCACCACTCC TTGGATTTGG ATCTTTGAAT
CCAAAGTTTG GAATTCGAAA TTCAGGAAGC GACACTTCCA GATTACCTAC TGCTTCAACT
TGTGTGAACT TGTTGAAGCT TCCAGACTAT AGGAACAAGG AACTTATAAG GTCCAAGTTG
TTGTATGCCA TAGAAGCCGA AGCTGGCTTT GATTTGAGTT AGTCCATAAT CTTACAGTTG
GTATGTAGTG AAAACTTAAT GTACAATTGA TATGAT
 
Protein sequence
MLNFTGQTHK RVVNLGNRGR NGSGSGSKNY LEQTRIQRLQ REEQRQKEKS GLLLQSYIRR 
HLDLSESGEK LKSEWLQNRG NFVDEEQWNL WILQFNFLAK WTLPQQPIAE LSFQLKVLFE
DLQLSKFDLS QRQFNLDQSQ KLSVDEQIIQ IICFILDKFG SKYVVELPDI INYLSSFVLN
TNVSQNLQAQ IIELVFKFST NSLSIWKLLT SPGLFKDIGD TKRYLERIRT SDFASEAFGA
ANNFDGNSLV SVLVNFLTIH TENEPFINED LFFIASVFER ISFSIHRIDE DVDEDDFEDF
QSRSTEDVLV VSQNCIDTIS LLYSTTFIRQ IFDHFTARED DNKLSWAALQ IISTLIFFYP
AAKSKLCMLI TIIPNSYRRF FAQLESHEIY QKLISQTVET GKDFLIKPQI EELYTNMDAH
SIGFFWKTLY TFQELYSYWL IVSNDLESFQ EDKLNLKDVS QFLIFSRSLC LTLIFCNDKP
DYFHQYEKLK DISISLLNQL YMKNLRLRFL PDDFWKLKVL KFNIDSLLQI IAEEEEKRIE
EMDIDEDDSF QSKNMSSASA ETLAKLEMLK KVPFFIEFKD RVRVFQSLIE LDRQRNLSVG
PFFDSKLGAN IRREFLLEDA YNSFHKAGSN FKNRIQVSFF NEYGPEAGID GGGITKEFLT
SVVREGFDPS NELELFKETI SDNQIYPNDD IHKSITVGDD PQLQQKKLSY LKFLGSIVGK
CLYENVLIDV SFAPFFLNKW CNDNMKNSIN DLNYLDHELF MGLMKLVKMP EQELDSLDLN
FTVNETLKGK NYVFDLLPPN GENTKLNSSN KLSYIHQISN FKLNQSLHIQ TKYFIEGLFG
LISSSWLSMF DSFELQMLIS GGQNDINILD WKNNVEYGGY LDSDITVRYF WEVVAEMTPD
ERFALIKFVT SVSRAPLLGF GSLNPKFGIR NSGSDTSRLP TASTCVNLLK LPDYRNKELI
RSKLLYAIEA EAGFDLS
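
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The following is a minimal sketch of translating a stretch of the gene sequence under that table. The function name `translate` and the codon-table construction are illustrative, and because the gene is spliced, only stretches that lie within a single exon and start in frame will match the protein sequence.

```python
from itertools import product

BASES = "TCAG"

# Amino acids for NCBI translation table 12, listed in TCAG codon order.
# It differs from the standard code at one codon: CTG -> S (Ser) instead of L (Leu).
TABLE_12_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

# Map each codon (TTT, TTC, TTA, TTG, TCT, ...) to its amino acid.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; spaces are ignored, unknown codons give 'X'."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 60 bases of the gene sequence above.
first_60 = "ATGTTGAACT TTACGGGTCA GACGCACAAA CGTGTGGTTA ATTTGGGCAA CCGTGGGCGT"
print(translate(first_60))  # MLNFTGQTHKRVVNLGNRGR

# An in-frame stretch from the fourth sequence line containing a CTG codon.
print(translate("TTGAAACTGGAATGGCTT"))  # LKSEWL
```

The first output matches the opening 20 residues of the protein sequence, and the second matches the `LKSEWL` stretch in its second line, where a standard-code translation would give `LKLEWL`.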