Gene PICST_74933 details


Gene Information

Locus tag: PICST_74933
Symbol: YBU4
ID: 4851297
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 1435625
End bp: 1437968
Gene Length: 2344 bp
Protein Length: 696 aa
Translation table:
GC content: 44%
IMG OID: 640393005
Product: Predicted tubulin-tyrosine ligase
Protein accession: XP_001387504
Protein GI: 126274283
COG category: [R] General function prediction only
COG ID: [COG0496] Predicted acid phosphatase
TIGRFAM ID: [TIGR00087] 5'/3'-nucleotidase SurE
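
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, the coordinates are assumed to be 1-based and inclusive (the convention that reproduces the reported 2344 bp), and the coding estimate assumes one stop codon on top of the 696 codons.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def min_coding_nt(protein_aa: int) -> int:
    """Nucleotides needed to encode a protein, assuming one trailing stop codon."""
    return protein_aa * 3 + 3


gene_len = span_length(1435625, 1437968)
coding_len = min_coding_nt(696)

print(gene_len)               # 2344, matches the Gene Length field
print(coding_len)             # 2091
print(gene_len - coding_len)  # 253 bp of the span are not needed for coding
```

The 253 bp surplus is consistent with the record marking the gene as spliced, although this check alone does not say where the non-coding bases lie.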


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATTAACTACA TTGAACACAT TCAGTTACCA CATGAGTCCC AGAAGACTCT CCATATCGTT 
ATTGAATCTA TACTAAACTC AGATATCTTT TTTTCAATTT TCCAGTTTCA GATTCCACAC
CAATGCACGT TCTCTTAACC AACGACGATG GTCCTTTGGA CGACAACTCA TGTCCATACA
TGAAGTACTT TGTAGACGAG ATCCTCACCA CGACTGACTG GGACCTCTCG ATTGTTGTAC
CGAACGAACA GCGGTCATGG ATCGGAAAGG CTCATTTCGC TGGAAAGACT CTAACAACTA
CATACATCTA CACCAGGCTT CTGACCAGTG CTCCCAACGC CAATATCAAT AGCTTTGAAG
GTCCCTTCAA AACTTCTCAA CCACAGTTTC CACAGCCGGA ATGGCAGGAA TGGGTTTTGG
TCAACTCGAC TCCGGCAGCC TGTGCCGATA TCGGAATCCA TCATGTCTAC AGCAAGAAGA
AGGGCCCCAT AGATCTTGTT CTCAGTGGTC CCAACTTCGG CAAGAACTCG AGTAACTTGT
ATATTTTGGC CAGTGGAACT GTTGGTGCAG CCATGGAGGC TGTGACCCAC GGAGTTAAGG
CTATTGCCTT GAGCTATGCC TTCAACAACC TCGACCACGA CTTCCATATC TTGAAAGAAG
CAGCTAAGAT CTCGGTCAAG TTGATCAAGA AGTTATACGT TCAATTGCAG ACCATGGAAA
ATGTGGATAT CTTTTCTGTT AACGTTCCAT TGATCGAATC GCTAAAGTTG GGATCAACTA
AAATCCACTA TGCTCCCATC TTGAACAACT ACTGGAACTC CATCTACGCT CCACTGGACG
AGCTAAACGA ACATGGACAA CAACAGTATA TGTGGAATCC AGACTTCAAG AAAGTGTACA
AGGACGGTTT GGCTGATCTT ACTCATACTG ATAGCAGAGT TCTTTTGGAG GAAGGAATTA
GCGTCACACC ACTTAAGGCT TCCTTCAACA TCGTGGAGCC ATTTTCTGGT GAAATTACGT
TGGACGATGA TGAAAGCGTA GGAAACTTAG GCAGAAAACT CGTTGCAACT GAAATCGACC
TGAAGGCTAG TAAAGGAAAC AATATAGTAA GTGGAGCTGA ACAGGCGAGG CATCGCTTCT
TGATTACTAT CCCCCAAGAA GCGTATGTAT ACAAGCCATT GGTGGACGCT TTTGAGCAGT
TGCCTGATTT CAGTATTACG ACCGATATCT CACTTTTGAA GAATATTCCT CAAGACGTGA
AGGTATTCCA CTACGGTGAA TACGAAGATA TTGACATTGA CCTCATAGGC GAAAAACCGC
TGCAGTACTA CATACCCTCA TACATCTATA GAAAGGCATT GATACGTAAG CATTTCCTTG
CCAATACCAT CCAGCACTAT GTGGCCAAAC ATCCGGAGTC CGTTCTCATT CAGAATGTCC
CACAAAGCTA CCAGTTGGAA GTAGATTATG CCGAGTTTCT CGATGATGCC TTGGACGACG
CATATGAGTT GCGAGACGAA ATTGAAGCCG GTGGAAGGAC CTGGATTTTG AAGCCCAGCA
TGAGTGACAA GGGCCAGGGC ATAAGATTGT TTAAAACCAT AGACCGGTTG CAAGAAATTT
TTAACTCATT TGAAGAAGGC GATAGCGAAG ATGAAGACGA AGTTAATGAA ACTGAAAACG
GGGTCATCAT CTCACAATTG CGTCATTTCA TCGTCCAGGA ATACAAGTCC CGCCCCTTAC
TTTTGCAAAA TTATGACAAC AAGAAGTTCC ATTTGAGAAC CTACGTCGTT TGCAAAGGTA
ATCTACAAGT GTTTGTGTAC AAGAACATAT TAACTCTTTT CGCCGCAACA GAATACCACG
ACCCCAACGA TGACAATGAC GAAGAACAGG TATCTATGGA TGGACATCTC ACCAATACGT
GTTTACAAGA GACTGGCAAT CCCTTAGTGG TTCCATTCTG GAAGCTAGAG GACACGAAGT
TCAGTGAAGA GCAGAAGAAG AAAGTCTTTG ACCAAGTCCT TGAAACCACA AAAGAATTGT
ATACGGCTGC CACGAGTGTT GACAAAATGA ACTTCCAGCC TATGGATAAT GCCATAGAAA
TATTTGGCAT AGACTTCTTG GTCAACGAAG ACTACACTGT GACCCTTCTT GAGGTCAACT
CATACCCTGA TTTCAAGCAG ACCGGAGATG ACTTGAAGGG CCTTATTTAC GAATTGTTTG
ACAGAGTAGT AAAGGAAGTC GTGAGCCCTC TAGTTACTGG AACACAGTCA GAAACCACGG
AAAGTACATT GGTTTCAGTT TTGTCCCAAT AGTTTATAAT AGTTATCAAT GGAATAAAGC
ATGC
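
The gene sequence above is displayed in space-separated blocks of ten bases, so it has to be joined before any length or composition check. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first displayed line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First displayed line of the gene sequence (six blocks of ten bases).
first_line = "ATTAACTACA TTGAACACAT TCAGTTACCA CATGAGTCCC AGAAGACTCT CCATATCGTT"
seq = clean_sequence(first_line)

print(len(seq))  # 60
print(f"{gc_fraction(seq):.2%}")
```

Passing the whole sequence block through the same two functions would give a length and GC fraction to compare against the Gene Length and GC content fields of the record.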
 
Protein sequence
MHVLLTNDDG PLDDNSCPYM KYFVDEILTT TDWDLSIVVP NEQRSWIGKA HFAGKTLTTT 
YIYTRLLTSA PNANINSFEG PFKTSQPQFP QPEWQEWVLV NSTPAACADI GIHHVYSKKK
GPIDLVLSGP NFGKNSSNLY ILASGTVGAA MEAVTHGVKA IALSYAFNNL DHDFHILKEA
AKISVKLIKK LYVQLQTMEN VDIFSVNVPL IESLKLGSTK IHYAPILNNY WNSIYAPLDE
LNEHGQQQYM WNPDFKKVYK DGLADLTHTD SRVLLEEGIS VTPLKASFNI VEPFSGEITL
DDDESAENSH RFLITIPQEA YVYKPLLPDF SITTDISLLK NIPQDVKVFH YGEYEDIDID
LIGEKPLQYY IPSYIYRKAL IRKHFLANTI QHYVAKHPES VLIQNVPQSY QLEVDYAEFL
DDALDDAYEL RDEIEAGGRT WILKPSMSDK GQGIRLFKTI DRLQEIFNSF EEGDSEDEDE
VNETENGVII SQLRHFIVQE YKSRPLLLQN YDNKKFHLRT YVVCKGNLQV FVYKNILTLF
AATEYHDPND DNDEEQVSMD GHLTNTCLQE TGNPLVVPFW KLEDTKFSEE QKKKVFDQVL
ETTKELYTAA TSVDKMNFQP MDNAIEIFGI DFLVNEDYTV TLLEVNSYPD FKQTGDDLKG
LIYELFDRVV KEVVSPLVTG TQSETTESTL VSVLSQ