Gene PICST_84338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_84338 
SymbolDEG1 
ID4840127 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp321058 
End bp322639 
Gene Length1582 bp 
Protein Length465 aa 
Translation table12 
GC content41% 
IMG OID640391442 
Productpseudouridine synthase 
Protein accessionXP_001385404 
Protein GI150865972 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0101] Pseudouridylate synthase 
TIGRFAM ID[TIGR00071] pseudouridylate synthase I 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.188482 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.052244 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CTTTTACGAT GAGCATCAAT ACCAATTTCA ACTTTTCACT TCCAGCGAGC CTATTACTGG 
GATCCAGCAG TAGTATCAAA TACCAGGTGT ATGCTAAGGT ATAGACCGAC GTGGGCGATG
CTCAAAAGGA CATTTAGCAG CATGACCAAA ACAGAAGCAA GGGTCGATTA TGAAAACTGG
ACTAAAGAAC AGCTTATAGA AAGAATCCAA CAGTTGGAAA ATTCATCCTC GAAATTAAAC
TCAACACTTC CAGTTGCTAG TCCAGTTGTA AACGATGCAG TTGCTTCCGC AAAAGACATA
AGTAGATCAG AGTCTGCTCC TCCAATCATG GACATGGCCA CAGAAGGGTC AAAGAAAAAG
AAAAAGGTTC GAACTTTCGA TATGAACAAA TACAACAAGA GATTTATAGC ATTGAAATTT
GCCTATTTAG GTTGGAACTA CAACGGATTG GCCTACCAGC TGGAGCCAAC ACCATTGCCT
ACTGTTGAAG AAGTCGTTTT GAAAGCTTTG ACGATGTCAA GACTTATTAC TGAACCTACT
CCAGACAAGT GCAAGTTCAG TCGTTGTGGC CGTACTGATA AAGGTGTCAG TGCCATGAAC
CAAGTCATCT CGTTAGTCGT GAGATCCAAC TTAAATGAAG AAGAGCAATT GCTCAAAGAA
AACGACCACA AAGAAATCAA GTACTTGTCC ATCATTAATG CTTTGTTACC TCCAGATATA
AGAATGACAG CTGTCTGTTT AAGACCTCCT CCTAAATTCG ATGCTAGATT CAGTTGCGAC
TATAGACACT ACAGATACTT GTTCAAGAAA CACGATCTTG ATATTGAGCT TATGAATGAG
GCTTGCATAA AATACATCGG ATCCCATGAC TTCCGTAACT TTTGTAAGAT TGACGGATCC
AAACAGATCA CAAACTACGT CAGAGAAGTC TACAGCATGA AAATCATCCA CCTAAAAGAT
GATTTTTATG CTGTTGACTT GAAGGGTTCT GCCTTCCTTT GGCATCAGGT CCGTTGTATG
GTGGCTATAT TGTTCTTGAT TGGTCAGAAG CTTGAAGCTA CCACCATAAT CGAAGACTTG
TTTGACTTGG AAAAATATCC TACTAAGCCA GTCTACGAAA TGGCCAATGA TATTCCCTTG
GTTCTCTACG ATTGTATATA TCCGGAAATG GAATGGCTCT CGCCAATCGG GTCTGAAGGT
ACCATCGAGA AGTTCTACAA ACACTTCGCC ATGTTCAGGG GCCAAGTATT GGACTACCAA
GTTAAGGCTA ACATGATTGG AATTATGGAA CCATTGGTAA TGAAAGATGC TCCTGAAATT
GAGAACACCC AGAAACGTGG AACCATGAAT GTTGGTGATG GTAGTGGTCG TAACTATTCC
AAGTACGTTC CAATCAGCAA GCGTGAAGTA GGCGAAACCG TTGAGGCAAT CAACTCAAGA
CACAAGGAAA AGAAGAGAAA AAGAGCCATT GCACTCAGTG AAGCTAACAG CAGGGCAGAC
AGCGAAGTCA GTAGCATTTT AGAAGAACAA TAGTCCTGTA AATATTGTAT CATAGACTTT
AGAATAGATG GAATCTAACT GG
 
Protein sequence
MLRYRPTWAM LKRTFSSMTK TEARVDYENW TKEQLIERIQ QLENSSSKLN STLPVASPVV 
NDAVASAKDI RSKKKKKVRT FDMNKYNKRF IALKFAYLGW NYNGLAYQSE PTPLPTVEEV
VLKALTMSRL ITEPTPDKCK FSRCGRTDKG VSAMNQVISL VVRSNLNEEE QLLKENDHKE
IKYLSIINAL LPPDIRMTAV CLRPPPKFDA RFSCDYRHYR YLFKKHDLDI ELMNEACIKY
IGSHDFRNFC KIDGSKQITN YVREVYSMKI IHLKDDFYAV DLKGSAFLWH QVRCMVAILF
LIGQKLEATT IIEDLFDLEK YPTKPVYEMA NDIPLVLYDC IYPEMEWLSP IGSEGTIEKF
YKHFAMFRGQ VLDYQVKANM IGIMEPLVMK DAPEIENTQK RGTMNVGDGS GRNYSKYVPI
SKREVGETVE AINSRHKEKK RKRAIALSEA NSRADSEVSS ILEEQ