Gene PICST_62786 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_62786 
SymbolPRP40 
ID4840034 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp940708 
End bp942162 
Gene Length1455 bp 
Protein Length478 aa 
Translation table12 
GC content38% 
IMG OID640391349 
Productpre-mRNA processing protein 
Protein accessionXP_001385539 
Protein GI150866066 
COG category[A] RNA processing and modification 
COG ID[COG5104] Splicing factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.344613 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.715644 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGAGT GGGAGAAAGT GACTGACAAC GAAGGTAGAG TATATTACTA CAATTCCAAA 
ACGAAGGAAA CAAGTTGGAC CCTCCCCCAA TCAGAATCTT CAGTTTCCAG TGGTTCCAAA
TGGCAGGAAT ATGCTACCGA TGATGGAAGA AAATATTACT ATAACGAGTC CACAGGCGAG
ACTACGTGGG AGATGCCGCA GGAAATGGAG AAAGCTGAAG ATAAAAGAAA CGTCGATGAT
GTAAAAGAAA AGGACGAACA AGTTGCTTCG AAATCAACTG AAGAGAGCCA ATTAGATCTT
CGACTAGCTT CAGAACCAAT CAAAAAGTCA GACTTGGTAA ACCCACCAAA GGATGATTCA
TATCCCGAAT CAGAGGCATT TGTGGAGATG CTTCGTTCTA ATAAGGTTGA TTCGACTTGG
TCTTTTCAAG CAGTAATGTC GAAGTTTATT GATGACCCCA AGTATTGGGC CATTCCTGAT
GCATTGGAGC GGAAGAAATT ATACGACGAA TATCTTGTGA CGAGATTCAA AGAAGATTTA
TCCAACAAGA GTTTATTGGT GGAGACATTC AAAAAGAACT TTGTCGAAAC TCTAAAGAAA
TACGAAGAAA ATGGTAGACT TCTGCGGAAT AGTAGATGGA TCTCAGTAAA AAAGTTACTT
ATCGCTGAAG ACAATCCAAT CTTCAAGCAT TCCATTTTGT CAGATGCTGA GATAGCGGAA
ATATATTATG AATATATCTC CAGACTTAAG AAGCAATATG AAGAAGAATT GTCGAAAAAC
AAGGATCGTG CATTATCTGA ACTTGAATCA TACCTTACCC AAATTAATCC CAACATAGTA
TCTAGCACAA GTAATTGGCA GGAATTACTT GAAAACCTCA AGGCAGATGC CAGGTTCAGG
GCTAACAAGC ATTTCAATGT ACTCAGTGAC GTAGATTTAC TTGAAATGTA TGAGACAAAG
ATATACCCGA CTATCATACA AAAAATTAAG AGCGAGATTG ATGACGTTCA GAAAAAGAAT
TACCGATCAG ACAGGAAGGC AAGACAAAAG TACAAGGCAT TATTGAAGAC ACTCGATATC
AATGCAAATT CTAACTTCAA AGACTTTCTC TACATTCTTG AGAATGATGA TTCATTTATA
GAGCTTTGTG GAAGAAATGG GTCTACAGCA CTCGAGCTCT TTTGGGACAT CGTCGATGAG
AAATCGCAAG TCTTGAAATT GAAAATGTAC TTAGTGGAAT CTGTTTTGCT CGATTTGAAG
AAGGAAGACT CTACTTTAAC AAAGTCCAAG ATACTACTGT CAGAGAATAA TTTCATAGAA
TTTTTGTCCA ATTCTAGTGA CCAGAGAATC GAGAATCTAG ACATTGACCT TAATGATGCT
AATGAAACAG AGGTATTGTA TGGGGCATTG AAAAGAGAGT TTGAAGCTCA ACAAGAAAAG
AGACGCGTTC GCTTC
 
Protein sequence
MSEWEKVTDN EGRVYYYNSK TKETSWTLPQ SESSVSSGSK WQEYATDDGR KYYYNESTGE 
TTWEMPQEME KAEDKRNDEQ VASKSTEESQ LDLRLASEPI KKSDLVNPPK DDSYPESEAF
VEMLRSNKVD STWSFQAVMS KFIDDPKYWA IPDALERKKL YDEYLVTRFK EDLSNKSLLV
ETFKKNFVET LKKYEENGRL SRNSRWISVK KLLIAEDNPI FKHSILSDAE IAEIYYEYIS
RLKKQYEEEL SKNKDRALSE LESYLTQINP NIVSSTSNWQ ELLENLKADA RFRANKHFNV
LSDVDLLEMY ETKIYPTIIQ KIKSEIDDVQ KKNYRSDRKA RQKYKALLKT LDINANSNFK
DFLYILENDD SFIELCGRNG STALELFWDI VDEKSQVLKL KMYLVESVLL DLKKEDSTLT
KSKILSSENN FIEFLSNSSD QRIENLDIDL NDANETEVLY GALKREFEAQ QEKRRVRF