Gene PICST_82941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_82941 
SymbolPRP39 
ID4838138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp822386 
End bp824525 
Gene Length2140 bp 
Protein Length707 aa 
Translation table12 
GC content36% 
IMG OID640389453 
Productpre-mRNA splicing factor 
Protein accessionXP_001383792 
Protein GI150864815 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.630314 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CAAATTATAG TCCAAGATGT CCCATATAAA TTTTAGGTTC CACCAAGATT TGAATTTAGA 
CGCAATTTCC TTTGGCGGAG CTTCCAAATC TGATGCAGAA GTCAAAGAAG TAGACAACTT
AGACAAGCTC CATAATGATA TTCGAAGGAG CCCCAATGAT CTCACCAAAT GGGATAAGCT
CTTTCAGCTG TTTGAGAGGA CATTCACAGT GAAATTTGAA GGCAAGCCTG ACAAAGTTTC
GACCCAATTC AAGCTCTTAG TGACAAAAAC ATATGCTTCA TTACTTTCTA GATTTCCATA
TTTGGCTTCT TACTGGAAAT CATGGCTGAT TTTTGCATTC AAACTCAGTG GTACCAAAGA
GTCAATCGAG GTACTAGAGA AGTCTGTAAT TGGATTTCCT TATTCGGTTG AATTATGGAC
AGACTATATC AGTGCTTTGA TCTTAACATA TGGGAACGAT CCAGAGAAGT TGAGTTTTAT
CAGAGCCCAA TATAGTGAGG CACTTCGTTT AAATGGATTA AATTTTCTAT CTCATCCATT
ATGGGATAAA GTGATTGAAT TTGAAACTGG AATTGGCGAA AAATCAGTTA TAGTAGGATT
ATATCTCAGA GTCACCAAGA TTCCACTTTA TCAGTATGCT CAATACTACA ATAGCTTTAC
ACAAATCAAT AAGAATTACG ATATTACGGA TGTGATACCT TCTCTTGAGT TGGCTGAATA
TGTGAAAAGA TTCAACAAGA CAGATGTTAC TGAATTGACT CTTGGTGAAA AGAATCAAGT
TGTAGATGAT CACACTGACA TTATCTTTAC ATCGACTCAA GAACAGGTGA CTGATAAATG
GTCTCATGAG TCGTCTATAT TCATTCATGA TTTCTCTCTT GACAGGCTCG ACGAAATTGC
GAAGGAAAAG GAAATATGGA TCAAGTACTT GGATCACGAG ATTTCCAAGT ATAAAGTAAG
CCTGGCCATA GATCAATTTG ATAATGTAGC AAATATTTTT GAGAGAGCAC TAGTACCCAA
CTATTACAAT GAAGGAATAT GGCTCAAGTA TCTTGCTTTT ATCAACATCC TGGAATTGGA
AGATGAAGTC AAATACGAGA AAGCTAAAGC TATTTACCTA AGAGCAATTT CAGGTTTACC
AGTAGAAAGT ACTGTTCTTA GATCTTTGTA CCCGAAGTTT CTTATGAAGT ACAAGCATTT
AGACATTGCC AAGAGCTACT TATACGATTA TTTGAAGTTG TTTGGAGGTC GTGGAAATAG
ATACTTCAAA CTGCAGTATC TACAAACAGT CCAAGACACA GTAGAGGTTT GGGAGAAATC
AGAGAATAGC AAAGATTACC TTGCAAAATT GCAGACTATT GTTGACGAAT ATTTCTCTTT
ACACAATGCA AAGTCAATAA AAAAGGATAG CAAGGGAGTA AAAGGTGAAT CTGCAGATGC
GTTGTTTGTT TTGAATTTGC TTAATGATGA AGCTATCACT ATAATCACTG TTGCGTATTT
GAAGGAATTG TATGCTCAGA CAGATTCAGT ATCGAGAATT AGGGAAATGT TCAATTTATT
ATACAAAGAA AATGCTTTCA AAAAATCTGT TTTGTTTTGG AAATACTTCT TGAATTTTGA
AAGATTACAC GGACAAACCC TTCACAACTT GAGAATGATA ATAAATTATG TCAAAACTGA
AACTCAACTT CCCAAAGCTA TTGTAGATGC TTTCATCACA ATCGAATATG ATATCATTGG
CGCAAATTTG GACTCGGCTG TTGAACAGCA TAGAGCAGGA GTCCCTAGAT CATCATTAGA
AGATTTAATA AAGAAGGATA TGGAAACGTC ACTTTCCTTA ATTCGTAATG CCTCCGCCAG
AAAAAGGCTA GCCAATTCTA ATTATATTGT TAAAGACGCT GAAGATTTGA AGTCTGCTCA
CAAGACAAAT TTCAACAGGG AAGCAGAGCT TTTAAAGATC ACCAGAAAGC ACATTGGCCA
TCCAGGAATA TTGATCGATG CTATTCCTGA TATAACCAAC AAGTTCATGA AGGAAGGAAA
CGATGTTTCT TTGCTAGACT CAAAGCTTGT TGTACCTTCG TTTCCAACAT TTAAAAATGC
CGAAAAGGCG AATGCATCGA TCAACTATCC AAAAGCATAA
 
Protein sequence
MSHINFRFHQ DLNLDAISFG GASKSDAEVK EVDNLDKLHN DIRRSPNDLT KWDKLFQSFE 
RTFTVKFEGK PDKVSTQFKL LVTKTYASLL SRFPYLASYW KSWSIFAFKL SGTKESIEVL
EKSVIGFPYS VELWTDYISA LILTYGNDPE KLSFIRAQYS EALRLNGLNF LSHPLWDKVI
EFETGIGEKS VIVGLYLRVT KIPLYQYAQY YNSFTQINKN YDITDVIPSL ELAEYVKRFN
KTDVTELTLG EKNQVVDDHT DIIFTSTQEQ VTDKWSHESS IFIHDFSLDR LDEIAKEKEI
WIKYLDHEIS KYKVSSAIDQ FDNVANIFER ALVPNYYNEG IWLKYLAFIN ISELEDEVKY
EKAKAIYLRA ISGLPVESTV LRSLYPKFLM KYKHLDIAKS YLYDYLKLFG GRGNRYFKSQ
YLQTVQDTVE VWEKSENSKD YLAKLQTIVD EYFSLHNAKS IKKDSKGVKG ESADALFVLN
LLNDEAITII TVAYLKELYA QTDSVSRIRE MFNLLYKENA FKKSVLFWKY FLNFERLHGQ
TLHNLRMIIN YVKTETQLPK AIVDAFITIE YDIIGANLDS AVEQHRAGVP RSSLEDLIKK
DMETSLSLIR NASARKRLAN SNYIVKDAED LKSAHKTNFN REAELLKITR KHIGHPGILI
DAIPDITNKF MKEGNDVSLL DSKLVVPSFP TFKNAEKANA SINYPKA