Gene PICST_46989 details

Gene Information

Locus tag: PICST_46989
Symbol: PRP31
ID: 4839069
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 1120984
End bp: 1122681
Gene Length: 1698 bp
Protein Length: 544 aa
Translation table: 12
GC content: 40%
IMG OID: 640390384
Product: splicing factor
Protein accession: XP_001385225
Protein GI: 150865843
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1498] Protein implicated in ribosomal biogenesis, Nop56p homolog
TIGRFAM ID:
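
As a quick consistency check on the coordinates in this record, the gene length should equal the inclusive span End bp - Start bp + 1. The sketch below is illustrative only: the function name `span_length` is hypothetical, and the comparison with the protein length assumes that the 1698 bp figure covers the genomic span including exactly one stop codon.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of an inclusive, 1-based coordinate span."""
    return end_bp - start_bp + 1


# Start bp and End bp as listed in the record.
gene_len = span_length(1120984, 1122681)

# 544 aa plus one stop codon (assumption: a single stop codon is included).
coding_len = 3 * (544 + 1)

# Bases in the span that are not accounted for by the coding sequence.
noncoding = gene_len - coding_len

print(gene_len, coding_len, noncoding)  # 1698 1635 63
```

The 63 bp difference would be consistent with the record marking the gene as spliced, although the record itself does not list intron coordinates.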


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.192868
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAGACT ATGAGAAAGA ATTGCTAGCT GACTTCGACA GTGACAGCGA CGTAAGCTTG 
GAAGAGGAAC CCTTGGTTGA AAATTCGACC GAAAACGAGG CAAAATCGTT TGAAAACCTC
AATGGGCTCC AGGAAAATGA CTTTATTCAC CAACAGAATG GATTCAATAG CGGTGAAAAT
TCAACCAACA ACCACGAAAA AACATTTTTT ACACAACTAA GCGAGTTGAT TGCCTCCAAT
ACCGTAGCAG GTTCACTTTC GGCGATTCTA AGTGAGCCAG ACATAAGCAG CATTGAAGAT
ATCACTGTCT TCTCGAAGGT ATACCCACTT ATTCCTCAAT TGAAGAAGCA TATAGAGTTG
TATTCCAACG AGGAGACAAC TGATTTTTTG GAACTCCTAT CTTCTATTGA TGATAGTGAA
GACCAATCGG AAGAGTACAA GTTCATTCTT TTGGTAAACG AGCTTCTGGG TATCATTAAT
CAGGAAATTA TAGCCTACCA TCAGCTTCTC AAGACTCAGT ATAAAGTAGT GTTTCCAGAG
CTTGAAACGT TGGTGCTTAA CCCCATAGAT TATGCTCGTA TTATAGCGAT AATCAAACAG
GACTTGAAGA ATATCCGTTC ATATGACGAA CAAATGAAGG CGATAGTGTC CAATGAAAAG
ATTCTTGTCA TTATAATGGC GGCTCTTCAG CAGCTAGGCC AACAATTTGT ACTCAATGAT
AAAGATATGA ACAGCATAAT TGATTGCTGT GTCATTTTGC TTGAATTATA TGAAATATTA
CAGCTTCTAT CGAACTTCAT AACTCAAAAG CTCACTAAGT TTGCACCTAA TGTGAGTGCT
ATTGTCGGTT CTATTACAAC TTCGCAATTG TTAATAGCAA CAGGTTCTTT AAAACTGTTG
GCCATGACTC CTTCGTGCAA CTTGGCGTCC TTAGGAATCA GAGACCTTTC ATCAAAGACG
AAATCCAAAT CTAGAACCGT ACGGCAAACA GGCTATTTAT ATCATTCTGA AGTTGTAAAG
TATCTTCCTG AGGATATAGT TCGTTCGACA ATGCGTATAG TAAGCGGAAA AGTGATTTTG
GCCGCTCGTG TAGATTTGGC AGGCTCTTGT CCTGATGGTT CCATTGGTCA TACGTATTTG
GAAGAGATTA GGAAGAAGAT CGACAAGCTC TTGACTCCTC CTGAACACCA ACCCGACAAG
GCATTGCCTG CTCCAGTAGA TGTGAAATCA AAAAAGAGAG GAGGTAGACG ATTCAGGAAG
ATGAAGGAAA GGTTCCAGAT GTCTGATTTA CGCCGAGCCC AGAACAAGAT GGAATTCGGC
AAGGAAGAAG ATTCTGTGAC AGACAGTTTT GGTGAAGAAA TTGGATTGGG TATGAGCAGA
ACGAATGGAG GCAGTGGAAG AATTGGAGAG ATTAGAGTGA ATACTAATAC TGGAGCCAGA
ATGTCAAAGG GCATGGTTCA CAGATTACAG AAACATGAAC AGAGTGCCAA AATTCAGAGA
ATTGACAAGG GTATATTTGA CCAAGACTTT GACAGCATTC TTTTGGTAAA CCCTAGTAGT
AAAAAAAGCA GCGAGAACAA GCTCAATGGT TCAAGTAGTC TGACAATTGG AAGCAAGTGG
TTTACAGGAA TGAGCAAGAG GAAAAATGAG GACGATGGTG GCAACGACAA GAAGAGACAA
CAGAATAGTG TATTATAG
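
The gene sequence is printed in blocks of 10 bases, so it has to be joined before any computation such as the GC content reported in the record. A minimal sketch, assuming the blocks are separated only by whitespace; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tooling.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGCAAGACT ATGAGAAAGA ATTGCTAGCT GACTTCGACA GTGACAGCGA CGTAAGCTTG"
seq = clean_sequence(first_row)

print(len(seq))                     # 60 bases per row
print(round(gc_fraction(seq), 2))   # GC fraction of this row only
```

Applying the same two functions to all rows of the gene sequence would give a value to compare against the record's GC content field.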
 
Protein sequence
MQDYEKELLA DFDSDSDVSL EEEPLVENST ENEAKSGENS TNNHEKTFFT QLSELIASNT 
VAGSLSAILS EPDISSIEDI TVFSKVYPLI PQLKKHIELY SNEETTDFLE LLSSIDDSED
QSEEYKFILL VNELSGIINQ EIIAYHQLLK TQYKVVFPEL ETLVLNPIDY ARIIAIIKQD
LKNIRSYDEQ MKAIVSNEKI LVIIMAALQQ LGQQFVLNDK DMNSIIDCCV ILLELYEILQ
LLSNFITQKL TKFAPNVSAI VGSITTSQLL IATGSLKSLA MTPSCNLASL GIRDLSSKTK
SKSRTVRQTG YLYHSEVVKY LPEDIVRSTM RIVSGKVILA ARVDLAGSCP DGSIGHTYLE
EIRKKIDKLL TPPEHQPDKA LPAPVDVKSK KRGGRRFRKM KERFQMSDLR RAQNKMEFGK
EEDSVTDSFG EEIGLGMSRT NGGSGRIGEI RVNTNTGARM SKGMVHRLQK HEQSAKIQRI
DKGIFDQDFD SILLVNPSSK KSSENKLNGS SSSTIGSKWF TGMSKRKNED DGGNDKKRQQ
NSVL
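
The record specifies translation table 12, the alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The sketch below builds that table from the standard NCBI ordering of codons (bases in the order T, C, A, G) and translates a short stretch; the function name `translate_table12` is hypothetical. Because the record marks the gene as spliced, translating the full genomic sequence directly is not expected to reproduce the listed protein, so the example uses only the first 30 bases.

```python
from itertools import product

BASES = "TCAG"

# Amino acids for NCBI translation table 12, in TCAG codon order.
# It differs from the standard code at one codon: CTG encodes S (Ser), not L (Leu).
AMINO_ACIDS_T12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS_T12)
}


def translate_table12(cds: str) -> str:
    """Translate a coding sequence with table 12, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE_12.get(cds[i:i + 3], "X")  # X for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_table12("ATGCAAGACT ATGAGAAAGA ATTGCTAGCT"))  # MQDYEKELLA
```

The output matches the first ten residues of the protein sequence above.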