Gene PICST_87539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_87539 
SymbolPRP5 
ID4837624 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2193221 
End bp2196041 
Gene Length2821 bp 
Protein Length875 aa 
Translation table12 
GC content40% 
IMG OID640388939 
Productpre-mRNA processing RNA-helicase 
Protein accessionXP_001382645 
Protein GI150863984 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0513] Superfamily II DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.223181 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAATA CAGTTACTAA CGATTCAGAT GAATCCAAGT TATCCAAAGA AGAGAAGTTG 
AAGAAAAGAC GCGAGCAGCT TGCGCTTTGG CGGCAAAAAA AGGAACAAGA TCTGAAGAAT
GGAGAAAAAG AAATGAATTC AAATGCAACA ACATCGTCAG ACGATAAACA GAAACTCAGG
CAACAACGTA TGGAAGAATG GAAGAAGAAA CGAGCACAGG AGACTAAGAT AGATTCGAAG
GAATCTGTAA AAGAGTCACC GAAACAAGAG AACTCAATCG TAGACGATAA GCTCAAACAG
AGACAACAGA GAATTGAAGA GTGGAAGAAG AAACGAGCAC AAGAATCAAA TGCAACTAAT
TCAGTTTCAC CGTCAGTTAC AGAAGTAAAA GTTCAAAAGC CAGCTTTTCA GTTAAAAAGA
TTGGCCTCTA GCATCAAGAA GCAGGAGCTT CCATTTACAA AGAAAAGAAT ACTATTCGAT
GAAGAAGAGG AAGACGAATC GAAGAGAAAA CCTAAATTCA AGAAGCCAAC TTTAGATGGT
CATACTGAAG ATTTGCAGCA CAGAGAAGAT GACCCCAATG GAAAAATAGA CGAACTTGAT
GATTTTATAG CTTCCCTTTC AAAACAGGAG TCTAGCTCCA ACGACATACC TTCTCAAATA
ATTGAAGATG AACAGTTAGA AGTGGAAAAT GAAGGTGACT CTGAAGATGA AGAAATAGAC
CAAGACAAGA AACAGCAGGA ATTACTCTCT TCCAAGTTCC AGAAACTTCA GAATGAGAAA
CAGTTAGAAA CTATAGACCA TTCTACTATG AACTACTCTG ACTTTAGAAA GAATTTCTAT
CAAGAACCAA GTGAGATTCA AAATTGGACA GCTGAACAAG TGGAAAGTAT CAGACTAGAA
TTGGATGGGA TAAAGGTGGC TGGTTCAAAT GTTCCTAGAC CGGTTTTGAA ATGGTCTCAT
CTAGGATTGC CTGCTTCATA TATGAATATC ATTGAAGACA AACTTGAGTA CAAGGCTCCT
ACCTCAATTC AATCTCAGGC TCTTCCTGCT ATTATGTCTG GAAGAGACAT TATTGGTGTA
GCCAAAACTG GCTCGGGCAA GACGTTATCT TTTGTACTTC CAATGTTAAG GCATATCCAG
GATCAACCTG ATTTAAAAGA TGGTGAGGGT CCTATTGGTT TAATATTGTC TCCTACCAGA
GAGCTAGCAG TTCAGATTCA TAAGGAGATC ACTAATTTCA CGAAGAGATT GGGAATGACA
GCTTGCTGTT GTTATGGAGG ATCTCCTATT GAATCACAAA TTGCGGAGCT AAAGAAAGGG
GCACAAATAT TGGTAGGAAC ACCAGGTAGA ATCATAGAAT TACTAGCTGC TAATAGTGGC
AGGATAACAA ACTTACGAAG AGTAACATAC GTTGTATTGG ACGAAGCTGA CAGAATGTTC
GACTTGGGAT TTGAACCTCA AGTGACAAAG ATATCGTCAC AGATTCGGCC TGAGAGTCAA
ACAGTTCTTT TCTCAGCTAC GTTTCCCAGA AAAATCGAAT TGCTAGCCAA ACGATTACTC
TATAATCCGT TAGAAATTAT TGTGGGAGGT ATTAGTGTCG TGGCATCTGA GATCACTCAG
AAAGTGGAGC TATTTGAGAA GGGCGAGAGT AGCCAGCTAG AAGATGAGAA GTTTGACAGA
CTACTCAATA TTTTGAATGT TTTCTCGATA GAGTCCAAAC ACAGCAAAGT ATTGATTTTT
GTCGAAAAAC AGTCTGCTGC CGACGATTTG CTAGTCAAGT TGTTAGGAAG CAACCACCCA
TGCCTAACAA TACATGGAGG TAAGGATCAA ATTGATAGAA AGTATGCTAT TAAAGAATTT
TCATCCAAAG ATAGCGGTGT AGATATTTTG ATCGCAACCT CCATTGCAGC CAGAGGTCTT
GATGTCAAAG GGTTAGATCT TGTAATCAAT TACGATCCAC CTAATCACAT GGAAGATTAC
GTTCATCGTG TTGGTAGAAC TGGTAGAGCA GGTATGAAAG GTACAGCTAT AACCTTTGTT
TCTTCAGATC AAGAAAGACT GGTTACAGAT CTAGTCAGGG CCATGACGAT GAGTAAGATC
CCTGAGGATG AGATTCCCAG CAGATTAATA GAGATAAGGA ACCAATTCTT AGAAAAAGTT
AAGGCTGGTA AATTTAAATA TAGTTTTGGA TTCGGTGGCA AGGGTTTGGA AAAATTACAG
CAAATCAGAG ACTCTACTAG AAGCTTACAA AAGAAAGAAT ATGGGCCAAA TGACGACGAT
GATGTCAATT TTGTTGCGGA CAAAACCAAC GGTACTGCTA AAAAAGATGG AGCTACTTCT
TTACCGGCTC TGGAGGTCGC CGTGGATTTG CCTGACTTCC AGGTTATTGA AGGACGAGCA
CCAGAAACTT CTGGTCCAGA CAAGAGTAAA TTCCATTCCA GAATCACCAT CAACGACTTA
CCACAAAGAG CTCGTTGGTT TGTGGTCAAT CGTGATAGCT TGTCGAAGAT AATTGAGTCT
ACATCGACAT CGATAACAAA CAAAGGTCAA TACTATGCAC CCAACGTCAA GGTTCCACAG
ACCGTTACGG TCAATGGAAA GGAAAGTCCA GCTCCACCAA GGTTGTACTT GCTTGTAGAA
GGGTTAACAG AGCAATCCGT TCGCGAAGCT AACTCATTGA TTAGAGACAA AATGATAGAA
GGATTGGAGG TAGCTTCTAA GGAACTGAAT ATGGCACCAA CGGGTAAGTA CAAAGTATAG
AAGTTATAGA GTATTGAAGC TGGATATGAT AGAATAGAAC ATAGGAAATT AGACTGATAG
G
 
Protein sequence
MDNTVTNDSD ESKLSKEEKL KKRREQLALW RQKKEQDSKN GEKEMNSNAT TSSDDKQKLR 
QQRMEEWKKK RAQETKIDSK ESVKESPKQE NSILKRLASS IKKQELPFTK KRILFDEEEE
DESKRKPKFK KPTLDGHTED LQHREDDPNG KIDELDDFIA SLSKQESSSN DIPSQIIEDE
QLEVENEGDS EDEEIDQDKK QQELLSSKFQ KLQNEKQLET IDHSTMNYSD FRKNFYQEPS
EIQNWTAEQV ESIRLELDGI KVAGSNVPRP VLKWSHLGLP ASYMNIIEDK LEYKAPTSIQ
SQALPAIMSG RDIIGVAKTG SGKTLSFVLP MLRHIQDQPD LKDGEGPIGL ILSPTRELAV
QIHKEITNFT KRLGMTACCC YGGSPIESQI AELKKGAQIL VGTPGRIIEL LAANSGRITN
LRRVTYVVLD EADRMFDLGF EPQVTKISSQ IRPESQTVLF SATFPRKIEL LAKRLLYNPL
EIIVGGISVV ASEITQKVEL FEKGESSQLE DEKFDRLLNI LNVFSIESKH SKVLIFVEKQ
SAADDLLVKL LGSNHPCLTI HGGKDQIDRK YAIKEFSSKD SGVDILIATS IAARGLDVKG
LDLVINYDPP NHMEDYVHRV GRTGRAGMKG TAITFVSSDQ ERSVTDLVRA MTMSKIPEDE
IPSRLIEIRN QFLEKVKAGK FKYSFGFGGK GLEKLQQIRD STRSLQKKEY GPNDDDDVNF
VADKTNGTAK KDGATSLPAS EVAVDLPDFQ VIEGRAPETS GPDKSKFHSR ITINDLPQRA
RWFVVNRDSL SKIIESTSTS ITNKGQYYAP NVKVPQTVTV NGKESPAPPR LYLLVEGLTE
QSVREANSLI RDKMIEGLEV ASKESNMAPT GKYKV