Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_87539 |
Symbol | PRP5 |
ID | 4837624 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 2193221 |
End bp | 2196041 |
Gene Length | 2821 bp |
Protein Length | 875 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640388939 |
Product | pre-mRNA processing RNA-helicase |
Protein accession | XP_001382645 |
Protein GI | 150863984 |
COG category | [J] Translation, ribosomal structure and biogenesis [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0513] Superfamily II DNA and RNA helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.223181 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAATA CAGTTACTAA CGATTCAGAT GAATCCAAGT TATCCAAAGA AGAGAAGTTG AAGAAAAGAC GCGAGCAGCT TGCGCTTTGG CGGCAAAAAA AGGAACAAGA TCTGAAGAAT GGAGAAAAAG AAATGAATTC AAATGCAACA ACATCGTCAG ACGATAAACA GAAACTCAGG CAACAACGTA TGGAAGAATG GAAGAAGAAA CGAGCACAGG AGACTAAGAT AGATTCGAAG GAATCTGTAA AAGAGTCACC GAAACAAGAG AACTCAATCG TAGACGATAA GCTCAAACAG AGACAACAGA GAATTGAAGA GTGGAAGAAG AAACGAGCAC AAGAATCAAA TGCAACTAAT TCAGTTTCAC CGTCAGTTAC AGAAGTAAAA GTTCAAAAGC CAGCTTTTCA GTTAAAAAGA TTGGCCTCTA GCATCAAGAA GCAGGAGCTT CCATTTACAA AGAAAAGAAT ACTATTCGAT GAAGAAGAGG AAGACGAATC GAAGAGAAAA CCTAAATTCA AGAAGCCAAC TTTAGATGGT CATACTGAAG ATTTGCAGCA CAGAGAAGAT GACCCCAATG GAAAAATAGA CGAACTTGAT GATTTTATAG CTTCCCTTTC AAAACAGGAG TCTAGCTCCA ACGACATACC TTCTCAAATA ATTGAAGATG AACAGTTAGA AGTGGAAAAT GAAGGTGACT CTGAAGATGA AGAAATAGAC CAAGACAAGA AACAGCAGGA ATTACTCTCT TCCAAGTTCC AGAAACTTCA GAATGAGAAA CAGTTAGAAA CTATAGACCA TTCTACTATG AACTACTCTG ACTTTAGAAA GAATTTCTAT CAAGAACCAA GTGAGATTCA AAATTGGACA GCTGAACAAG TGGAAAGTAT CAGACTAGAA TTGGATGGGA TAAAGGTGGC TGGTTCAAAT GTTCCTAGAC CGGTTTTGAA ATGGTCTCAT CTAGGATTGC CTGCTTCATA TATGAATATC ATTGAAGACA AACTTGAGTA CAAGGCTCCT ACCTCAATTC AATCTCAGGC TCTTCCTGCT ATTATGTCTG GAAGAGACAT TATTGGTGTA GCCAAAACTG GCTCGGGCAA GACGTTATCT TTTGTACTTC CAATGTTAAG GCATATCCAG GATCAACCTG ATTTAAAAGA TGGTGAGGGT CCTATTGGTT TAATATTGTC TCCTACCAGA GAGCTAGCAG TTCAGATTCA TAAGGAGATC ACTAATTTCA CGAAGAGATT GGGAATGACA GCTTGCTGTT GTTATGGAGG ATCTCCTATT GAATCACAAA TTGCGGAGCT AAAGAAAGGG GCACAAATAT TGGTAGGAAC ACCAGGTAGA ATCATAGAAT TACTAGCTGC TAATAGTGGC AGGATAACAA ACTTACGAAG AGTAACATAC GTTGTATTGG ACGAAGCTGA CAGAATGTTC GACTTGGGAT TTGAACCTCA AGTGACAAAG ATATCGTCAC AGATTCGGCC TGAGAGTCAA ACAGTTCTTT TCTCAGCTAC GTTTCCCAGA AAAATCGAAT TGCTAGCCAA ACGATTACTC TATAATCCGT TAGAAATTAT TGTGGGAGGT ATTAGTGTCG TGGCATCTGA GATCACTCAG AAAGTGGAGC TATTTGAGAA GGGCGAGAGT AGCCAGCTAG AAGATGAGAA GTTTGACAGA CTACTCAATA TTTTGAATGT TTTCTCGATA GAGTCCAAAC ACAGCAAAGT ATTGATTTTT GTCGAAAAAC AGTCTGCTGC CGACGATTTG CTAGTCAAGT TGTTAGGAAG CAACCACCCA TGCCTAACAA TACATGGAGG TAAGGATCAA ATTGATAGAA AGTATGCTAT TAAAGAATTT TCATCCAAAG ATAGCGGTGT AGATATTTTG ATCGCAACCT CCATTGCAGC CAGAGGTCTT GATGTCAAAG GGTTAGATCT TGTAATCAAT TACGATCCAC CTAATCACAT GGAAGATTAC GTTCATCGTG TTGGTAGAAC TGGTAGAGCA GGTATGAAAG GTACAGCTAT AACCTTTGTT TCTTCAGATC AAGAAAGACT GGTTACAGAT CTAGTCAGGG CCATGACGAT GAGTAAGATC CCTGAGGATG AGATTCCCAG CAGATTAATA GAGATAAGGA ACCAATTCTT AGAAAAAGTT AAGGCTGGTA AATTTAAATA TAGTTTTGGA TTCGGTGGCA AGGGTTTGGA AAAATTACAG CAAATCAGAG ACTCTACTAG AAGCTTACAA AAGAAAGAAT ATGGGCCAAA TGACGACGAT GATGTCAATT TTGTTGCGGA CAAAACCAAC GGTACTGCTA AAAAAGATGG AGCTACTTCT TTACCGGCTC TGGAGGTCGC CGTGGATTTG CCTGACTTCC AGGTTATTGA AGGACGAGCA CCAGAAACTT CTGGTCCAGA CAAGAGTAAA TTCCATTCCA GAATCACCAT CAACGACTTA CCACAAAGAG CTCGTTGGTT TGTGGTCAAT CGTGATAGCT TGTCGAAGAT AATTGAGTCT ACATCGACAT CGATAACAAA CAAAGGTCAA TACTATGCAC CCAACGTCAA GGTTCCACAG ACCGTTACGG TCAATGGAAA GGAAAGTCCA GCTCCACCAA GGTTGTACTT GCTTGTAGAA GGGTTAACAG AGCAATCCGT TCGCGAAGCT AACTCATTGA TTAGAGACAA AATGATAGAA GGATTGGAGG TAGCTTCTAA GGAACTGAAT ATGGCACCAA CGGGTAAGTA CAAAGTATAG AAGTTATAGA GTATTGAAGC TGGATATGAT AGAATAGAAC ATAGGAAATT AGACTGATAG G
|
Protein sequence | MDNTVTNDSD ESKLSKEEKL KKRREQLALW RQKKEQDSKN GEKEMNSNAT TSSDDKQKLR QQRMEEWKKK RAQETKIDSK ESVKESPKQE NSILKRLASS IKKQELPFTK KRILFDEEEE DESKRKPKFK KPTLDGHTED LQHREDDPNG KIDELDDFIA SLSKQESSSN DIPSQIIEDE QLEVENEGDS EDEEIDQDKK QQELLSSKFQ KLQNEKQLET IDHSTMNYSD FRKNFYQEPS EIQNWTAEQV ESIRLELDGI KVAGSNVPRP VLKWSHLGLP ASYMNIIEDK LEYKAPTSIQ SQALPAIMSG RDIIGVAKTG SGKTLSFVLP MLRHIQDQPD LKDGEGPIGL ILSPTRELAV QIHKEITNFT KRLGMTACCC YGGSPIESQI AELKKGAQIL VGTPGRIIEL LAANSGRITN LRRVTYVVLD EADRMFDLGF EPQVTKISSQ IRPESQTVLF SATFPRKIEL LAKRLLYNPL EIIVGGISVV ASEITQKVEL FEKGESSQLE DEKFDRLLNI LNVFSIESKH SKVLIFVEKQ SAADDLLVKL LGSNHPCLTI HGGKDQIDRK YAIKEFSSKD SGVDILIATS IAARGLDVKG LDLVINYDPP NHMEDYVHRV GRTGRAGMKG TAITFVSSDQ ERSVTDLVRA MTMSKIPEDE IPSRLIEIRN QFLEKVKAGK FKYSFGFGGK GLEKLQQIRD STRSLQKKEY GPNDDDDVNF VADKTNGTAK KDGATSLPAS EVAVDLPDFQ VIEGRAPETS GPDKSKFHSR ITINDLPQRA RWFVVNRDSL SKIIESTSTS ITNKGQYYAP NVKVPQTVTV NGKESPAPPR LYLLVEGLTE QSVREANSLI RDKMIEGLEV ASKESNMAPT GKYKV
|
| |