Gene Information |
Locus tag | PICST_62786 |
Symbol | PRP40 |
ID | 4840034 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 940708 |
End bp | 942162 |
Gene Length | 1455 bp |
Protein Length | 478 aa |
Translation table | 12 |
GC content | 38% |
IMG OID | 640391349 |
Product | pre-mRNA processing protein |
Protein accession | XP_001385539 |
Protein GI | 150866066 |
COG category | [A] RNA processing and modification |
COG ID | [COG5104] Splicing factor |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.344613 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.715644 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGAGT GGGAGAAAGT GACTGACAAC GAAGGTAGAG TATATTACTA CAATTCCAAA ACGAAGGAAA CAAGTTGGAC CCTCCCCCAA TCAGAATCTT CAGTTTCCAG TGGTTCCAAA TGGCAGGAAT ATGCTACCGA TGATGGAAGA AAATATTACT ATAACGAGTC CACAGGCGAG ACTACGTGGG AGATGCCGCA GGAAATGGAG AAAGCTGAAG ATAAAAGAAA CGTCGATGAT GTAAAAGAAA AGGACGAACA AGTTGCTTCG AAATCAACTG AAGAGAGCCA ATTAGATCTT CGACTAGCTT CAGAACCAAT CAAAAAGTCA GACTTGGTAA ACCCACCAAA GGATGATTCA TATCCCGAAT CAGAGGCATT TGTGGAGATG CTTCGTTCTA ATAAGGTTGA TTCGACTTGG TCTTTTCAAG CAGTAATGTC GAAGTTTATT GATGACCCCA AGTATTGGGC CATTCCTGAT GCATTGGAGC GGAAGAAATT ATACGACGAA TATCTTGTGA CGAGATTCAA AGAAGATTTA TCCAACAAGA GTTTATTGGT GGAGACATTC AAAAAGAACT TTGTCGAAAC TCTAAAGAAA TACGAAGAAA ATGGTAGACT TCTGCGGAAT AGTAGATGGA TCTCAGTAAA AAAGTTACTT ATCGCTGAAG ACAATCCAAT CTTCAAGCAT TCCATTTTGT CAGATGCTGA GATAGCGGAA ATATATTATG AATATATCTC CAGACTTAAG AAGCAATATG AAGAAGAATT GTCGAAAAAC AAGGATCGTG CATTATCTGA ACTTGAATCA TACCTTACCC AAATTAATCC CAACATAGTA TCTAGCACAA GTAATTGGCA GGAATTACTT GAAAACCTCA AGGCAGATGC CAGGTTCAGG GCTAACAAGC ATTTCAATGT ACTCAGTGAC GTAGATTTAC TTGAAATGTA TGAGACAAAG ATATACCCGA CTATCATACA AAAAATTAAG AGCGAGATTG ATGACGTTCA GAAAAAGAAT TACCGATCAG ACAGGAAGGC AAGACAAAAG TACAAGGCAT TATTGAAGAC ACTCGATATC AATGCAAATT CTAACTTCAA AGACTTTCTC TACATTCTTG AGAATGATGA TTCATTTATA GAGCTTTGTG GAAGAAATGG GTCTACAGCA CTCGAGCTCT TTTGGGACAT CGTCGATGAG AAATCGCAAG TCTTGAAATT GAAAATGTAC TTAGTGGAAT CTGTTTTGCT CGATTTGAAG AAGGAAGACT CTACTTTAAC AAAGTCCAAG ATACTACTGT CAGAGAATAA TTTCATAGAA TTTTTGTCCA ATTCTAGTGA CCAGAGAATC GAGAATCTAG ACATTGACCT TAATGATGCT AATGAAACAG AGGTATTGTA TGGGGCATTG AAAAGAGAGT TTGAAGCTCA ACAAGAAAAG AGACGCGTTC GCTTC
|
Protein sequence | MSEWEKVTDN EGRVYYYNSK TKETSWTLPQ SESSVSSGSK WQEYATDDGR KYYYNESTGE TTWEMPQEME KAEDKRNDEQ VASKSTEESQ LDLRLASEPI KKSDLVNPPK DDSYPESEAF VEMLRSNKVD STWSFQAVMS KFIDDPKYWA IPDALERKKL YDEYLVTRFK EDLSNKSLLV ETFKKNFVET LKKYEENGRL SRNSRWISVK KLLIAEDNPI FKHSILSDAE IAEIYYEYIS RLKKQYEEEL SKNKDRALSE LESYLTQINP NIVSSTSNWQ ELLENLKADA RFRANKHFNV LSDVDLLEMY ETKIYPTIIQ KIKSEIDDVQ KKNYRSDRKA RQKYKALLKT LDINANSNFK DFLYILENDD SFIELCGRNG STALELFWDI VDEKSQVLKL KMYLVESVLL DLKKEDSTLT KSKILSSENN FIEFLSNSSD QRIENLDIDL NDANETEVLY GALKREFEAQ QEKRRVRF
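The record lists translation table 12, in which the codon CTG is read as serine rather than leucine, and gives start and end coordinates alongside a gene length. The following is a minimal sketch of how those fields can be checked against the sequences above. The names `translate`, `span_length`, `STANDARD_TABLE`, and `TABLE_12` are hypothetical helpers introduced here for illustration, not part of any database tool. Because the record marks the gene as spliced, the sketch translates only short in-frame fragments copied from the gene sequence rather than the whole gene.

```python
# Illustrative helpers only; the coordinates and sequence fragments are copied
# from the record above, everything else is an assumption made for this sketch.

BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]

STANDARD_TABLE = dict(zip(CODONS, STANDARD_AA))
# Translation table 12 differs from the standard code at CTG (Ser instead of Leu).
TABLE_12 = dict(STANDARD_TABLE, CTG="S")


def translate(dna, table):
    """Translate an in-frame DNA fragment; spaces are ignored, a trailing partial codon is dropped."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


def span_length(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


if __name__ == "__main__":
    # Start bp and End bp from the Gene Information table.
    print(span_length(940708, 942162))
    # First 30 bases of the gene sequence.
    print(translate("ATGTCAGAGT GGGAGAAAGT GACTGACAAC", TABLE_12))
    # In-frame fragment containing a CTG codon, under both tables.
    fragment = "GGTAGACTTCTGCGGAATAGTAGATGG"
    print(translate(fragment, TABLE_12))
    print(translate(fragment, STANDARD_TABLE))
```

Under table 12 the CTG-containing fragment translates with a serine, which agrees with the listed protein sequence; under the standard table the same codon would give leucine.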