Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_46989 |
Symbol | PRP31 |
ID | 4839069 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 1120984 |
End bp | 1122681 |
Gene Length | 1698 bp |
Protein Length | 544 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640390384 |
Product | splicing factor |
Protein accession | XP_001385225 |
Protein GI | 150865843 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1498] Protein implicated in ribosomal biogenesis, Nop56p homolog |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.192868 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGACT ATGAGAAAGA ATTGCTAGCT GACTTCGACA GTGACAGCGA CGTAAGCTTG GAAGAGGAAC CCTTGGTTGA AAATTCGACC GAAAACGAGG CAAAATCGTT TGAAAACCTC AATGGGCTCC AGGAAAATGA CTTTATTCAC CAACAGAATG GATTCAATAG CGGTGAAAAT TCAACCAACA ACCACGAAAA AACATTTTTT ACACAACTAA GCGAGTTGAT TGCCTCCAAT ACCGTAGCAG GTTCACTTTC GGCGATTCTA AGTGAGCCAG ACATAAGCAG CATTGAAGAT ATCACTGTCT TCTCGAAGGT ATACCCACTT ATTCCTCAAT TGAAGAAGCA TATAGAGTTG TATTCCAACG AGGAGACAAC TGATTTTTTG GAACTCCTAT CTTCTATTGA TGATAGTGAA GACCAATCGG AAGAGTACAA GTTCATTCTT TTGGTAAACG AGCTTCTGGG TATCATTAAT CAGGAAATTA TAGCCTACCA TCAGCTTCTC AAGACTCAGT ATAAAGTAGT GTTTCCAGAG CTTGAAACGT TGGTGCTTAA CCCCATAGAT TATGCTCGTA TTATAGCGAT AATCAAACAG GACTTGAAGA ATATCCGTTC ATATGACGAA CAAATGAAGG CGATAGTGTC CAATGAAAAG ATTCTTGTCA TTATAATGGC GGCTCTTCAG CAGCTAGGCC AACAATTTGT ACTCAATGAT AAAGATATGA ACAGCATAAT TGATTGCTGT GTCATTTTGC TTGAATTATA TGAAATATTA CAGCTTCTAT CGAACTTCAT AACTCAAAAG CTCACTAAGT TTGCACCTAA TGTGAGTGCT ATTGTCGGTT CTATTACAAC TTCGCAATTG TTAATAGCAA CAGGTTCTTT AAAACTGTTG GCCATGACTC CTTCGTGCAA CTTGGCGTCC TTAGGAATCA GAGACCTTTC ATCAAAGACG AAATCCAAAT CTAGAACCGT ACGGCAAACA GGCTATTTAT ATCATTCTGA AGTTGTAAAG TATCTTCCTG AGGATATAGT TCGTTCGACA ATGCGTATAG TAAGCGGAAA AGTGATTTTG GCCGCTCGTG TAGATTTGGC AGGCTCTTGT CCTGATGGTT CCATTGGTCA TACGTATTTG GAAGAGATTA GGAAGAAGAT CGACAAGCTC TTGACTCCTC CTGAACACCA ACCCGACAAG GCATTGCCTG CTCCAGTAGA TGTGAAATCA AAAAAGAGAG GAGGTAGACG ATTCAGGAAG ATGAAGGAAA GGTTCCAGAT GTCTGATTTA CGCCGAGCCC AGAACAAGAT GGAATTCGGC AAGGAAGAAG ATTCTGTGAC AGACAGTTTT GGTGAAGAAA TTGGATTGGG TATGAGCAGA ACGAATGGAG GCAGTGGAAG AATTGGAGAG ATTAGAGTGA ATACTAATAC TGGAGCCAGA ATGTCAAAGG GCATGGTTCA CAGATTACAG AAACATGAAC AGAGTGCCAA AATTCAGAGA ATTGACAAGG GTATATTTGA CCAAGACTTT GACAGCATTC TTTTGGTAAA CCCTAGTAGT AAAAAAAGCA GCGAGAACAA GCTCAATGGT TCAAGTAGTC TGACAATTGG AAGCAAGTGG TTTACAGGAA TGAGCAAGAG GAAAAATGAG GACGATGGTG GCAACGACAA GAAGAGACAA CAGAATAGTG TATTATAG
|
Protein sequence | MQDYEKELLA DFDSDSDVSL EEEPLVENST ENEAKSGENS TNNHEKTFFT QLSELIASNT VAGSLSAILS EPDISSIEDI TVFSKVYPLI PQLKKHIELY SNEETTDFLE LLSSIDDSED QSEEYKFILL VNELSGIINQ EIIAYHQLLK TQYKVVFPEL ETLVLNPIDY ARIIAIIKQD LKNIRSYDEQ MKAIVSNEKI LVIIMAALQQ LGQQFVLNDK DMNSIIDCCV ILLELYEILQ LLSNFITQKL TKFAPNVSAI VGSITTSQLL IATGSLKSLA MTPSCNLASL GIRDLSSKTK SKSRTVRQTG YLYHSEVVKY LPEDIVRSTM RIVSGKVILA ARVDLAGSCP DGSIGHTYLE EIRKKIDKLL TPPEHQPDKA LPAPVDVKSK KRGGRRFRKM KERFQMSDLR RAQNKMEFGK EEDSVTDSFG EEIGLGMSRT NGGSGRIGEI RVNTNTGARM SKGMVHRLQK HEQSAKIQRI DKGIFDQDFD SILLVNPSSK KSSENKLNGS SSSTIGSKWF TGMSKRKNED DGGNDKKRQQ NSVL
|
| |