Gene Information |
Locus tag | PICST_39705 |
Symbol | PRP1 |
ID | 4851966 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 3318177 |
End bp | 3320882 |
Gene Length | 2706 bp |
Protein Length | 901 aa |
Translation table | |
GC content | 39% |
IMG OID | 640393674 |
Product | Pre-mRNA splicing factor prp1 |
Protein accession | XP_001386957 |
Protein GI | 126276163 |
COG category | |
COG ID | |
TIGRFAM ID | |
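The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions, that the CDS is unspliced (the record lists "Is gene spliced | No"), and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the record above (PICST_39705, replicon NC_009068).
start_bp, end_bp = 3318177, 3320882
length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2706 901
```

Under these assumptions the computed values match the Gene Length (2706 bp) and Protein Length (901 aa) fields of the record.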
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.215456 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCTGAGAC CGGCATTTCT AGACCAAGAA CCTCCACCTG GATATATATC TGGAATTGGA CGAGGTGCCA CGGGATTTAC AACTTCTGCT GATACTGGCT CTTTACAACC TGGCTTTACT ATAGAAAATG GAGAAGAATC CGATGACAAT TTAGCTGGAG AAATTGGTGA TGAGGGAGCT ATCCTTGCGA GCGGAAAAAA CCGCGATAAA GAAGACGAAG AAGCAGATCA AATTTATGAG GAAATTGAGA GGAAATTGCA AAGAAGAAAA GTGTTAAAAG AAGAAGCAGT TTCAGAAGCA GAAAATCAAA GTTATGAAAT CAAGAGCAAG TTTTCGGATT TGAAGAGATC GCTTTCGAGC ATTTCAGCCG AACAGTGGGA AGCTTTACCA GAAGTTGGAG ATATTACAAG ACGTAACAAG AGAACGAGAC TTTATGAGCA GCAACAGCAG AGAACGTATG CTGTTCCAGA TAGTGTTATA GCTGGAAGTA TAGCGAGAGC ATCACCTACC AACTTCCAGT CAATTTCCGA GTCTCGAGAC CAATTACTTC TGAGCCAACT TGACTCCTTA TTACCTAAAC ATGAGATAGA GTTTAATGCA GAAGAAGCAA CCGAGATATT GAAGAATGAC CAGCATGTAC AAATTGCAGA TATCAGGAAG GGCAGGCAAA TTCTTGCTTC CTTGAGAAGA ACACAGCCAA ATCTGTCAAA CTCGTGGATT GCATCTGCAA GACTTGAAGA ACAGGCAACC AACTACACTA TGGCCAAAAA GCTCATTATA GAAGGGTGCA AGCACGCTCC AAAAAGTGAG GATATCTGGC TTGAAAGTAT ACGTATACAC AAACTAACTT CTGAGGGAGC GAAAATGTGT AAGGTTATCG TAGCCGAATC TTTGAAGTAC AACCCTGGTT CGGAAAAACT TTGGATCCAA GCAGAAAACT TGGAGAATTC TATAGATGTG GTCTCCAGAA AACGGATTTT GATGAGAGCT ATAGAGGCTA TACCATCCAG TGCCAGTCTT TGGAAGAGAC TAGTGGATTT GGAGACTAAC CAAGATGACG TAAGCAAGAT TTTGACCAAG GCAATAGAGT TGTGTCCTAC TGAATGGGAC TTCTGGTTAA CTTTAATCAA TTCGTCCGAA TACAAAGACG CAAAAACACT TCTTAACAGA GCCAGAAAGG CATTGAATGG AAACTACCAG GTATGGATTA CCGCAGCTAA ATTGGAAGAG AGAGAAAATT CAACAATTGA TTCATCAAAG ATATCAAAAT TGATGGATAA GGCATTTAAG GAAACCGAGA AGAGTACGAC TACAATTCTG CGAACTACTT GGTTAGAAGA AGCCATCAAA GCAGAAGAAG AAGGATTCAG AAATACTTGT AGAGCCATTG TTAACAGTCT ACTTTCTTCT GAAATAAACC AAGATAATCC AGAAGAAAAC CTTGTAACTT GGTTCCAGGA TGCTGAAACT CTAGCACTGA AAGAGAGTGT GGAAGCTGCG AACTATATCC ACCAATTCAT TGTTGAGACC AATCCACATA GCATTAACAG TTGGAAGGAG TTGTTCTCAT TTTTAAAGAA CTCATCAAAC AGGAATTTGG ATACTTTATT TAACTACTAC AAGAGATCTA TAGAGTTAAA TCCGAAGGTC GAGGTTTTGC ATCTCATGTA TGCAAAGGAT TTGTGGCAAT TGGCTGGCAA TATTGTCGAA GCGAGAAAGG TTTTGAATGC TGCTAGTCAT GGACTTGAAA ATAATGAAGA AGTGTGGTTC GCAAGAATCA AATTAGAAAT CAAATCCGGT 
AACTTTGAAC AGGCTCTATC CATTTCTTCT AAAATGATCA AAGCTATTCC TACTTCTTCT GATCGCGTTT GGTATAAACA CATTCACTTG GTGCGATGTA TGAACAATAG AGAACAAAAT CCTAACTACG AGGGACAGAT ATTGGCGTTA CTAGAAAGTG GGTTGGATTC TTTTCCAGAA TCTCCTAGAC TTCATTTGCA GAAAATACAA GTTCTTCTTA GAGACTTGAG AAAACCTGAC ATCGCTAGAG AATCTGCCCG TGCTGCAGTT GAAAAACTCC CCAGCATAGT TGAACTTTGG ATCTTGTTGT CTCATATTGA CGAACAACAT TTGAATATTT TGATCAAAGC CAGATCTGTA TTGGATACTG CTATCCTCAA AAATCCAACA AGTGACAAAT TATGGACAGC AAAAATACAA CTAGAAAGAA GAAATAAGGA CTTTGTGGCT GCTAGACAAC TTGTTAACAA GGGATTGAAA GCTTTTCCTA AGTCTTCCAG AATCTGGATC GAGTACTTAT CTTTGATTCC CAAAATGTCA CACAGAAAAA CTGCGTTCTT GGATGCTTTG AAAAGTACCG AAAATTCTCC AGAAATCCTT TTAGGTATTG GTGTATTCTT CTGGCTTGAC AGTAAACATT CCAAGGCTAA ATCGTGGTTC GAAAGAGCGT TGTCAAATGA TCGCAATAGT GGGGAATGCT GGGGCTGGCT CTTCAACTTC ATGAAATCAT ATGGAAAGCA ACAAGAGAAA GAAAATTTGA TAAACTCATT CGAGAGTCAC TACGAAGAAA TCAATAAAGG AGATATTTGG AATTCTACGA ACAAAGATAT AACCAACTTT GAGAAAACTC CAAAGGAAAT CCTTGAACTT GTGTCCAAGG TTCTTATATC TAAATATAGT ACATAA
|
Protein sequence | MLRPAFLDQE PPPGYISGIG RGATGFTTSA DTGSLQPGFT IENGEESDDN LAGEIGDEGA ILASGKNRDK EDEEADQIYE EIERKLQRRK VLKEEAVSEA ENQSYEIKSK FSDLKRSLSS ISAEQWEALP EVGDITRRNK RTRLYEQQQQ RTYAVPDSVI AGSIARASPT NFQSISESRD QLLLSQLDSL LPKHEIEFNA EEATEILKND QHVQIADIRK GRQILASLRR TQPNLSNSWI ASARLEEQAT NYTMAKKLII EGCKHAPKSE DIWLESIRIH KLTSEGAKMC KVIVAESLKY NPGSEKLWIQ AENLENSIDV VSRKRILMRA IEAIPSSASL WKRLVDLETN QDDVSKILTK AIELCPTEWD FWLTLINSSE YKDAKTLLNR ARKALNGNYQ VWITAAKLEE RENSTIDSSK ISKLMDKAFK ETEKSTTTIL RTTWLEEAIK AEEEGFRNTC RAIVNSLLSS EINQDNPEEN LVTWFQDAET LALKESVEAA NYIHQFIVET NPHSINSWKE LFSFLKNSSN RNLDTLFNYY KRSIELNPKV EVLHLMYAKD LWQLAGNIVE ARKVLNAASH GLENNEEVWF ARIKLEIKSG NFEQALSISS KMIKAIPTSS DRVWYKHIHL VRCMNNREQN PNYEGQILAL LESGLDSFPE SPRLHLQKIQ VLLRDLRKPD IARESARAAV EKLPSIVELW ILLSHIDEQH LNILIKARSV LDTAILKNPT SDKLWTAKIQ LERRNKDFVA ARQLVNKGLK AFPKSSRIWI EYLSLIPKMS HRKTAFLDAL KSTENSPEIL LGIGVFFWLD SKHSKAKSWF ERALSNDRNS GECWGWLFNF MKSYGKQQEK ENLINSFESH YEEINKGDIW NSTNKDITNF EKTPKEILEL VSKVLISKYS T
|