Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_52034 |
Symbol | CFT1 |
ID | 4851436 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 1810109 |
End bp | 1814164 |
Gene Length | 4056 bp |
Protein Length | 1341 aa |
Translation table | |
GC content | 40% |
IMG OID | 640393144 |
Product | pre-mRNA 3'-end processing factor CF II mRNA cleavage and polyadenylation factor II complex, subunit CFT1 (CPSF subunit) RNA processing and modification |
Protein accession | XP_001387581 |
Protein GI | 126274564 |
COG category | [A] RNA processing and modification |
COG ID | [COG5161] Pre-mRNA cleavage and polyadenylation specificity factor |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.164795 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGTTT ATCATGAATT CATCGATCCG TCTCGAGTTT CCCACTCTGT AGGCTGCAAC TTCATTTCAT CCACCGTTAA ACATTTAGTT GTTGGAAAAG CCACGCTTCT CCAGATCTTT GAAGTTGTAC AGTTGAAGCT GCTGACGCCG TCAAAGCCTC AGCATCGCTT GAAATTAATA GACCAGTTCA AACTCCATGG ATTAATCACA GATATAAAGC CCATAAGAAC TGTAGAATCA CCCAATTTTG ACTACTTGTT GGTTTCGACG AAGTCCGCAA AGTTTTCTGT CATCAAATGG GACCATCATC TCCATACGAT TCTGACAGTA TCTTTGCATT ATTATGAGAA CGCGATCCAG AACTCTACCT ATGAGAAGTT GTCGAAGTCT GAACTCTTGT TAGAACCTTA TGGAAGCTGT AGTTGCTTGC GTTTCAAGAA TTTGCTCTGT TTTTTGCCGT TTGAAACAGC AGAGGAGCTC GACGATGACG ACGCAGACTC TGAAAACGAA GATATGGTCA AATCAGAGAA GAAGGAACAC GAAAATGGTA CAGTTAACGT TCCAGTTACA GATCAGCCAG GTAGTTTTTT CGATACCAGC TTCTTAATAG ATGGCCAGAG TCTCGACTCG TCTATTGGTA GCATTATAGA CATGCAATTT TTGTTCAAAT ATAGAGAGCC AACTTTTGGT ATACTATCCC AGCGACAACA GGCTTGGGCT GGGAACTTAC CAAAGATTAA GGATAACGTC CAGTTTTGCA TTTTGACCTT GGATTTGACT ACTAAGTCTA CTGTTTCAGT TTTGAAGATT GACAATCTTC CGTACGATGT TGACAGAATT GTTCCCTTGC CTTCTCCCTT GAACGGGTGT TTGCTTTTGG GCTGCAACGA AATAATCCAT GTAGACAACG GTGGGATTGT AAGAAGAATA GCGGTAAACC AGTTCACCTC TCTCATCACA GCTTCCACCA AAGCGTACCA GGATCAAACC CACTTGAATC TCAAGTTGGA AGACTGTAGT GTGGTAGCTT TGCCAAACGA TCACCGTGCT CTATTGGTAT TATCCACGGG TGAATTCTAC TACTTAAACT TTGAAGTAGA CGGTAAGTCC ATCAAGAAAT TCACCATAGA AAGTGTGGAC AAATTATTGT ACAGCGACAT AAAATTAACT TTTCCCGGTC AAATAGCGAC CCTCGACAAC AACTTGCTAT TCTTTGCCAA CCATAATGGT AACAGTCCTT TGGTACAATT CAAATATCAA GATGAAGCAC TTAATCCTAA GAAACTCGCT AAAATTGTAG ACGAAGATAG TAAGAATGAA GATGAAGATG AAGACGAAGA CGACTTATAC AAGGACGAAG AAGAGGAAGT GCAAGTAGTT TTGGGCAATT CTGTCATTGA GTTTGTAAAA CATGACGAGT TGGTGAATAC TGGAGTAGTT TCTAGTTTTT CTTTAGGATA CTATTCCACT GAAAAGTTCA AATTCAATTT GAAGAATCCT AATTGCAAAG AAGTGTCGAT TATTGCCAAT GCTGGCACAC ATTCAGAGAC TAAGTTGAAC ATCATAACAC CTTCCATTCA ACCCACTATC TCTTCAACGT TGAGTTTTTC GCAGGTCAAC AGAATGTGGA ATTTGAACCA GAAGTATTTG ATTACGTCAG ACGATATTAA CTTCAAGTCC GAAATCTTTC AAATCGAAAA GTCGTTTGCA CGTTTGAATT CAAAAGATTT CATTAACAAC GAATTGACCA TTTCGATGCA TGAGTTGAAC AATGGTAAGT TCATCTTGCA AATCACTCCT AAGCAAATCG TCTTGTACAA CAACTTGTTC AGAAAGAAAA TCACCTTGAA CGAAGAAATC AAGGACGATG AAATCATTAA CAGTGTCTTG AGAGACGAGT TCTTGATGAT TTTCCTTGCC AGTGGTGATG TAATGATCTT TGCTATTAAT ACCTACAATG AATCGTACTC AAAGTTGGAG ATTCCCAAGA TCTTGGATGA TACGATTATC ACTACCGGAT ACATCACCAA TTCTCATTTG TTAAGGGCTG TTCTGAAAGA TGTAAATTTA CTCTTGAAGA GCGGAACTAA GAGAAATAGA TCTTCCTCCG TCGTTTCCAA TGTTGGAACT GCTGCAGAAC CTAAGAATGT TGGCCCTAAA CTGAAGACAT TTGTCTTAGT GACAGGTGAT AATAGAATTG TAGCCTTCAA CAGATTTCAT AACCAGAGAT GTTATCAGTT GAATCACGTA GACAAGTTCT CTGACAATTT GCATCTAGGA TTTTTTGACC CTGCTCAAAA CGAACCAGAT CCTTTTATTA AACAAGTAAT GCTCAACGAA ATAGGTGACA AGGATCACAA AGAAGAATAT TTGACTATAT TGACGATTGG GGGTGAAGTT CTTCTTTACA AGTTGTACTT TGATGGAGAA AACTACGAGT TCAAGAAGGA GAAAGATTTA GCTATCACTG GTGCTCCAGA AAACGCATAT CCTATAGGTA CGGCCGTTGA AAGAAGATTG GCATATTTCC CTAATTTGAA TGGATACACT TGTATATTCG TTACTGGTGT TACTCCCTAT TTGATTCTTA AGAGTCTTCA TTCCATTCCA AGAATTTACC AGTTCTCGAA AATACCAGCT GTTTCTATTT CTCCTTTCCA CGATTCGAAA GTAGCAAACG GGTTGATTTT CTTGGACAAT CAGCAGAATG CGAGAATCTG CCAGCTTCCA CTTGACTTCA ATTATGAAAA CACATGGCCC ATGAAGTTGA TCCATATCGG AGAGCTGATT CGTGCAATCA CATACCACGA GTCATCTCAC ACATATGTTG TTTCCACCTT CAAGGATATT GACTACGAGT GTTTTGATGA AGAAGGAAAG CCAATAGTAG GGCTTCATAA GGACAAACCA CCTTCTTCTG CTTATAAAGG CTCCATCAAA TTGATTTCTC CTTTTAATTG GTCTGTCATC GATACGATAG AATTGGCTGA TAACGAGTTA GGCATGACTG TAAAGTCGAT GATTCTCGAT GTAGGCTCAT CTACCAAGAA GTTCAAACAC AAGAAGGAGT TCATTGTGAT CGGATCTGGT AAATACAGAA TGGAAGATTT GTCAGCCAAT GGGTCGTTCA GAATCTACGA AATTATCGAC ATTATTCCTG AGCCAGATAG ACCTGAAACA AACCACAAGT TTAAAGAGGT TTTCAAAGAA GATACCAAGG GTGCTGTCAC TTCTGTATGT GAAGTCAGTG GCAGATTCCT AGTATCACAA GGTCAGAAGG TCATAGTAAG AGATTTACAG GACGATGGGG TGGTCCCTGT AGCCTTTTTG GATACGGCAG TGTATGTTTC TGAGGCCAAA AGTTTTGGTA ACATGATGAT CTTGGGTGAC TCGTTGAAGA GTGTTTGGTT GGTAGGATTC GACGCTGAAC CATTCAGAAT GATCATGTTA GGAAAGGACT TACAAGGACT AGACGTAAAC TGTGCAGACT TTATTACTAA GGATGAGGAG GTGTTTATCT TGATTGCTGA TAACAATAAT GTCTTGCATT TAGTTCAGTA TGACCCTGAA GATCCTACTG CATTAAATGG CCAGAGATTA CTTTCCAAGT CTTCTTTTTC CATCAACTCA TTCGTGACGT GCCTTAAATC TTTGCCCAAG ACTGAAGAGA AATACGACAC TGGCAGTGGA CAGAAGACCT CGTCGGTTAT AGGAGACTTC CAGACGATTG GTTCGACCAT TGATGGTTCT TTCTTTAGTG TTGTCCCCAT AAACGAAGCC AGCTACAGAA GAATGTACAT ATTGCAACAG CAGTTGACCG ACAAGGAGTA CCATTACTGT GGTTTAAATC CTCGTTTGAA CCGTTTCGGT GGATTATCGA TGACAGCAAA CGACACCAAC ACTAAGCCTA TTCTTGACTA CGATGTGATT AGAGCCTACG GCAAGTTGAA CGAAGAAAGA AAGAAAAACT TGGCTAGTAA AGTAAGTGCA AAGAATATTT ACCAGGATAT CTGGAAGGAT ATCATAGAGT TCGAGAATGC GTTGAAGGGT TTGTAG
|
Protein sequence | MDVYHEFIDP SRVSHSVGCN FISSTVKHLV VGKATLLQIF EVVQLKLLTP SKPQHRLKLI DQFKLHGLIT DIKPIRTVES PNFDYLLVST KSAKFSVIKW DHHLHTILTV SLHYYENAIQ NSTYEKLSKS ELLLEPYGSC SCLRFKNLLC FLPFETAEEL DDDDADSENE DMVKSEKKEH ENGTVNVPVT DQPGSFFDTS FLIDGQSLDS SIGSIIDMQF LFKYREPTFG ILSQRQQAWA GNLPKIKDNV QFCILTLDLT TKSTVSVLKI DNLPYDVDRI VPLPSPLNGC LLLGCNEIIH VDNGGIVRRI AVNQFTSLIT ASTKAYQDQT HLNLKLEDCS VVALPNDHRA LLVLSTGEFY YLNFEVDGKS IKKFTIESVD KLLYSDIKLT FPGQIATLDN NLLFFANHNG NSPLVQFKYQ DEALNPKKLA KIVDEDSKNE DEDEDEDDLY KDEEEEVQVV LGNSVIEFVK HDELVNTGVV SSFSLGYYST EKFKFNLKNP NCKEVSIIAN AGTHSETKLN IITPSIQPTI SSTLSFSQVN RMWNLNQKYL ITSDDINFKS EIFQIEKSFA RLNSKDFINN ELTISMHELN NGKFILQITP KQIVLYNNLF RKKITLNEEI KDDEIINSVL RDEFLMIFLA SGDVMIFAIN TYNESYSKLE IPKILDDTII TTGYITNSHL LRAVLKDVNL LLKSGTKRNR SSSVVSNVGT AAEPKNVGPK LKTFVLVTGD NRIVAFNRFH NQRCYQLNHV DKFSDNLHLG FFDPAQNEPD PFIKQVMLNE IGDKDHKEEY LTILTIGGEV LLYKLYFDGE NYEFKKEKDL AITGAPENAY PIGTAVERRL AYFPNLNGYT CIFVTGVTPY LILKSLHSIP RIYQFSKIPA VSISPFHDSK VANGLIFLDN QQNARICQLP LDFNYENTWP MKLIHIGELI RAITYHESSH TYVVSTFKDI DYECFDEEGK PIVGLHKDKP PSSAYKGSIK LISPFNWSVI DTIELADNEL GMTVKSMILD VGSSTKKFKH KKEFIVIGSG KYRMEDLSAN GSFRIYEIID IIPEPDRPET NHKFKEVFKE DTKGAVTSVC EVSGRFLVSQ GQKVIVRDLQ DDGVVPVAFL DTAVYVSEAK SFGNMMILGD SLKSVWLVGF DAEPFRMIML GKDLQGLDVN CADFITKDEE VFILIADNNN VLHLVQYDPE DPTALNGQRL LSKSSFSINS FVTCLKSLPK TEEKYDTGNF QTIGSTIDGS FFSVVPINEA SYRRMYILQQ QLTDKEYHYC GLNPRLNRFG GLSMTANDTN TKPILDYDVI RAYGKLNEER KKNLASKVSA KNIYQDIWKD IIEFENALKG L
|
| |