Gene PICST_52034 details

Gene Information

Locus tag: PICST_52034
Symbol: CFT1
ID: 4851436
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 1810109
End bp: 1814164
Gene Length: 4056 bp
Protein Length: 1341 aa
Translation table:
GC content: 40%
IMG OID: 640393144
Product: pre-mRNA 3'-end processing factor CF II; mRNA cleavage and polyadenylation factor II complex, subunit CFT1 (CPSF subunit); RNA processing and modification
Protein accession: XP_001387581
Protein GI: 126274564
COG category: [A] RNA processing and modification
COG ID: [COG5161] Pre-mRNA cleavage and polyadenylation specificity factor
TIGRFAM ID:
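
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon. All variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 1810109
end_bp = 1814164
protein_length_aa = 1341

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the coding sequence is the protein's codons plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that the coding sequence does not account for.
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp)  # 4056, matching the "Gene Length" field
print(coding_bp)       # 4026
print(non_coding_bp)   # 30
```

Under these assumptions, the 30 bp left over would be consistent with the record's "Is gene spliced: Yes" flag.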


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.164795
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGTTT ATCATGAATT CATCGATCCG TCTCGAGTTT CCCACTCTGT AGGCTGCAAC 
TTCATTTCAT CCACCGTTAA ACATTTAGTT GTTGGAAAAG CCACGCTTCT CCAGATCTTT
GAAGTTGTAC AGTTGAAGCT GCTGACGCCG TCAAAGCCTC AGCATCGCTT GAAATTAATA
GACCAGTTCA AACTCCATGG ATTAATCACA GATATAAAGC CCATAAGAAC TGTAGAATCA
CCCAATTTTG ACTACTTGTT GGTTTCGACG AAGTCCGCAA AGTTTTCTGT CATCAAATGG
GACCATCATC TCCATACGAT TCTGACAGTA TCTTTGCATT ATTATGAGAA CGCGATCCAG
AACTCTACCT ATGAGAAGTT GTCGAAGTCT GAACTCTTGT TAGAACCTTA TGGAAGCTGT
AGTTGCTTGC GTTTCAAGAA TTTGCTCTGT TTTTTGCCGT TTGAAACAGC AGAGGAGCTC
GACGATGACG ACGCAGACTC TGAAAACGAA GATATGGTCA AATCAGAGAA GAAGGAACAC
GAAAATGGTA CAGTTAACGT TCCAGTTACA GATCAGCCAG GTAGTTTTTT CGATACCAGC
TTCTTAATAG ATGGCCAGAG TCTCGACTCG TCTATTGGTA GCATTATAGA CATGCAATTT
TTGTTCAAAT ATAGAGAGCC AACTTTTGGT ATACTATCCC AGCGACAACA GGCTTGGGCT
GGGAACTTAC CAAAGATTAA GGATAACGTC CAGTTTTGCA TTTTGACCTT GGATTTGACT
ACTAAGTCTA CTGTTTCAGT TTTGAAGATT GACAATCTTC CGTACGATGT TGACAGAATT
GTTCCCTTGC CTTCTCCCTT GAACGGGTGT TTGCTTTTGG GCTGCAACGA AATAATCCAT
GTAGACAACG GTGGGATTGT AAGAAGAATA GCGGTAAACC AGTTCACCTC TCTCATCACA
GCTTCCACCA AAGCGTACCA GGATCAAACC CACTTGAATC TCAAGTTGGA AGACTGTAGT
GTGGTAGCTT TGCCAAACGA TCACCGTGCT CTATTGGTAT TATCCACGGG TGAATTCTAC
TACTTAAACT TTGAAGTAGA CGGTAAGTCC ATCAAGAAAT TCACCATAGA AAGTGTGGAC
AAATTATTGT ACAGCGACAT AAAATTAACT TTTCCCGGTC AAATAGCGAC CCTCGACAAC
AACTTGCTAT TCTTTGCCAA CCATAATGGT AACAGTCCTT TGGTACAATT CAAATATCAA
GATGAAGCAC TTAATCCTAA GAAACTCGCT AAAATTGTAG ACGAAGATAG TAAGAATGAA
GATGAAGATG AAGACGAAGA CGACTTATAC AAGGACGAAG AAGAGGAAGT GCAAGTAGTT
TTGGGCAATT CTGTCATTGA GTTTGTAAAA CATGACGAGT TGGTGAATAC TGGAGTAGTT
TCTAGTTTTT CTTTAGGATA CTATTCCACT GAAAAGTTCA AATTCAATTT GAAGAATCCT
AATTGCAAAG AAGTGTCGAT TATTGCCAAT GCTGGCACAC ATTCAGAGAC TAAGTTGAAC
ATCATAACAC CTTCCATTCA ACCCACTATC TCTTCAACGT TGAGTTTTTC GCAGGTCAAC
AGAATGTGGA ATTTGAACCA GAAGTATTTG ATTACGTCAG ACGATATTAA CTTCAAGTCC
GAAATCTTTC AAATCGAAAA GTCGTTTGCA CGTTTGAATT CAAAAGATTT CATTAACAAC
GAATTGACCA TTTCGATGCA TGAGTTGAAC AATGGTAAGT TCATCTTGCA AATCACTCCT
AAGCAAATCG TCTTGTACAA CAACTTGTTC AGAAAGAAAA TCACCTTGAA CGAAGAAATC
AAGGACGATG AAATCATTAA CAGTGTCTTG AGAGACGAGT TCTTGATGAT TTTCCTTGCC
AGTGGTGATG TAATGATCTT TGCTATTAAT ACCTACAATG AATCGTACTC AAAGTTGGAG
ATTCCCAAGA TCTTGGATGA TACGATTATC ACTACCGGAT ACATCACCAA TTCTCATTTG
TTAAGGGCTG TTCTGAAAGA TGTAAATTTA CTCTTGAAGA GCGGAACTAA GAGAAATAGA
TCTTCCTCCG TCGTTTCCAA TGTTGGAACT GCTGCAGAAC CTAAGAATGT TGGCCCTAAA
CTGAAGACAT TTGTCTTAGT GACAGGTGAT AATAGAATTG TAGCCTTCAA CAGATTTCAT
AACCAGAGAT GTTATCAGTT GAATCACGTA GACAAGTTCT CTGACAATTT GCATCTAGGA
TTTTTTGACC CTGCTCAAAA CGAACCAGAT CCTTTTATTA AACAAGTAAT GCTCAACGAA
ATAGGTGACA AGGATCACAA AGAAGAATAT TTGACTATAT TGACGATTGG GGGTGAAGTT
CTTCTTTACA AGTTGTACTT TGATGGAGAA AACTACGAGT TCAAGAAGGA GAAAGATTTA
GCTATCACTG GTGCTCCAGA AAACGCATAT CCTATAGGTA CGGCCGTTGA AAGAAGATTG
GCATATTTCC CTAATTTGAA TGGATACACT TGTATATTCG TTACTGGTGT TACTCCCTAT
TTGATTCTTA AGAGTCTTCA TTCCATTCCA AGAATTTACC AGTTCTCGAA AATACCAGCT
GTTTCTATTT CTCCTTTCCA CGATTCGAAA GTAGCAAACG GGTTGATTTT CTTGGACAAT
CAGCAGAATG CGAGAATCTG CCAGCTTCCA CTTGACTTCA ATTATGAAAA CACATGGCCC
ATGAAGTTGA TCCATATCGG AGAGCTGATT CGTGCAATCA CATACCACGA GTCATCTCAC
ACATATGTTG TTTCCACCTT CAAGGATATT GACTACGAGT GTTTTGATGA AGAAGGAAAG
CCAATAGTAG GGCTTCATAA GGACAAACCA CCTTCTTCTG CTTATAAAGG CTCCATCAAA
TTGATTTCTC CTTTTAATTG GTCTGTCATC GATACGATAG AATTGGCTGA TAACGAGTTA
GGCATGACTG TAAAGTCGAT GATTCTCGAT GTAGGCTCAT CTACCAAGAA GTTCAAACAC
AAGAAGGAGT TCATTGTGAT CGGATCTGGT AAATACAGAA TGGAAGATTT GTCAGCCAAT
GGGTCGTTCA GAATCTACGA AATTATCGAC ATTATTCCTG AGCCAGATAG ACCTGAAACA
AACCACAAGT TTAAAGAGGT TTTCAAAGAA GATACCAAGG GTGCTGTCAC TTCTGTATGT
GAAGTCAGTG GCAGATTCCT AGTATCACAA GGTCAGAAGG TCATAGTAAG AGATTTACAG
GACGATGGGG TGGTCCCTGT AGCCTTTTTG GATACGGCAG TGTATGTTTC TGAGGCCAAA
AGTTTTGGTA ACATGATGAT CTTGGGTGAC TCGTTGAAGA GTGTTTGGTT GGTAGGATTC
GACGCTGAAC CATTCAGAAT GATCATGTTA GGAAAGGACT TACAAGGACT AGACGTAAAC
TGTGCAGACT TTATTACTAA GGATGAGGAG GTGTTTATCT TGATTGCTGA TAACAATAAT
GTCTTGCATT TAGTTCAGTA TGACCCTGAA GATCCTACTG CATTAAATGG CCAGAGATTA
CTTTCCAAGT CTTCTTTTTC CATCAACTCA TTCGTGACGT GCCTTAAATC TTTGCCCAAG
ACTGAAGAGA AATACGACAC TGGCAGTGGA CAGAAGACCT CGTCGGTTAT AGGAGACTTC
CAGACGATTG GTTCGACCAT TGATGGTTCT TTCTTTAGTG TTGTCCCCAT AAACGAAGCC
AGCTACAGAA GAATGTACAT ATTGCAACAG CAGTTGACCG ACAAGGAGTA CCATTACTGT
GGTTTAAATC CTCGTTTGAA CCGTTTCGGT GGATTATCGA TGACAGCAAA CGACACCAAC
ACTAAGCCTA TTCTTGACTA CGATGTGATT AGAGCCTACG GCAAGTTGAA CGAAGAAAGA
AAGAAAAACT TGGCTAGTAA AGTAAGTGCA AAGAATATTT ACCAGGATAT CTGGAAGGAT
ATCATAGAGT TCGAGAATGC GTTGAAGGGT TTGTAG
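
The gene sequence above is laid out in space-separated blocks of 10 bases. As a hedged illustration of how such a listing might be cleaned up and how a GC fraction could be computed from it, here is a small sketch. The helper names `clean_sequence` and `gc_content` are hypothetical, and the record does not state how its "GC content" field was derived; this assumes the simple definition (G + C) / total bases.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Return the fraction of G and C bases (assumed definition: (G + C) / length)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return gc / len(seq)


# Example input: the first line of the gene sequence listing above.
first_line = "ATGGATGTTT ATCATGAATT CATCGATCCG TCTCGAGTTT CCCACTCTGT AGGCTGCAAC"

print(len(clean_sequence(first_line)))  # number of bases after removing the block spacing
print(round(gc_content(first_line), 3))  # GC fraction of this one line only
```

To compare against the record's reported GC content, the whole listing (all lines) would be passed to `gc_content` rather than a single line.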
 
Protein sequence
MDVYHEFIDP SRVSHSVGCN FISSTVKHLV VGKATLLQIF EVVQLKLLTP SKPQHRLKLI 
DQFKLHGLIT DIKPIRTVES PNFDYLLVST KSAKFSVIKW DHHLHTILTV SLHYYENAIQ
NSTYEKLSKS ELLLEPYGSC SCLRFKNLLC FLPFETAEEL DDDDADSENE DMVKSEKKEH
ENGTVNVPVT DQPGSFFDTS FLIDGQSLDS SIGSIIDMQF LFKYREPTFG ILSQRQQAWA
GNLPKIKDNV QFCILTLDLT TKSTVSVLKI DNLPYDVDRI VPLPSPLNGC LLLGCNEIIH
VDNGGIVRRI AVNQFTSLIT ASTKAYQDQT HLNLKLEDCS VVALPNDHRA LLVLSTGEFY
YLNFEVDGKS IKKFTIESVD KLLYSDIKLT FPGQIATLDN NLLFFANHNG NSPLVQFKYQ
DEALNPKKLA KIVDEDSKNE DEDEDEDDLY KDEEEEVQVV LGNSVIEFVK HDELVNTGVV
SSFSLGYYST EKFKFNLKNP NCKEVSIIAN AGTHSETKLN IITPSIQPTI SSTLSFSQVN
RMWNLNQKYL ITSDDINFKS EIFQIEKSFA RLNSKDFINN ELTISMHELN NGKFILQITP
KQIVLYNNLF RKKITLNEEI KDDEIINSVL RDEFLMIFLA SGDVMIFAIN TYNESYSKLE
IPKILDDTII TTGYITNSHL LRAVLKDVNL LLKSGTKRNR SSSVVSNVGT AAEPKNVGPK
LKTFVLVTGD NRIVAFNRFH NQRCYQLNHV DKFSDNLHLG FFDPAQNEPD PFIKQVMLNE
IGDKDHKEEY LTILTIGGEV LLYKLYFDGE NYEFKKEKDL AITGAPENAY PIGTAVERRL
AYFPNLNGYT CIFVTGVTPY LILKSLHSIP RIYQFSKIPA VSISPFHDSK VANGLIFLDN
QQNARICQLP LDFNYENTWP MKLIHIGELI RAITYHESSH TYVVSTFKDI DYECFDEEGK
PIVGLHKDKP PSSAYKGSIK LISPFNWSVI DTIELADNEL GMTVKSMILD VGSSTKKFKH
KKEFIVIGSG KYRMEDLSAN GSFRIYEIID IIPEPDRPET NHKFKEVFKE DTKGAVTSVC
EVSGRFLVSQ GQKVIVRDLQ DDGVVPVAFL DTAVYVSEAK SFGNMMILGD SLKSVWLVGF
DAEPFRMIML GKDLQGLDVN CADFITKDEE VFILIADNNN VLHLVQYDPE DPTALNGQRL
LSKSSFSINS FVTCLKSLPK TEEKYDTGNF QTIGSTIDGS FFSVVPINEA SYRRMYILQQ
QLTDKEYHYC GLNPRLNRFG GLSMTANDTN TKPILDYDVI RAYGKLNEER KKNLASKVSA
KNIYQDIWKD IIEFENALKG L