Gene PICST_75374 details

Gene Information

Locus tag: PICST_75374
Symbol: RPO21
ID: 4852036
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 3464674
End bp: 3469963
Gene Length: 5290 bp
Protein Length: 1739 aa
Translation table:
GC content: 44%
IMG OID: 640393744
Product: DNA-directed RNA polymerase II largest subunit (RNA polymerase II subunit 1) (B220)
Protein accession: XP_001386996
Protein GI: 126276379
COG category: [K] Transcription
COG ID: [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit
TIGRFAM ID:
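
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (the record does not state its convention), and the helper names `span_length` and `cds_length` are hypothetical, introduced only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the Gene Information fields.
start_bp, end_bp = 3464674, 3469963
protein_aa = 1739

gene_len = span_length(start_bp, end_bp)
coding_len = cds_length(protein_aa)
extra = gene_len - coding_len

print(gene_len)    # 5290, matching the Gene Length field
print(coding_len)  # 5220
print(extra)       # 70
```

Under these assumptions the listed span is 70 bp longer than the coding region plus its stop codon, which appears to correspond to the bases that follow the TAG stop codon in the gene sequence.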


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCGCC AGTTCCCTTA CTCAAGCGCT CCACTCCGTT CCGTCAAGGA AGTTCAGTTC 
GGCTTGTTGT CGCCAGAAGA AGTGCGTGCC ATTTCCGTAG CCAAGATTGA ATACCCCGAG
ACCATGGACC AGACCACCAA GACTCCTAGA GAAGGTGGTT TGAACGATCC TCGTTTAGGT
TCAATCGACA GAAACTTTAG ATGTCAGACC TGTGGCGAGG ATATGGCCGA ATGTCCGGGC
CATTTCGGTC ACATTGAGTT GGCCAAGCCG GTGTTCCACA TTGGGTTCAT CGCCAAAATC
AAGAAGGTAT GTGAGTGTGT ATGTATGCAC TGCGGAAAGT TGTTGCTCGA TGAAAACAAC
CCCGCCATGG CTCAAGCTAT CAAAATAAGA GACCCCAAGA AGAGGTTCAA CGCCGTCTGG
CAGTTGTGTA AGGCTAAAAT GGTTTGTGAA ACCGATATCA TAGAAGAGGG AGCTACTGAA
ACCACAACCA GAGGTGGTTG TGGTCATACT CAGCCCACCA TTCGTAGAGA TGGGTTGAAG
TTATGGGGTA CCTGGAGACA TAACAAGAAT TTTGAAGAAA ACGAACAGCC TGAACGTAGA
TTGTTGACTC CTTCAGAGAT CTTGAATGTT CTTAAACATA TAAGCTCCTT GGACTGTTTG
AGACTTGGCT TCAACGAGGA CTATGCCAGA CCAGAGTGGA TGTTGATCAC AGTTTTGCCT
GTGCCTCCTC CACCTGTCAG ACCTTCGATT GCTTTCAACG ACACTGCCAG GGGGGAAGAT
GATTTAACTT TCAAGTTGGC CGATGTTTTG AAAGCTAACA TCAATGTGCA AAGGTTGGAA
ATGGACGGCT CTCCTCAGCA TGTGATCTCT GAGTTCGAGG CTTTGTTGCA ATTCCACGTA
GCCACATATA TGGATAACGA TATTGCTGGC CAGCCCCAGG CTCTCCAGAA GACAGGTCGT
CCTATCAAAT CTATTAGAGC AAGATTAAAG GGTAAGGAAG GTAGATTGAG AGGTAACTTG
ATGGGTAAGC GTGTCGACTT TTCCGCTCGT ACCGTTATTT CGGGTGATCC AAATCTTGAT
TTAGACCAGG TTGGAGTACC AATTTCCATC GCTCGAACAT TGTCGTACCC TGAGGTTGTC
ACCCCATACA ATATTCATAG ATTGACTGAG TACGTCAGAA ACGGTCCAAA CGAACATCCT
GGTGCCAAGT ACGTTATCAG AGACACCGGT GATCGTATCG ACTTGCGTTA TAACAAGAGA
GCTGGTGACA TTGCATTACA ATACGGCTGG AAGGTTGAAC GTCACTTGAT GGACAATGAC
CCTGTGTTGT TCAACCGTCA ACCCTCCTTG CACAAGATGT CCATGATGGC CCACAGAGTT
AAAGTTATGC CATATTCCAC ATTTAGATTG AATTTGTCAG TCACATCGCC ATATAACGCC
GATTTTGATG GTGATGAAAT GAACTTGCAT GTACCTCAAT CTCCAGAGAC TAGAGCTGAA
TTGTCGGAAA TTTGCGCTGT ACCTCTTCAA ATTGTTTCTC CACAATCGAA TAAGCCAGTT
ATGGGTATTG TGCAAGATAC GTTGTGTGGT ATTCGTAAGA TGACCTTGCG TGACAACTTC
ATTGATTACG ACCAAGTTAT GAATATGTTA TACTGGATTC CGAACTGGGA CGGTGTGATT
CCTCCTCCAG CTATTGCTAA GCCCAAACCA TTGTGGACCG GTAAGCAGTT GTTGTCTATG
GCCATTCCAA AGGGTATACA CTTGCAAAGA TTTGATGGGG GAAAAGATTT GTTAAGTCCC
AAGGACACCG GTATGTTGAT TGTAGATGGT GAGATCATGT TTGGAGTTGT CGATAAGAAG
ACGGTTGGTG CCACAGGTGG TGGTTTAATT CACACTGTTA TGAGAGAAAA AGGCCCTCGT
GTGTGTGCTC AGCTTTTCAG CTCTATTCAA AAGGTGACCA ACTACTGGTT ATTACATAAT
GGTTTCTCTA TCGGTATTGG TGATACCATT GCTGATGTAA GTACCATGAA AGATATCACT
TCAACCATTA GTGAGGCCAA AATCAAGGTG CAGGAAATCA TCTTGGACGC ACAACTGAAC
AAGTTGGAAC CTGAACCAGG TATGACATTA AGAGAATCGT TCGAACATAA CGTTTCTCGT
GTCCTTAATC AAGCTCGTGA TACCGCTGGT CGTTCCGCAG AAATGAATTT AAAGGACTTA
AATAACGTCA AGCAGATGGT GGTTTCAGGT TCCAAGGGTT CTTTCATTAA TATCTCGCAA
ATGTCTGCAT GTGTGGGTCA ACAGATTGTA GAAGGGAAGA GAATTCCCTT TGGTTTTTCT
GATCGTACCT TGCCTCATTT CACAAAGGAT GATTACTCTC CCGAGTCGAA GGGTTTTGTT
GAAAATTCTT ACTTGAGAGG GTTGACTCCT CAAGAATTCT TCTTCCACGC CATGGCTGGT
AGAGAAGGTC TTATCGATAC TGCCGTCAAA ACTGCCGAAA CTGGTTATAT TCAACGTCGT
TTGGTGAAAG CTTTGGAGGA TATCATGGTC CATTACGATG GTACAACCAG AAACTCCTTG
GGTGATATCA TTCAATTCGT CTATGGTGAA GACGGTATCG ATGGTACCCA AGTTGAGAAG
CAATCTGTTG ATACTATTCC AGGATCCAAC GATAGTTTCG AACGTCGTTT CAGAATCGAC
GTCTTGGACT CTTCAAAATC CATTCCAGAA TCATTGTTAG AATCCGGCAA GGAAATTAAG
GGTGATGTCA AGTTACAGAA GGTTTTGGAC GAAGAGTACA AACAGCTCTT GGATGACCGT
AAGTACTTAA GAGAAGTCTG TTTCCCTAAT GGTGACTTCT CATGGCCATT ACCAGTGAAT
TTGCGTCGTA TTATTCAGAA TGCTCAACAG ATTTTCCACA ATGGTCGTTA CAAGGCTTCT
GATTTGAGAT TGGATGAAGT TATCGTTGGC GTTAGATCTC TTTGTGAAAA ATTGCTTGTT
GTTCGTGGTG ATACTGAATT AGTCAAAGAA GCCCAAGCAA ATGCCACATT ATTATTCCAA
TGTTTGGTTA GATCTAGATT GGCATCACGT AGAGTGATTG AAGAATTCAA GTTGAACAGA
TCTTCATTTG AGTGGGTTGT GGGTGAAATT GAAACTCAAT TCCAGAAGTC CATTGTTCAC
CCTGGTGAAA TGGTTGGTGT GATCGCAGCA CAGTCCATCG GTGAACCAGC CACCCAAATG
ACCTTGAACA CTTTCCATTA TGCCGGTGTG TCTTCTAAGA ACGTTACTTT GGGTGTTCCT
CGTCTTAAGG AAATTCTTAA TGTTGCTAAG AACATTAAAA CTCCTGCATT GACTGTGTAC
TTAGATCCAG CGTTATCTGA CGATATTGAA AAGGCCAAGG TTGTCCAATC TGCGATCGAG
CACACAAGTT TGAAAAATGT GACTTCATCC ACAGAAATCT ACTATGATCC AGATCCAAGA
ACAACTGTAA TCGAAGAAGA TTACGATACC GTGGAAGCCT ACTTCTCGAT TCCTGATGAA
AAGGTGGAAG AATCTATTGA AAAGCAATCT CCTTGGTTAC TTCGTTTAGA ATTAGATCGT
GCCAAGATGT TGGATAAACA ATTGACCATG GCTCAAGTTG CTGAAAAGAT TTCCCAGAAC
TTTGGTGAAG ATTTGTTCGT CATATGGTCT GATGATACTG CTGATAAGTT GATCATTCGT
TGTCGTGTCG TTAGAGATCC AAAGTCTCTT GATGAGGAAG CAGATGCTGA AGAAGATCAG
ATATTGAAGC GTATCGAAGC TCACATGTTG GAATCAATTT CTTTACGTGG TATCCCAGGT
ATCACAAGAG TTTTCATGAT GCAACATAAG GTCAACACCC CTGATGCCAC TGGTGAATTT
AAACAAGGAA AGGAATGGGT ATTGGAAACT GATGGTGTCA ATTTGGCTGA TGTCATGGCA
GTTCCAGGTG TTGACTCTAG TCGTACCTAC TCCAATAACT TCATTGAAAT CTTGTCTGTT
CTTGGTATTG AAGCCACACG TGCAGCCTTG TTTAAGGAAA TTCTTAATGT GCTTTCATTC
GATGGTTCTT ATGTGAACTA TCGTCATATG GCTCTTTTGG TTGATGTTAT GACTTCTCGT
GGTCACTTGA TGGCCATTAC ACGTCACGGT ATTAACAGAT CGGACACCGG TGCTTTGATG
CGTTGTTCTT TCGAAGAGAC TGTTGAAATA TTGTTGGAAG CTGGTGCATC TGCCGAGTTA
GATGATTGTC GTGGTATTTC TGAAAACGTA ATGTTAGGCC AAATGGCTCC ATTAGGTACT
GGTGCTTTCG ATGTTATGCT TGACGACAAG ATGTTGCAAA CTGCTCCTTC AAATGTTGCA
GTTGCTGCTG GAAATGACGA ATTTGCTGAC GATGGAGGTG CCACTCCATA CAGAGAGTAC
GACATGGAAG ATGACAAGAT TCAATTTGAA GAAGGAGCAG GTTTCTCGCC GATTCACACT
GCACAAGTTC AAGATGTTTC TGGAGGACTT ACTTCTTATG GAGGACAGCC AACTTCTCCT
TCTGCTACCT CGCCATTTAG TTATGGAAGT ACTTCTCCAT CATTTGGAGG TTCAGTGTCG
CCAGGTTACG GAGGAACTTC CCCAAGTTAT TCTCCAACAT CGCCAAGCTA CTCACCTACT
TCTCCAAGCT ACTCACCTAC ATCTCCAAGT TACTCTCCTA CGTCACCAGC ATATTCACCT
ACTTCCCCAT CTTACTCACC TACTTCCCCA AGCTACTCAC CTACTTCCCC AAGCTACTCA
CCTACTTCCC CAAGCTACTC ACCTACTTCC CCAAGTTACT CTCCAACTTC GCCAAGTTAC
TCACCTACTT CTCCAAGTTA TTCGCCTACT TCTCCTTCTT ATTCTCCTAC TTCTCCTTCT
TACTCTCCTA CATCTCCTTC TTACTCACCT ACTTCTCCAA GCTACTCACC TACTTCCCCA
AGTTACTCGC CAACGAGTCC CTCATACTCT CCTACTTCGC CTAGTTACTC TCCAACTTCG
CCAAGTTATT CACCAACTTC ACCACAATAC TCACCAACCT CTCCTCAGTA CTCTCCAACC
TCACCACTGT ACTCGCCAAC CTCACCACAA TACTCCCCTA CTTCGCCACA GTATTCACCA
GGATCTCCTG AATATTCGCC TAATTCGCCA AAGACGGAGG ATAAGAAGAA CGAAGACTAG
AGATATCAGA CATGTATTTG ATCACATACT TAGCTTATTT GTAATTATAT AACGTAATAA
AGATTCTTTG
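
The gene sequence is printed in blocks of ten bases, so it needs to be joined before any analysis. The following is a minimal sketch of that step together with a GC-fraction calculation. The function names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first two lines of the sequence, so its result applies to that excerpt and not to the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string without whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among all characters of seq (0.0 for empty input)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two lines of the gene sequence, used as example input.
example = """
ATGAGTCGCC AGTTCCCTTA CTCAAGCGCT CCACTCCGTT CCGTCAAGGA AGTTCAGTTC
GGCTTGTTGT CGCCAGAAGA AGTGCGTGCC ATTTCCGTAG CCAAGATTGA ATACCCCGAG
"""

seq = clean_sequence(example)
print(len(seq))                    # 120
print(f"{gc_fraction(seq):.0%}")   # GC share of this excerpt only
```

Passing the full sequence text through the same two functions should allow a comparison with the GC content field of the record.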
 
Protein sequence
MSRQFPYSSA PLRSVKEVQF GLLSPEEVRA ISVAKIEYPE TMDQTTKTPR EGGLNDPRLG 
SIDRNFRCQT CGEDMAECPG HFGHIELAKP VFHIGFIAKI KKVCECVCMH CGKLLLDENN
PAMAQAIKIR DPKKRFNAVW QLCKAKMVCE TDIIEEGATE TTTRGGCGHT QPTIRRDGLK
LWGTWRHNKN FEENEQPERR LLTPSEILNV LKHISSLDCL RLGFNEDYAR PEWMLITVLP
VPPPPVRPSI AFNDTARGED DLTFKLADVL KANINVQRLE MDGSPQHVIS EFEALLQFHV
ATYMDNDIAG QPQALQKTGR PIKSIRARLK GKEGRLRGNL MGKRVDFSAR TVISGDPNLD
LDQVGVPISI ARTLSYPEVV TPYNIHRLTE YVRNGPNEHP GAKYVIRDTG DRIDLRYNKR
AGDIALQYGW KVERHLMDND PVLFNRQPSL HKMSMMAHRV KVMPYSTFRL NLSVTSPYNA
DFDGDEMNLH VPQSPETRAE LSEICAVPLQ IVSPQSNKPV MGIVQDTLCG IRKMTLRDNF
IDYDQVMNML YWIPNWDGVI PPPAIAKPKP LWTGKQLLSM AIPKGIHLQR FDGGKDLLSP
KDTGMLIVDG EIMFGVVDKK TVGATGGGLI HTVMREKGPR VCAQLFSSIQ KVTNYWLLHN
GFSIGIGDTI ADVSTMKDIT STISEAKIKV QEIILDAQLN KLEPEPGMTL RESFEHNVSR
VLNQARDTAG RSAEMNLKDL NNVKQMVVSG SKGSFINISQ MSACVGQQIV EGKRIPFGFS
DRTLPHFTKD DYSPESKGFV ENSYLRGLTP QEFFFHAMAG REGLIDTAVK TAETGYIQRR
LVKALEDIMV HYDGTTRNSL GDIIQFVYGE DGIDGTQVEK QSVDTIPGSN DSFERRFRID
VLDSSKSIPE SLLESGKEIK GDVKLQKVLD EEYKQLLDDR KYLREVCFPN GDFSWPLPVN
LRRIIQNAQQ IFHNGRYKAS DLRLDEVIVG VRSLCEKLLV VRGDTELVKE AQANATLLFQ
CLVRSRLASR RVIEEFKLNR SSFEWVVGEI ETQFQKSIVH PGEMVGVIAA QSIGEPATQM
TLNTFHYAGV SSKNVTLGVP RLKEILNVAK NIKTPALTVY LDPALSDDIE KAKVVQSAIE
HTSLKNVTSS TEIYYDPDPR TTVIEEDYDT VEAYFSIPDE KVEESIEKQS PWLLRLELDR
AKMLDKQLTM AQVAEKISQN FGEDLFVIWS DDTADKLIIR CRVVRDPKSL DEEADAEEDQ
ILKRIEAHML ESISLRGIPG ITRVFMMQHK VNTPDATGEF KQGKEWVLET DGVNLADVMA
VPGVDSSRTY SNNFIEILSV LGIEATRAAL FKEILNVLSF DGSYVNYRHM ALLVDVMTSR
GHLMAITRHG INRSDTGALM RCSFEETVEI LLEAGASAEL DDCRGISENV MLGQMAPLGT
GAFDVMLDDK MLQTAPSNVA VAAGNDEFAD DGGATPYREY DMEDDKIQFE EGAGFSPIHT
AQVQDVSGGL TSYGGQPTSP SATSPFSYGS TSPSFGGSVS PGYGGTSPSY SPTSPSYSPT
SPSYSPTSPS YSPTSPAYSP TSPSYSPTSP SYSPTSPSYS PTSPSYSPTS PSYSPTSPSY
SPTSPSYSPT SPSYSPTSPS YSPTSPSYSP TSPSYSPTSP SYSPTSPSYS PTSPSYSPTS
PSYSPTSPQY SPTSPQYSPT SPLYSPTSPQ YSPTSPQYSP GSPEYSPNSP KTEDKKNED
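
The C-terminal end of the protein sequence is visibly repetitive. As a hedged illustration, the sketch below counts exact copies of the heptad YSPTSPS, the consensus repeat generally described for the C-terminal domain of the RNA polymerase II largest subunit. The motif is background knowledge and not a field of this record, and `count_motif` is a hypothetical helper name.

```python
import re

# Consensus heptad; an assumption from general background, not stated in the record.
HEPTAD = "YSPTSPS"


def count_motif(protein: str, motif: str = HEPTAD) -> int:
    """Count non-overlapping exact occurrences of motif in a protein sequence."""
    cleaned = "".join(protein.split()).upper()
    return len(re.findall(re.escape(motif), cleaned))


# One 60-residue line copied from the repetitive region of the protein sequence.
tail_line = "SPSYSPTSPS YSPTSPAYSP TSPSYSPTSP SYSPTSPSYS PTSPSYSPTS PSYSPTSPSY"
print(count_motif(tail_line))  # 7
```

Only exact matches are counted, so variant repeats such as YSPTSPA or YSPTSPQ are skipped. A looser regular expression would be needed to include them.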