Gene Information |
Locus tag | PICST_75374 |
Symbol | RPO21 |
ID | 4852036 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 3464674 |
End bp | 3469963 |
Gene Length | 5290 bp |
Protein Length | 1739 aa |
Translation table | |
GC content | 44% |
IMG OID | 640393744 |
Product | DNA-directed RNA polymerase II largest subunit (RNA polymerase II subunit 1) (B220) |
Protein accession | XP_001386996 |
Protein GI | 126276379 |
COG category | [K] Transcription |
COG ID | [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit |
TIGRFAM ID | |
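The length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates (the listed Gene Length is consistent with that reading) and that the unspliced coding region is one codon per residue plus a stop codon. The function names `gene_length` and `coding_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Bases in an unspliced CDS: three per residue, plus one stop codon."""
    return 3 * (protein_aa + 1)


# Values copied from the record above.
length = gene_length(3464674, 3469963)
cds = coding_length(1739)

print(length)        # 5290, matching the Gene Length field
print(cds)           # 5220
print(length - cds)  # 70 bases not accounted for by the coding region
```

Under these assumptions, the 70 remaining bases correspond to the stretch of the listed gene sequence that follows the TAG stop codon.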
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGTCGCC AGTTCCCTTA CTCAAGCGCT CCACTCCGTT CCGTCAAGGA AGTTCAGTTC GGCTTGTTGT CGCCAGAAGA AGTGCGTGCC ATTTCCGTAG CCAAGATTGA ATACCCCGAG ACCATGGACC AGACCACCAA GACTCCTAGA GAAGGTGGTT TGAACGATCC TCGTTTAGGT TCAATCGACA GAAACTTTAG ATGTCAGACC TGTGGCGAGG ATATGGCCGA ATGTCCGGGC CATTTCGGTC ACATTGAGTT GGCCAAGCCG GTGTTCCACA TTGGGTTCAT CGCCAAAATC AAGAAGGTAT GTGAGTGTGT ATGTATGCAC TGCGGAAAGT TGTTGCTCGA TGAAAACAAC CCCGCCATGG CTCAAGCTAT CAAAATAAGA GACCCCAAGA AGAGGTTCAA CGCCGTCTGG CAGTTGTGTA AGGCTAAAAT GGTTTGTGAA ACCGATATCA TAGAAGAGGG AGCTACTGAA ACCACAACCA GAGGTGGTTG TGGTCATACT CAGCCCACCA TTCGTAGAGA TGGGTTGAAG TTATGGGGTA CCTGGAGACA TAACAAGAAT TTTGAAGAAA ACGAACAGCC TGAACGTAGA TTGTTGACTC CTTCAGAGAT CTTGAATGTT CTTAAACATA TAAGCTCCTT GGACTGTTTG AGACTTGGCT TCAACGAGGA CTATGCCAGA CCAGAGTGGA TGTTGATCAC AGTTTTGCCT GTGCCTCCTC CACCTGTCAG ACCTTCGATT GCTTTCAACG ACACTGCCAG GGGGGAAGAT GATTTAACTT TCAAGTTGGC CGATGTTTTG AAAGCTAACA TCAATGTGCA AAGGTTGGAA ATGGACGGCT CTCCTCAGCA TGTGATCTCT GAGTTCGAGG CTTTGTTGCA ATTCCACGTA GCCACATATA TGGATAACGA TATTGCTGGC CAGCCCCAGG CTCTCCAGAA GACAGGTCGT CCTATCAAAT CTATTAGAGC AAGATTAAAG GGTAAGGAAG GTAGATTGAG AGGTAACTTG ATGGGTAAGC GTGTCGACTT TTCCGCTCGT ACCGTTATTT CGGGTGATCC AAATCTTGAT TTAGACCAGG TTGGAGTACC AATTTCCATC GCTCGAACAT TGTCGTACCC TGAGGTTGTC ACCCCATACA ATATTCATAG ATTGACTGAG TACGTCAGAA ACGGTCCAAA CGAACATCCT GGTGCCAAGT ACGTTATCAG AGACACCGGT GATCGTATCG ACTTGCGTTA TAACAAGAGA GCTGGTGACA TTGCATTACA ATACGGCTGG AAGGTTGAAC GTCACTTGAT GGACAATGAC CCTGTGTTGT TCAACCGTCA ACCCTCCTTG CACAAGATGT CCATGATGGC CCACAGAGTT AAAGTTATGC CATATTCCAC ATTTAGATTG AATTTGTCAG TCACATCGCC ATATAACGCC GATTTTGATG GTGATGAAAT GAACTTGCAT GTACCTCAAT CTCCAGAGAC TAGAGCTGAA TTGTCGGAAA TTTGCGCTGT ACCTCTTCAA ATTGTTTCTC CACAATCGAA TAAGCCAGTT ATGGGTATTG TGCAAGATAC GTTGTGTGGT ATTCGTAAGA TGACCTTGCG TGACAACTTC ATTGATTACG ACCAAGTTAT GAATATGTTA TACTGGATTC CGAACTGGGA CGGTGTGATT CCTCCTCCAG CTATTGCTAA GCCCAAACCA TTGTGGACCG GTAAGCAGTT GTTGTCTATG GCCATTCCAA AGGGTATACA CTTGCAAAGA TTTGATGGGG GAAAAGATTT GTTAAGTCCC 
AAGGACACCG GTATGTTGAT TGTAGATGGT GAGATCATGT TTGGAGTTGT CGATAAGAAG ACGGTTGGTG CCACAGGTGG TGGTTTAATT CACACTGTTA TGAGAGAAAA AGGCCCTCGT GTGTGTGCTC AGCTTTTCAG CTCTATTCAA AAGGTGACCA ACTACTGGTT ATTACATAAT GGTTTCTCTA TCGGTATTGG TGATACCATT GCTGATGTAA GTACCATGAA AGATATCACT TCAACCATTA GTGAGGCCAA AATCAAGGTG CAGGAAATCA TCTTGGACGC ACAACTGAAC AAGTTGGAAC CTGAACCAGG TATGACATTA AGAGAATCGT TCGAACATAA CGTTTCTCGT GTCCTTAATC AAGCTCGTGA TACCGCTGGT CGTTCCGCAG AAATGAATTT AAAGGACTTA AATAACGTCA AGCAGATGGT GGTTTCAGGT TCCAAGGGTT CTTTCATTAA TATCTCGCAA ATGTCTGCAT GTGTGGGTCA ACAGATTGTA GAAGGGAAGA GAATTCCCTT TGGTTTTTCT GATCGTACCT TGCCTCATTT CACAAAGGAT GATTACTCTC CCGAGTCGAA GGGTTTTGTT GAAAATTCTT ACTTGAGAGG GTTGACTCCT CAAGAATTCT TCTTCCACGC CATGGCTGGT AGAGAAGGTC TTATCGATAC TGCCGTCAAA ACTGCCGAAA CTGGTTATAT TCAACGTCGT TTGGTGAAAG CTTTGGAGGA TATCATGGTC CATTACGATG GTACAACCAG AAACTCCTTG GGTGATATCA TTCAATTCGT CTATGGTGAA GACGGTATCG ATGGTACCCA AGTTGAGAAG CAATCTGTTG ATACTATTCC AGGATCCAAC GATAGTTTCG AACGTCGTTT CAGAATCGAC GTCTTGGACT CTTCAAAATC CATTCCAGAA TCATTGTTAG AATCCGGCAA GGAAATTAAG GGTGATGTCA AGTTACAGAA GGTTTTGGAC GAAGAGTACA AACAGCTCTT GGATGACCGT AAGTACTTAA GAGAAGTCTG TTTCCCTAAT GGTGACTTCT CATGGCCATT ACCAGTGAAT TTGCGTCGTA TTATTCAGAA TGCTCAACAG ATTTTCCACA ATGGTCGTTA CAAGGCTTCT GATTTGAGAT TGGATGAAGT TATCGTTGGC GTTAGATCTC TTTGTGAAAA ATTGCTTGTT GTTCGTGGTG ATACTGAATT AGTCAAAGAA GCCCAAGCAA ATGCCACATT ATTATTCCAA TGTTTGGTTA GATCTAGATT GGCATCACGT AGAGTGATTG AAGAATTCAA GTTGAACAGA TCTTCATTTG AGTGGGTTGT GGGTGAAATT GAAACTCAAT TCCAGAAGTC CATTGTTCAC CCTGGTGAAA TGGTTGGTGT GATCGCAGCA CAGTCCATCG GTGAACCAGC CACCCAAATG ACCTTGAACA CTTTCCATTA TGCCGGTGTG TCTTCTAAGA ACGTTACTTT GGGTGTTCCT CGTCTTAAGG AAATTCTTAA TGTTGCTAAG AACATTAAAA CTCCTGCATT GACTGTGTAC TTAGATCCAG CGTTATCTGA CGATATTGAA AAGGCCAAGG TTGTCCAATC TGCGATCGAG CACACAAGTT TGAAAAATGT GACTTCATCC ACAGAAATCT ACTATGATCC AGATCCAAGA ACAACTGTAA TCGAAGAAGA TTACGATACC GTGGAAGCCT ACTTCTCGAT TCCTGATGAA AAGGTGGAAG AATCTATTGA AAAGCAATCT CCTTGGTTAC TTCGTTTAGA ATTAGATCGT GCCAAGATGT 
TGGATAAACA ATTGACCATG GCTCAAGTTG CTGAAAAGAT TTCCCAGAAC TTTGGTGAAG ATTTGTTCGT CATATGGTCT GATGATACTG CTGATAAGTT GATCATTCGT TGTCGTGTCG TTAGAGATCC AAAGTCTCTT GATGAGGAAG CAGATGCTGA AGAAGATCAG ATATTGAAGC GTATCGAAGC TCACATGTTG GAATCAATTT CTTTACGTGG TATCCCAGGT ATCACAAGAG TTTTCATGAT GCAACATAAG GTCAACACCC CTGATGCCAC TGGTGAATTT AAACAAGGAA AGGAATGGGT ATTGGAAACT GATGGTGTCA ATTTGGCTGA TGTCATGGCA GTTCCAGGTG TTGACTCTAG TCGTACCTAC TCCAATAACT TCATTGAAAT CTTGTCTGTT CTTGGTATTG AAGCCACACG TGCAGCCTTG TTTAAGGAAA TTCTTAATGT GCTTTCATTC GATGGTTCTT ATGTGAACTA TCGTCATATG GCTCTTTTGG TTGATGTTAT GACTTCTCGT GGTCACTTGA TGGCCATTAC ACGTCACGGT ATTAACAGAT CGGACACCGG TGCTTTGATG CGTTGTTCTT TCGAAGAGAC TGTTGAAATA TTGTTGGAAG CTGGTGCATC TGCCGAGTTA GATGATTGTC GTGGTATTTC TGAAAACGTA ATGTTAGGCC AAATGGCTCC ATTAGGTACT GGTGCTTTCG ATGTTATGCT TGACGACAAG ATGTTGCAAA CTGCTCCTTC AAATGTTGCA GTTGCTGCTG GAAATGACGA ATTTGCTGAC GATGGAGGTG CCACTCCATA CAGAGAGTAC GACATGGAAG ATGACAAGAT TCAATTTGAA GAAGGAGCAG GTTTCTCGCC GATTCACACT GCACAAGTTC AAGATGTTTC TGGAGGACTT ACTTCTTATG GAGGACAGCC AACTTCTCCT TCTGCTACCT CGCCATTTAG TTATGGAAGT ACTTCTCCAT CATTTGGAGG TTCAGTGTCG CCAGGTTACG GAGGAACTTC CCCAAGTTAT TCTCCAACAT CGCCAAGCTA CTCACCTACT TCTCCAAGCT ACTCACCTAC ATCTCCAAGT TACTCTCCTA CGTCACCAGC ATATTCACCT ACTTCCCCAT CTTACTCACC TACTTCCCCA AGCTACTCAC CTACTTCCCC AAGCTACTCA CCTACTTCCC CAAGCTACTC ACCTACTTCC CCAAGTTACT CTCCAACTTC GCCAAGTTAC TCACCTACTT CTCCAAGTTA TTCGCCTACT TCTCCTTCTT ATTCTCCTAC TTCTCCTTCT TACTCTCCTA CATCTCCTTC TTACTCACCT ACTTCTCCAA GCTACTCACC TACTTCCCCA AGTTACTCGC CAACGAGTCC CTCATACTCT CCTACTTCGC CTAGTTACTC TCCAACTTCG CCAAGTTATT CACCAACTTC ACCACAATAC TCACCAACCT CTCCTCAGTA CTCTCCAACC TCACCACTGT ACTCGCCAAC CTCACCACAA TACTCCCCTA CTTCGCCACA GTATTCACCA GGATCTCCTG AATATTCGCC TAATTCGCCA AAGACGGAGG ATAAGAAGAA CGAAGACTAG AGATATCAGA CATGTATTTG ATCACATACT TAGCTTATTT GTAATTATAT AACGTAATAA AGATTCTTTG
Protein sequence | MSRQFPYSSA PLRSVKEVQF GLLSPEEVRA ISVAKIEYPE TMDQTTKTPR EGGLNDPRLG SIDRNFRCQT CGEDMAECPG HFGHIELAKP VFHIGFIAKI KKVCECVCMH CGKLLLDENN PAMAQAIKIR DPKKRFNAVW QLCKAKMVCE TDIIEEGATE TTTRGGCGHT QPTIRRDGLK LWGTWRHNKN FEENEQPERR LLTPSEILNV LKHISSLDCL RLGFNEDYAR PEWMLITVLP VPPPPVRPSI AFNDTARGED DLTFKLADVL KANINVQRLE MDGSPQHVIS EFEALLQFHV ATYMDNDIAG QPQALQKTGR PIKSIRARLK GKEGRLRGNL MGKRVDFSAR TVISGDPNLD LDQVGVPISI ARTLSYPEVV TPYNIHRLTE YVRNGPNEHP GAKYVIRDTG DRIDLRYNKR AGDIALQYGW KVERHLMDND PVLFNRQPSL HKMSMMAHRV KVMPYSTFRL NLSVTSPYNA DFDGDEMNLH VPQSPETRAE LSEICAVPLQ IVSPQSNKPV MGIVQDTLCG IRKMTLRDNF IDYDQVMNML YWIPNWDGVI PPPAIAKPKP LWTGKQLLSM AIPKGIHLQR FDGGKDLLSP KDTGMLIVDG EIMFGVVDKK TVGATGGGLI HTVMREKGPR VCAQLFSSIQ KVTNYWLLHN GFSIGIGDTI ADVSTMKDIT STISEAKIKV QEIILDAQLN KLEPEPGMTL RESFEHNVSR VLNQARDTAG RSAEMNLKDL NNVKQMVVSG SKGSFINISQ MSACVGQQIV EGKRIPFGFS DRTLPHFTKD DYSPESKGFV ENSYLRGLTP QEFFFHAMAG REGLIDTAVK TAETGYIQRR LVKALEDIMV HYDGTTRNSL GDIIQFVYGE DGIDGTQVEK QSVDTIPGSN DSFERRFRID VLDSSKSIPE SLLESGKEIK GDVKLQKVLD EEYKQLLDDR KYLREVCFPN GDFSWPLPVN LRRIIQNAQQ IFHNGRYKAS DLRLDEVIVG VRSLCEKLLV VRGDTELVKE AQANATLLFQ CLVRSRLASR RVIEEFKLNR SSFEWVVGEI ETQFQKSIVH PGEMVGVIAA QSIGEPATQM TLNTFHYAGV SSKNVTLGVP RLKEILNVAK NIKTPALTVY LDPALSDDIE KAKVVQSAIE HTSLKNVTSS TEIYYDPDPR TTVIEEDYDT VEAYFSIPDE KVEESIEKQS PWLLRLELDR AKMLDKQLTM AQVAEKISQN FGEDLFVIWS DDTADKLIIR CRVVRDPKSL DEEADAEEDQ ILKRIEAHML ESISLRGIPG ITRVFMMQHK VNTPDATGEF KQGKEWVLET DGVNLADVMA VPGVDSSRTY SNNFIEILSV LGIEATRAAL FKEILNVLSF DGSYVNYRHM ALLVDVMTSR GHLMAITRHG INRSDTGALM RCSFEETVEI LLEAGASAEL DDCRGISENV MLGQMAPLGT GAFDVMLDDK MLQTAPSNVA VAAGNDEFAD DGGATPYREY DMEDDKIQFE EGAGFSPIHT AQVQDVSGGL TSYGGQPTSP SATSPFSYGS TSPSFGGSVS PGYGGTSPSY SPTSPSYSPT SPSYSPTSPS YSPTSPAYSP TSPSYSPTSP SYSPTSPSYS PTSPSYSPTS PSYSPTSPSY SPTSPSYSPT SPSYSPTSPS YSPTSPSYSP TSPSYSPTSP SYSPTSPSYS PTSPSYSPTS PSYSPTSPQY SPTSPQYSPT SPLYSPTSPQ YSPTSPQYSP GSPEYSPNSP KTEDKKNED
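The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for turning such a field into a contiguous string and computing its GC percentage follows. The helper names `normalize` and `gc_percent` are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def normalize(seq_field: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq_field.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, as displayed in the record.
example = "ATGAGTCGCC AGTTCCCTTA"
seq = normalize(example)

print(len(seq))         # 20
print(gc_percent(seq))  # 50.0
```

Applying the same two functions to the full Gene sequence field gives a value that can be compared against the GC content row, bearing in mind that the record reports a rounded percentage.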