Gene PICST_80709 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_80709 
SymbolRPB3 
ID4850947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp527781 
End bp528854 
Gene Length1074 bp 
Protein Length315 aa 
Translation table 
GC content46% 
IMG OID640392655 
Product45 kDa subunit of RNA polymerase II 
Protein accessionXP_001387335 
Protein GI126273908 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGTGCATCTA AAATGGAGGT CGACAAGTCT GACGGTCCGT CAGTGACCAT CAGAGAATCT 
GAGCGTGATC ACGTCAACTT CATTCTCCGA GATGTCGATA TGGCCATGGC CAACTCTGTG
AGAAGAGTGA TGTTAGCAGA AGTGCCTACT TTGGCCATCG ACTTGGTGGA GATCGATGTC
AACACTTCTG TGTTAGCAGA CGAATTCTTG TCGCATCGTT TAGGTTTGGT TCCGCTTGTG
TCCGAAGGTA TTGAGAATTT GACGTATTCC CGTGACTGTA CTTGTGACAA CTATTGTCCC
AAGTGCTCTG TTAGATTAGA ACTCACAGCC AAATGTGATA CTGATTCGAC TATGAATGTT
TATGCTACAG ATTTGGCCAA GTTTCACAAC GGTTCACGTT TGGGAGATCC CGTCGTAAGA
GATGTTCAAA AGAGAGGCCC ACTTATCTGT AAATTGAGAA AGCACCAGGA GTTAAGATTG
ACTTGTATAG CCAAGAAGGG TATAGCCAAG GAACATGCCA AATGGTCTCC CTGTTCTGCC
GTTGGGTTCG AATACGATCC TTGGAACAAG TTGAAACACA CCGACTACTG GTACGAAGTC
GATGCTGACG AAGAATGGCC CAAGTCTGAG AACTGCGAAT GGGAAGAAGT GCCAGATCCA
GATGCTCCTT TTGACTACAA GGCTAAACCC ACTAGTTATT ATATAGACGT GGAAACTGTG
GGCAACTTGC CACCCAACGA AGTGGTATTG CGTTCCATAG AGACGTTGCA GAGAAAGCTT
GCTGACATCG CTATCGAATT GAACAAAGAG TCTGTAGAAG CCAGCAGCAC TGCCAACAAC
GGAGGCTTAA CTACTTATGG AAGATCCCAG TACGACAACG GTGGCGATAG TCCAGGCATG
GGAAGGACTC CCTACGGTGG CGATTCCGGC TTTGGCGGTG CCTCTTCCTG GAACGCATAG
TCTCTGTACA TTCACATACT CACTGTGTAT GTCGATTTCA CTTCTACCAC CGTTTAGTAG
TTTCAACTGT TAAATGTTTG TATCATTAGC TGTCTATAAA TAAAAGTAAT ACTC
 
Protein sequence
MEVDKSDGPS VTIRESERDH VNFILRDVDM AMANSVRRVM LAEVPTLAID LVEIDVNTSV 
LADEFLSHRL GLVPLVSEGI ENLTYSRDCT CDNYCPKCSV RLELTAKCDT DSTMNVYATD
LAKFHNGSRL GDPVVRDVQK RGPLICKLRK HQELRLTCIA KKGIAKEHAK WSPCSAVGFE
YDPWNKLKHT DYWYEVDADE EWPKSENCEW EEVPDPDAPF DYKAKPTSYY IDVETVGNLP
PNEVVLRSIE TLQRKLADIA IELNKESVEA SSTANNGGLT TYGRSQYDNG GDSPGMGRTP
YGGDSGFGGA SSWNA