Gene PICST_86849 details


Gene Information

Locus tag: PICST_86849
Symbol: DIS3
ID: 4851890
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 3115055
End bp: 3118130
Gene Length: 3076 bp
Protein Length: 989 aa
Translation table:
GC content: 42%
IMG OID: 640393598
Product: 3'-5' exoribonuclease required for 3' end formation of 5.8S rRNA
Protein accession: XP_001387162
Protein GI: 126275931
COG category: [K] Transcription
COG ID: [COG0557] Exoribonuclease R
TIGRFAM ID: [TIGR00358] VacB and RNase II family 3'-5' exoribonucleases
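
The length fields above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state the convention) and that the protein length excludes the stop codon. The function name `feature_length` is hypothetical and used only for illustration.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values taken from the record above.
gene_length = feature_length(3115055, 3118130)
print(gene_length)  # 3076, matching the Gene Length field

# Assumption: 989 residues plus one stop codon, three bases each.
cds_length = (989 + 1) * 3
print(cds_length)  # 2970

# Bases in the listed gene span that are not part of the coding region.
print(gene_length - cds_length)  # 106
```

Under these assumptions the 106 bp difference would correspond to flanking sequence included in the listed gene span rather than to coding sequence.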


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.188778
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CTTACGTTTC CATAATTTCT CAAGATGATT GGGAACAGAA AACGATTGTC TAGTGGACTT 
AGTGTCACCT CCAAGGTGTT CGTGAGGTCT CGTAATGGTG GAGCCCAAAA GATTGTTAGG
GAACACTACT TAAGAGACGA TATACCATGC TACTCCAAGG CATGTAAAGT TTGTCACGAC
ATCATCAAGA CGGATGCAGC AGGTGAGCTC CCTCGTTTCA TCTTGTCCGA GAACCCAAGC
AAGACTAAAT CTGGTTTGCG CCATTACATA GTAGTAGACA CCAATATCAT TCTTCATGCC
ATCGATTTGT TGGAGAACGT CAGTTGTTTC TACGACGTAA TTGTGCCTCA GACAGTTTTG
GAAGAAGTGA AAAATAGATC CTTCCCTATC TACCAGCGAC TTCGGGGTTT GGTGAAATCA
GAAGATAAGC GTTTTATCGT ATTTCACAAC GAGTACGCTG AAGCTACTTA CATCACGAGA
AACAAGAACG AGTCCATAAA CGATAGAAAC GACAGAGCCA TCCGGAAGGT GGCTTCGTGG
TTCGGTACTC ACATGAAGAC CAACCATTCT GAAAACGATA TAGGAATCAT TTTTATCTGT
AATGACGCTG ATAACCGTCA AAAGGCTCGT GCTGAGAAGA TAGATGCCCG ATCCCTTGTA
GAGTACTTCG ATGATTTACC TAATGGTGAA GACTTAAGGG ACTTGATTCC AAGTGATATC
AACAGTGGAG TAGACTTCCA TAACGAGAAT GAGGCGTCAT TTCCGGAGTA CTACTCGAAC
TCGAGAATCA TGGCTGGTGT CAAGAATGGT ACTCTTTATC AGGGTATGTT GAATATCTCT
TCGTACAACT TTCTTCAAGG TACAGTGCAA GTTCCAGCTT TCAGGAAGCC ACTTTTGATT
GTCGGCTCCC AGAACCTCAA CCGTGCCTTC AACTCGGATT CAGTTATAGT AGAGTTGTTA
CCCAAAGATA AATGGAGAGA ACCTTCTACC ACGATAGTGG AAGAAAGTAA CATTGGTTCT
AACGACAACG CCAACGATGA AGATGAAGAA GATGTCGGTT CTGCAGGCTC CGGTGTAGTT
TCCGATAAAG AAAGAATTTT GTTGGCCCAG GAAGCATTGA AGGCTACTTC TTCTTCAGCA
GGAGTGCAAA AGAGACTTCA GCCTACTGCT AGAGTTGTAG GTATTATGAG AAGATCATGG
AGATACTACG TAGGGCAAAT CGCACCATCT TCTGTTTCTA GCGACGATAC TGGTAATGCT
TCCAAGAGCT GTTTTGTCAT CTTGATGGAC AAGGTATTGC CTAAGATCAG GATCAGAACA
AGAAAAGCCA AGGAGTTCTT AGGTCAGAGA ATCGTTGTTG TTGTGGACTC ATGGCCAGCA
GATTCTCGTT ATCCAAATGG ACATTTTGTG AGAACTCTTG GTGAAATAGA AAGTGCCGAG
GCTGAAACTG AGGCATTGTT GTTGGAACAT GATGTAGAGT ACAGACCCTT CTCCAAGAAT
GTCTTGGACT GTTTGCCGGA AGAAGGTGAC AATTGGGTTG TTCCAGATCT TGTCAACACA
GACGATCCCA AGCTCAAGAA GAGGGTTGAC TTGAGAGACA AGTTGGTATG TTCTATCGAT
CCACCCAACT GTGTGGATAT CGATGATGCC TTACATGCAA AACCACTCCC CAACGGCAAC
TACGAAGTTG GTGTTCATAT AGCTGATGTG ACCCATTTCG TTAAGCCTAA CACAGCATTA
GATCAAGAAG GTGCCTCTAG GGGTACATCT GTCTACTTGG TGGATAAAAG AATCGACATG
TTGCCCATGT TGTTGGGTAC TAATTTGTGT TCGTTGAAGC CATATGTGGA CAGGTTTGCC
TTTTCAGTTA TTTGGGAATT GGATGACAAT GCTAACATTG TCAAGGTTGA GTACATGAAG
TCAGTGATCA AGTCCAGAGA AGCCTTTTCT TACGAAAGAG CCCAAACCCG GATCGACGAT
AAGTCACAAA CTGACGAGTT GACTGAATCG ATGAGAATCT TGTTGAAGCT TTCCAAAAAG
CTCAAACAGC AGAGATTGGA TGCCGGTGCC TTAAACTTAG CATCGCCGGA AGTCAAGGTC
CACATGGATA GTGAGACGTC AGATCCCAAT GAGGTTGAAA TCAAGAAGTT ATTGGAGACC
AACTCTCTTG TTGAAGAGTT CATGTTGTTG GCCAACATTT CCGTAGCCAG AAAGATTTAC
GATTCATATC CACAAACTGC GATGTTAAGA AGACACGCAC CTCCTCCAGC AACAAATTTC
GAGCAGTTGA ACGATATGTT GAGTGTGAGA AAGCCAGGCT TGTCTATTTC TTTGGAGAGT
TCCAGAGCTT TGGCGGACTC GTTGGATAGA TGTGAAGATC CACAAGATCC ATATTTCAAC
ACCCTTTTGC GTATCATGTC AACTAGATGT ATGATGGCAG CTGAGTACTT CTCATCTGGG
TCGTACGGAT ATCCTGAATT CAGACACTAT GGTTTGGCTG TAGATATCTA CACACATTTT
ACGTCACCTA TCAGAAGATA CTGTGATGTT GTAGCACACA GACAGTTGGC TGGTGCAATT
GGTTATGAAA ACTTGGACTT GAGTCACAGA GACAAAAACA AAATGGAGAC AATTGTCAAG
AATATCAATA GAAGACACAG AAGTGCACAA TTTGCTGGCC GTGCCAGTAT TGAGTACTAT
GTTGGACAAG TTATGAAAAA TAATGAAGCC GAACATGAAG GGTATGTGAT CAAGTGTTTC
AACAACGGAA TTGTTGTTCT TGTGCCCAAG TTTGGCATTG AAGGTTTGAT AAAGCTTGAG
ATGTTAGGTG ATGTGAACAC TGCTATCTAT GATGAAGACA AGTACGAGTT GAAGTTCACT
GATCTCCAAG GTAAGGAAAG AAGAGTTGCA GTTTTTGACA AAGTTAATGT AGATGTGAAG
TCAGTTAAAG ATGAAGTCAG TGGTAACAGA AAGGCGCAAT TACTTTTGCG TTGAAGTTGT
ACTATAGAGT ATCTAGGATA TGCATAGTAT CTTGGGTATA CATAAAAATA TATACACATT
AGAAAGATTA TCTATT
 
Protein sequence
MIGNRKRLSS GLSVTSKVFV RSRNGGAQKI VREHYLRDDI PCYSKACKVC HDIIKTDAAG 
ELPRFILSEN PSKTKSGLRH YIVVDTNIIL HAIDLLENVS CFYDVIVPQT VLEEVKNRSF
PIYQRLRGLV KSEDKRFIVF HNEYAEATYI TRNKNESIND RNDRAIRKVA SWFGTHMKTN
HSENDIGIIF ICNDADNRQK ARAEKIDARS LVEYFDDLPN GEDLRDLIPS DINSGVDFHN
ENEASFPEYY SNSRIMAGVK NGTLYQGMLN ISSYNFLQGT VQVPAFRKPL LIVGSQNLNR
AFNSDSVIVE LLPKDKWREP STTIVEESNI GSNDNANDED EEDVGSAGSG VVSDKERILL
AQEALKATSS SAGVQKRLQP TARVVGIMRR SWRYYVGQIA PSSVSSDDTG NASKSCFVIL
MDKVLPKIRI RTRKAKEFLG QRIVVVVDSW PADSRYPNGH FVRTLGEIES AEAETEALLL
EHDVEYRPFS KNVLDCLPEE GDNWVVPDLV NTDDPKLKKR VDLRDKLVCS IDPPNCVDID
DALHAKPLPN GNYEVGVHIA DVTHFVKPNT ALDQEGASRG TSVYLVDKRI DMLPMLLGTN
LCSLKPYVDR FAFSVIWELD DNANIVKVEY MKSVIKSREA FSYERAQTRI DDKSQTDELT
ESMRILLKLS KKLKQQRLDA GALNLASPEV KVHMDSETSD PNEVEIKKLL ETNSLVEEFM
LLANISVARK IYDSYPQTAM LRRHAPPPAT NFEQLNDMLS VRKPGLSISL ESSRALADSL
DRCEDPQDPY FNTLLRIMST RCMMAAEYFS SGSYGYPEFR HYGLAVDIYT HFTSPIRRYC
DVVAHRQLAG AIGYENLDLS HRDKNKMETI VKNINRRHRS AQFAGRASIE YYVGQVMKNN
EAEHEGYVIK CFNNGIVVLV PKFGIEGLIK LEMLGDVNTA IYDEDKYELK FTDLQGKERR
VAVFDKVNVD VKSVKDEVSG NRKAQLLLR
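
The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the listing could be cleaned, its GC fraction computed, and the start of the protein recovered. It uses the standard genetic code (NCBI table 1) as an assumption, because the Translation table field in this record is empty; this organism is generally reported to decode CTG as serine (NCBI table 12), so a full-length translation may need that table instead. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), built in TCAG order.
# Assumption: the record does not say which table applies.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Drop the whitespace used for display grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s) if s else 0.0


def translate(seq: str, start: int = 0) -> str:
    """Translate from 0-based offset `start` up to the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(start, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # X marks unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listing above.
first_line = "CTTACGTTTC CATAATTTCT CAAGATGATT GGGAACAGAA AACGATTGTC TAGTGGACTT"
start = clean(first_line).find("ATG")  # first ATG in any frame; illustrative choice
print(start)                         # 24
print(translate(first_line, start))  # MIGNRKRLSSGL
```

The translated fragment matches the first twelve residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.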