Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_86849 |
Symbol | DIS3 |
ID | 4851890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 3115055 |
End bp | 3118130 |
Gene Length | 3076 bp |
Protein Length | 989 aa |
Translation table | |
GC content | 42% |
IMG OID | 640393598 |
Product | 3'-5' exoribonuclease required for 3 end formation of 5.8S rRNA |
Protein accession | XP_001387162 |
Protein GI | 126275931 |
COG category | [K] Transcription |
COG ID | [COG0557] Exoribonuclease R |
TIGRFAM ID | [TIGR00358] VacB and RNase II family 3'-5' exoribonucleases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.188778 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CTTACGTTTC CATAATTTCT CAAGATGATT GGGAACAGAA AACGATTGTC TAGTGGACTT AGTGTCACCT CCAAGGTGTT CGTGAGGTCT CGTAATGGTG GAGCCCAAAA GATTGTTAGG GAACACTACT TAAGAGACGA TATACCATGC TACTCCAAGG CATGTAAAGT TTGTCACGAC ATCATCAAGA CGGATGCAGC AGGTGAGCTC CCTCGTTTCA TCTTGTCCGA GAACCCAAGC AAGACTAAAT CTGGTTTGCG CCATTACATA GTAGTAGACA CCAATATCAT TCTTCATGCC ATCGATTTGT TGGAGAACGT CAGTTGTTTC TACGACGTAA TTGTGCCTCA GACAGTTTTG GAAGAAGTGA AAAATAGATC CTTCCCTATC TACCAGCGAC TTCGGGGTTT GGTGAAATCA GAAGATAAGC GTTTTATCGT ATTTCACAAC GAGTACGCTG AAGCTACTTA CATCACGAGA AACAAGAACG AGTCCATAAA CGATAGAAAC GACAGAGCCA TCCGGAAGGT GGCTTCGTGG TTCGGTACTC ACATGAAGAC CAACCATTCT GAAAACGATA TAGGAATCAT TTTTATCTGT AATGACGCTG ATAACCGTCA AAAGGCTCGT GCTGAGAAGA TAGATGCCCG ATCCCTTGTA GAGTACTTCG ATGATTTACC TAATGGTGAA GACTTAAGGG ACTTGATTCC AAGTGATATC AACAGTGGAG TAGACTTCCA TAACGAGAAT GAGGCGTCAT TTCCGGAGTA CTACTCGAAC TCGAGAATCA TGGCTGGTGT CAAGAATGGT ACTCTTTATC AGGGTATGTT GAATATCTCT TCGTACAACT TTCTTCAAGG TACAGTGCAA GTTCCAGCTT TCAGGAAGCC ACTTTTGATT GTCGGCTCCC AGAACCTCAA CCGTGCCTTC AACTCGGATT CAGTTATAGT AGAGTTGTTA CCCAAAGATA AATGGAGAGA ACCTTCTACC ACGATAGTGG AAGAAAGTAA CATTGGTTCT AACGACAACG CCAACGATGA AGATGAAGAA GATGTCGGTT CTGCAGGCTC CGGTGTAGTT TCCGATAAAG AAAGAATTTT GTTGGCCCAG GAAGCATTGA AGGCTACTTC TTCTTCAGCA GGAGTGCAAA AGAGACTTCA GCCTACTGCT AGAGTTGTAG GTATTATGAG AAGATCATGG AGATACTACG TAGGGCAAAT CGCACCATCT TCTGTTTCTA GCGACGATAC TGGTAATGCT TCCAAGAGCT GTTTTGTCAT CTTGATGGAC AAGGTATTGC CTAAGATCAG GATCAGAACA AGAAAAGCCA AGGAGTTCTT AGGTCAGAGA ATCGTTGTTG TTGTGGACTC ATGGCCAGCA GATTCTCGTT ATCCAAATGG ACATTTTGTG AGAACTCTTG GTGAAATAGA AAGTGCCGAG GCTGAAACTG AGGCATTGTT GTTGGAACAT GATGTAGAGT ACAGACCCTT CTCCAAGAAT GTCTTGGACT GTTTGCCGGA AGAAGGTGAC AATTGGGTTG TTCCAGATCT TGTCAACACA GACGATCCCA AGCTCAAGAA GAGGGTTGAC TTGAGAGACA AGTTGGTATG TTCTATCGAT CCACCCAACT GTGTGGATAT CGATGATGCC TTACATGCAA AACCACTCCC CAACGGCAAC TACGAAGTTG GTGTTCATAT AGCTGATGTG ACCCATTTCG TTAAGCCTAA CACAGCATTA GATCAAGAAG GTGCCTCTAG GGGTACATCT GTCTACTTGG TGGATAAAAG AATCGACATG TTGCCCATGT TGTTGGGTAC TAATTTGTGT TCGTTGAAGC CATATGTGGA CAGGTTTGCC TTTTCAGTTA TTTGGGAATT GGATGACAAT GCTAACATTG TCAAGGTTGA GTACATGAAG TCAGTGATCA AGTCCAGAGA AGCCTTTTCT TACGAAAGAG CCCAAACCCG GATCGACGAT AAGTCACAAA CTGACGAGTT GACTGAATCG ATGAGAATCT TGTTGAAGCT TTCCAAAAAG CTCAAACAGC AGAGATTGGA TGCCGGTGCC TTAAACTTAG CATCGCCGGA AGTCAAGGTC CACATGGATA GTGAGACGTC AGATCCCAAT GAGGTTGAAA TCAAGAAGTT ATTGGAGACC AACTCTCTTG TTGAAGAGTT CATGTTGTTG GCCAACATTT CCGTAGCCAG AAAGATTTAC GATTCATATC CACAAACTGC GATGTTAAGA AGACACGCAC CTCCTCCAGC AACAAATTTC GAGCAGTTGA ACGATATGTT GAGTGTGAGA AAGCCAGGCT TGTCTATTTC TTTGGAGAGT TCCAGAGCTT TGGCGGACTC GTTGGATAGA TGTGAAGATC CACAAGATCC ATATTTCAAC ACCCTTTTGC GTATCATGTC AACTAGATGT ATGATGGCAG CTGAGTACTT CTCATCTGGG TCGTACGGAT ATCCTGAATT CAGACACTAT GGTTTGGCTG TAGATATCTA CACACATTTT ACGTCACCTA TCAGAAGATA CTGTGATGTT GTAGCACACA GACAGTTGGC TGGTGCAATT GGTTATGAAA ACTTGGACTT GAGTCACAGA GACAAAAACA AAATGGAGAC AATTGTCAAG AATATCAATA GAAGACACAG AAGTGCACAA TTTGCTGGCC GTGCCAGTAT TGAGTACTAT GTTGGACAAG TTATGAAAAA TAATGAAGCC GAACATGAAG GGTATGTGAT CAAGTGTTTC AACAACGGAA TTGTTGTTCT TGTGCCCAAG TTTGGCATTG AAGGTTTGAT AAAGCTTGAG ATGTTAGGTG ATGTGAACAC TGCTATCTAT GATGAAGACA AGTACGAGTT GAAGTTCACT GATCTCCAAG GTAAGGAAAG AAGAGTTGCA GTTTTTGACA AAGTTAATGT AGATGTGAAG TCAGTTAAAG ATGAAGTCAG TGGTAACAGA AAGGCGCAAT TACTTTTGCG TTGAAGTTGT ACTATAGAGT ATCTAGGATA TGCATAGTAT CTTGGGTATA CATAAAAATA TATACACATT AGAAAGATTA TCTATT
|
Protein sequence | MIGNRKRLSS GLSVTSKVFV RSRNGGAQKI VREHYLRDDI PCYSKACKVC HDIIKTDAAG ELPRFILSEN PSKTKSGLRH YIVVDTNIIL HAIDLLENVS CFYDVIVPQT VLEEVKNRSF PIYQRLRGLV KSEDKRFIVF HNEYAEATYI TRNKNESIND RNDRAIRKVA SWFGTHMKTN HSENDIGIIF ICNDADNRQK ARAEKIDARS LVEYFDDLPN GEDLRDLIPS DINSGVDFHN ENEASFPEYY SNSRIMAGVK NGTLYQGMLN ISSYNFLQGT VQVPAFRKPL LIVGSQNLNR AFNSDSVIVE LLPKDKWREP STTIVEESNI GSNDNANDED EEDVGSAGSG VVSDKERILL AQEALKATSS SAGVQKRLQP TARVVGIMRR SWRYYVGQIA PSSVSSDDTG NASKSCFVIL MDKVLPKIRI RTRKAKEFLG QRIVVVVDSW PADSRYPNGH FVRTLGEIES AEAETEALLL EHDVEYRPFS KNVLDCLPEE GDNWVVPDLV NTDDPKLKKR VDLRDKLVCS IDPPNCVDID DALHAKPLPN GNYEVGVHIA DVTHFVKPNT ALDQEGASRG TSVYLVDKRI DMLPMLLGTN LCSLKPYVDR FAFSVIWELD DNANIVKVEY MKSVIKSREA FSYERAQTRI DDKSQTDELT ESMRILLKLS KKLKQQRLDA GALNLASPEV KVHMDSETSD PNEVEIKKLL ETNSLVEEFM LLANISVARK IYDSYPQTAM LRRHAPPPAT NFEQLNDMLS VRKPGLSISL ESSRALADSL DRCEDPQDPY FNTLLRIMST RCMMAAEYFS SGSYGYPEFR HYGLAVDIYT HFTSPIRRYC DVVAHRQLAG AIGYENLDLS HRDKNKMETI VKNINRRHRS AQFAGRASIE YYVGQVMKNN EAEHEGYVIK CFNNGIVVLV PKFGIEGLIK LEMLGDVNTA IYDEDKYELK FTDLQGKERR VAVFDKVNVD VKSVKDEVSG NRKAQLLLR
|
| |