Gene Information |
Locus tag | PICST_77995 |
Symbol | |
ID | 4839174 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 994179 |
End bp | 998468 |
Gene Length | 4290 bp |
Protein Length | 1127 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640390489 |
Product | predicted protein |
Protein accession | XP_001384860 |
Protein GI | 150865585 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG5099] RNA-binding protein of the Puf family, translational repressor |
TIGRFAM ID | |
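The coordinate fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the listed Gene Length; the helper names `span_length` and `min_coding_length` are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 994179
END_BP = 998468
PROTEIN_LENGTH_AA = 1127


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def min_coding_length(protein_length: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_length + 1)


gene_length = span_length(START_BP, END_BP)
coding_length = min_coding_length(PROTEIN_LENGTH_AA)

print(gene_length)                  # 4290, matches the Gene Length field
print(coding_length)                # 3384
print(gene_length - coding_length)  # 906
```

The 906 bp difference is sequence inside the gene span that does not encode the 1127 aa product. The record marks the gene as spliced, so part of it is presumably intron sequence, but the record itself does not break the span down further.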
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CAGTGAGTTC TCAATTGCAA CACGAGGCAC ACCGTATCAG ACACATAGAT AGCACTACTG TTCGAGGACC GTTTTATTCT TCTGACCATC ATTCTTACCA TATTTTGAAT TGTAGTGCAG TGATCTTCAT CGACGATATC AGGAACGAAG ACTGCAAGAT TCTACTACCA CTACAAGACC AGCAGTATTA CAACAGCTAG CACTCCTCCG TGATATTAGC ACTAGAAACA GCTCCAGCAG TAACGTCGTT ACCATAATTT GCAGCAGATC CATCCATTTC ATCACATCTG CTTCTAAATC ATCGACTACT CCACTATATA TCCCCATACC CATCACACCA CCGTAGCCCA TGGACACCCT CACACCATGA ACCCCACCGC GCCCGTCAAC ACCAACCCGT CGTCGCCTGC GATCTCCATC ACGCCCTCCA ACTCCAACGG CGCCAATGTC AATGTCTTCA AGTCTCGTAG TCGCGCAGGT ACCCTTCCTT CGCTGTTCTT GAACACACCC TCGCTACTTC TCAACAACAA TAATAACCTA AACAACAGCA ACAACTTAAA CAATAACTAC AATAATCTCA ACAATAATAG TAACATCAAT AATAACAACA ATAATAATTT CTCATTTCCC AACTCAAATG GCTCCTTGTC TACTTCTGCT TCTCCTTTAG TCAACGACAT TCACCACGAT ACGTTAGGCT TGCCCACCAT GGACTCCATC TCGTTGGCCG TGTCTGGCTC GTCTCCTTCG CCATCGTCCA ACGCCATCGA CATACCTACC TCATCTTCAA CTGCTACCCA CGCAGGAGTT CCTACTTCTT CGCGAAGAAT GCGTCTGGGC TCCCTTTTCT CCACTAATTC TATCTGGAAC GACGACAACG TTTCCGTGCA CTCATCTTCA CAGCACTCCA ATGGACTTCT CGATAACACA AGTTTACACT CCTTCGATGG CACTACTAAT TCCAACATCA ACAACATCAA CAATAACAAC AACAATAACA TCGTAAACAG TAATAGCAAT AATAGCAGCA ATGGATCCGG AAATACTTCG TCATTCATAT CGCCTGTTTT GGCAGCCCAG AACTCGTCTC TAACTTCTTC TGGAGCTCAT GGAACTACTT CTGCGCTCAA TGCGAATACC AGAAATAGAT CATACACAAC AACAGCAGCC ATTTCCAACA TCAACATACT CCCTATGACT AGCTCCAGTG GGTTTCTGAT GGCGGATTCT CATAATTCTT CCAGAATGTC TACTTCTCCC TTTGTAAACG TCTCTAACAA GACGAATGAC ATGAACTATC TCTTAGATAA CCTCATGCTC AACATGAACA ACGGCCCTGC TAACGCCAAT CTCTCGAATA TCGTCTCTAA CTCCAGACAC AGAGCCCAGA CATATTCGGG AACGACTCCC ACTATTCCGG AATCTACTTT AAACCATCCA CAAGGTGCCC AGGTTTTACA ACAACAGCAA CAACAGCAGC AGCAACAATA CCCTCAAATT CAACTCCAGT TTCCACACCA ACTTGCTGAA AATGTAGCTT CAGAGCAGCC GGTTTTATTG AATGACTACG ATTTCTCGCA GTTGGTTATC ACCACCAACT TTGAAAATCC CAGTCTTGGA CCTACGAGAG TTCTTCTATT TGACAACTTG CCTCAATTCG TAGATGCATT CAAGTTGTAC AATATCTTGA GCAATTCTCT AGGCAACCAA CGAACTTTGG GAGGAGTCAG AGCTATCAGA ATAACTTCCA CAGCTTCTTC GAAGCTTGCT CTCGTAGAAA GCAGCTCAGT AGACATAGCA 
ATGTCACTCA AGGCCAATTT CAACCATTTG GAGTTGGTAC CTGGCGTGAT TTTATATGTT GCTTTCGCAA AGATAGTCGA GCCTCAGTCT ACTATAACTC AGCATCAAGC GGCTCCCGTC GTTCAAGCGA CTGCTGAGAC TGCTGCACCC ATTTCTTCTT CCAATGGAAC TAATGGAAAT AGCAAGGCTA CTTCAACTAA CGGAGTTTCA GCTTCAAAGC CTAATGGTTC CAATGAATCC AAGAATACTG ACTTAATTGT TATACAAAAG AGCTTGATTC ACACCATCAG TAAGTTGAGT ACGAAAGCTA ACCACGCTGA TTTGAACAAG GTGGTCTCGA TTATCAACAA GTCGATTGCT TATCCAAATG ACCACTACCA AGACAACTTT GGTCCCTTAC CTGATCCAAT ACCTTTGCGT CAATTTGATT CTCCTAAGTT GAGAGAGTTG CGTAAGATCT TGGAGAATAA CGAAAATGCC TTGAATAATG AACCGCTCAC GAATTCGTCA CCCGTATCTA GTGGTGATGG AGAAGAAGGT GACGTTGGCA ACAAAGTCAT GACTCAGTTG GAATTAGAAG AATTGTGTTT GGCTATGCTT GACGAGTTGC CAGAGTTGTG TTACGACTAC TTGGGTAACA CTATCGTACA GAAGCTTTTC AATTTGGTTG AGTCGCCCTT AATCAAGTTG ATGATGGTGA AAGAAATCAC TCCCTTCTTG ACCCAATTGA GTATCCACAA AAACGGTACC TGGGCCATCC AGAAGATCAT AAATCTCTGT GGAAATGACT TCCAGCAGAA GTACTTGATT GGAGCTAGTT TGAAGCCTTA TGCAGCCAAG TTGTTCAACG ACCAGTTTGG TAACTATGTT TTACAGGGCT GTATCAAGTT TGGTTCGCCT TTCAATGATT TCGTTTTTGA AGCAATGTTG GATAACTTCA TCGAGATCAG TTTCGGAAGA TTTGGGGCTA GATGTATCAG AACGATTCTC GAAACTGCCA ACGAATCCAA TGCCATATCC AACGAACAAG TAGTGCTTGT GGCAGGACTT ATTGTAGAGT TTGCTAACGA ATTAGTTGTG AACAACAATG GTTCGTTGTT GATTACTTGG TTCTTAGATA CATTTAACGA CAAGGGTGCC GCTGTAGATG ACAGATTTGA GTTGTTAACC AATAAGTTTT TGCCACATTT GGCAAAGTTG TGTACCCACA AGCTTGCCAA TTTGACAATC CTCAAGATCT TGAACAACAG ATCAGACTTG AAACGGAAAC AGTTGATCAT GGACACCATA TTTGGAAGAA TGGAGGACTA TGAGGGTTAT TCTGATGAAA TTGACGACTC ATCTAGACCT ACATCCAAAT TACTAGAGTT AATTTTGAGC GAATATCCAG AAAATGCTGC AGGACCCTTA TTCATTTACA AGATCTTGAC TAACCCCTTG CTTTTGAACT TGAATGACGT CAAGACTGGA AACAACGAAA TGGATCCTAG AAGAAACTCA AGATACCAGC AATTTATTGT GGGTCAAATC AGACGAATCT TGTTAGAATT GAACATCACC AACTTCCAGC CATACAAAAA ATTGATGGAT GAAGCTGGTC TTTCGAGTAA CAGATTGAAC AGAGCAACTT CTATGACAGG AAACGGAAAA CGTAACAAGA GAGGAGGCAA TTCTCGGTCT GGCATTAGCA GCCATGGTGT GTCATCTAAG ATTTCACCAG AGTCGGGTGG CATACCTGGG CAGCCTGTGA TGGCAACTAT GTCGTATGGG GCTCCTCCTC AGTACTATAT TCCTGCTCAA CAGTACCAGC 
AACAAAAACC ACTTCCACAA GGATCTCCCA AGGGCTACAA TGCAGTTCCT CAGATGATGA GAGGTCAAGG AATGCCACCA CAGGGAGTTG TACAGCAACA GTACCAAAAC ATGCCTCCAA TGATGAACCA GCAACAATAC CCTATTAGTC AACAACCGCT CTACTACCAA CCAGAACAAC AGATGCATGA ATTGCAACAG CAACAAGATA TCTTAGCCAT GCAACAGTTG GAGCAATTGT CTTTGAGTTC TGCCGCTTTG GGCTACAATT CAAACCCGGG AACGCCTGGA GGCAACAACC AAAGAAGTCT GTTCTTCTAG CCAGCAGTAA CCACTGTATA TAAAGGTATG CAAAAAGACC TGAACTCAAA CTATTTATCA TGTTCATTTA TGAAACAGAA AAAATTACAA CTTTATACAT TTTTTTATCT ACTCTGAATT TATCTCGACT TTTTCTTTCA TTTCACTTAC TAAAGCTACG AGGGATGCTA TACAAAAAAA TAATAAGAAA ACTACTGGTT AATGGTTTCT TTTCTGAAAA TTGTTTGACT TTACTAGGGC CACCATATTG GTGTTACTTC CTTTTTTTCA CTTGTTTCCA TCTTGTTTCT GGTCACCCAG ACCTGTTATC TAAAAATAGC AACGTATTTA TAATGTTTAA TAAATAATCA ACCTATCTAA
|
Protein sequence | MNPTAPVNTN PSSPAISITP SNSNGANVNV FKSRSRAGTL PSSFLNTPSL LLNNNNNLNN SNNLNNNYNN LNNNSLPTMD SISLAVSGSS PSPSSNAIDI PTSSSTATHA GVPTSSRRMR SGSLFSTNSI WNDDNVSVHS SSQHSNGLLD NTSLHSFDGT TNSNINNINN NNNNNIVNSN SNNSSNGSGN TSSFISPNSS LTSSGAHGTT SALNANTRNR SYTTTAAISN INILPMTSSS GFSMADSHNS SRMSTSPFVN VSNKTNDMNY LLDNLMLNMN NGPANANLSN IVSNSRHRAQ TYSGTTPTIP ESTLNHPQGA QVLQQQQQQQ QQQYPQIQLQ FPHQLAENVA SEQPVLLNDY DFSQLVITTN FENPSLGPTR VLLFDNLPQF VDAFKLYNIL SNSLGNQRTL GGVRAIRITS TASSKLALVE SSSVDIAMSL KANFNHLELV PGVILYVAFA KIVEPQSTIT QHQAAPVVQA TAETAAPISS SNGTNGNSKA TSTNGVSASK PNGSNESKNT DLIVIQKSLI HTISKLSTKA NHADLNKVVS IINKSIAYPN DHYQDNFGPL PDPIPLRQFD SPKLRELRKI LENNENALNN EPLTNSSPVS SGDGEEGDVG NKVMTQLELE ELCLAMLDEL PELCYDYLGN TIVQKLFNLV ESPLIKLMMV KEITPFLTQL SIHKNGTWAI QKIINLCGND FQQKYLIGAS LKPYAAKLFN DQFGNYVLQG CIKFGSPFND FVFEAMLDNF IEISFGRFGA RCIRTILETA NESNAISNEQ VVLVAGLIVE FANELVVNNN GSLLITWFLD TFNDKGAAVD DRFELLTNKF LPHLAKLCTH KLANLTILKI LNNRSDLKRK QLIMDTIFGR MEDYEGYSDE IDDSSRPTSK LLELILSEYP ENAAGPLFIY KILTNPLLLN LNDVKTGNNE MDPRRNSRYQ QFIVGQIRRI LLELNITNFQ PYKKLMDEAG LSSNRLNRAT SMTGNGKRNK RGGNSRSGIS SHGVSSKISP ESGGIPGQPV MATMSYGAPP QYYIPAQQYQ QQKPLPQGSP KGYNAYQNMP PMMNQQQYPI SQQPLYYQPE QQMHELQQQQ DILAMQQLEQ LSLSSAALGY NSNPGTPGGN NQRSSFF
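The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on the first 48 codons of the coding sequence, copied from the Gene sequence field starting at the ATG that matches the listed protein. The codon table is built by hand from the standard code; `translate`, `CDS_START`, and `PROTEIN_START` are illustrative names, and this is a minimal check rather than a full gene-model translation (introns and the remaining sequence are not handled).

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
STANDARD_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), STANDARD_AA)
}

# Table 12 differs from the standard code in one sense codon: CTG -> Ser.
TABLE_12 = {**STANDARD_TABLE, "CTG": "S"}


def translate(dna: str, table: dict) -> str:
    """Translate complete codons of a DNA string, ignoring spaces."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First 144 nt of the coding sequence, in the 10-nt groups used by the record.
CDS_START = (
    "ATGA ACCCCACCGC GCCCGTCAAC ACCAACCCGT CGTCGCCTGC GATCTCCATC "
    "ACGCCCTCCA ACTCCAACGG CGCCAATGTC AATGTCTTCA AGTCTCGTAG "
    "TCGCGCAGGT ACCCTTCCTT CGCTGTTCTT GAACACACCC"
)
# First 48 residues of the Protein sequence field.
PROTEIN_START = "MNPTAPVNTN PSSPAISITP SNSNGANVNV FKSRSRAGTL PSSFLNTP".replace(" ", "")

with_table_12 = translate(CDS_START, TABLE_12)
with_standard = translate(CDS_START, STANDARD_TABLE)

# 0-based residue positions where the two genetic codes disagree.
differences = [
    i for i, (a, b) in enumerate(zip(with_table_12, with_standard)) if a != b
]

print(with_table_12 == PROTEIN_START)  # True
print(with_standard == PROTEIN_START)  # False
print(differences)                     # [42]
```

Only residue 43 (index 42) changes in this stretch: the CTG codon yields S under table 12 and L under the standard code, and the listed protein has S at that position.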