Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_66941 |
Symbol | |
ID | 4837292 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 1231011 |
End bp | 1234209 |
Gene Length | 3199 bp |
Protein Length | 794 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640388607 |
Product | predicted protein |
Protein accession | XP_001382453 |
Protein GI | 150863841 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG5099] RNA-binding protein of the Puf family, translational repressor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.979611 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCACTAACAC CTTTGGTGCC GTTACTTGCC CTTGCCCGCC GTCTGCAGCA GACCTACGTT ACCACTCCAT CTTCCCCGCC TGACATCCCC CGCCAAAAGC GTGAAGTCTG TCTGTCTGAT TATTTTGCAT CTACTCATCT TTTGATTTTC CAGCGAGCTT GACTTTGCCA GTTGGTTATT GATCCGACCA AGACGTGAAT GTCTTTACAG TAATCTTTGT CTTTGTATAT TAACCTCGGA TCCATATTCA CTGGATCTAT TCTTGTTCCA CGTCTCAAAC GCATACATCT GGCAATTCCA GATCGTCCAA TCACATCATC CCTAGAGGAA TCTAACGCCA TACAAACCAG AACTTCATAC TCTGGTTCAG CATAAAACAC ATAATACCTA GATCATAGAT ATCATTTCCA GATAGTTACC GGCGTACTAA TTGCGAGCAG AATCTATCAT TTCTATTATC AGAATCTTCA ATATTTGATC TGAATCCACA AATAAGAATC TTAATAAACC CTGATTGAAT ACTTGTCCGA TTCTAAAATA TATTGGTAAC TACTATTGAT CTAATACTGT TAAAAGATAT CAATAATCTT CAACTATTCA TCACCACTGT ACCATATCTG TACACTAGTA CTTGCATATT AATCATAATG ACAAACTTAC AGTCGCGTTC TGCTTCTCTT TCTACTACCG TTGACGACTC GTCTATTGTG TCCAGTCCTC CTGCTGCTCT TCCTTCGTTT GGTACCAAGC CAAGCATGAG ATCTGCCTCA ATCGGCTCGT CTTTCTTCCA CAAAGACACG CCCAACTTCT TTGGTTCTAA ATCGGCTGCT GTTGCTGATA CATCCTCTGG ATCTATTGAA GAAAAAGAAT CGTCGAGTTC TAACACCAAC ACCAATACTA ACGTCAATTC CAACTCAAAA ACAGATGCCG TTCTTCCTAC GGCTGGTTCG AATGACTTGG ATATCGTGGG TGCCATCGGA AATCTCGACT TGGATGACGA TTTTGCTGTA GACTCGAACG AGGCTCTTTC CCAAGGTCAT CCTTCCAAGT CCACCACTCC TTTTCTGCAA CAATCAACCT TTTTGTATGG GGACAACGTG AACAGTTACC AGTTCTACCA CCCTCATTAC CCTCCGCCAC CTCACCAAGG CTCGTTGACA CCTAACCCCT CTCTTGGAGG CTTTGCTCCT ACTACCTTTG GAGCTTCAAA CACGCCTTCC TCTACTTGGA ACAACTCGTT CATTCCTTCG TCTGCTCCTC CATTTACTTA TAACAACGCT GGAAACAACG GTAACAAGGA TTTGGCTGCA GATTTGCCCT CTTTTGTCAA GTCTTTTGAA CTTCCTTCGA CTGACGAATC TGAATCTGGT TCAGGCAATG ATAAGAACGC CGACAAGAGC TCTAACGCAT CTAACTCTGC CAATGTCAAC TCCAATGTAG CCCTTTTCTC CTCTCCTTTT GGCGGAATCT TGGACTCCCA ACTCCATTTT ATGCCTACTG CCAATCTTTC TGGCGGCCAC ATGGTCAACG ACAAGAACGG TGAATCTGAA AACGACTTGG GCTTGCTGGA GAAGGGCTCC AGCAACTTCC AGGGATCTTC TCCTCTTGAC TTCAATTCCC ACCAGATGAA CGTTAAGAAA GGAGGCTACG ACTTAGCTGG TGTTCCTCCA TTTCCTACAG GTTTGAACGA CAACTTATCG ATGCTTCCCA ACGTAATGGG TCATCCTGGA GCTCCGGGAG GCCCCATGCC CAACATGTGG AACCAGCAGA TGATGAATTC TCAGTTGCAC CCCACAAATA TGAGAAACAC CTATGACGGC AGAGGCATGG CTCCTCCACC TCCTCCTCAG AGCGGAATGC ATCCTAACCA TCACATGATG GGCCAATTCA ACAACGGTGC CAACTCAGGA ACTCGTGGAA ACAGCAATAG CAGCGGCAAC CACCACAATG GCCGCCACCG TGGCAATTAC ATGAACGATG GTATGAATGT CCATCGCAAG ATGCACAACA GCATCAACAG TAGCAATGGC CACGGCAGAA GAAAAGGCGA CGATGCTTCA AAGTATGCCA ATGCCAAATT GAGTGATTTT ACGGGCGAGA TCTTCTCGTT GTGTAAAGAC CAGCATGGTT GTAGATTCTT GCAACGTCAG TTGGACTTGG GCCGTGAAGT TGCAGAAGGC AGGAATACCG ATTCTTCTGT CTTGTCAAAC GACATTGCTG CCACTATGAT ATTCAACGAA ATCTACTTGA AAATTGTTGA ATTGATGACG GATCCGTTCG GCAACTACTT GATTCAGAAG TTATTTGAGA ACGTTTCCGT AGACCAGAGA ATTATCTTGG TCAAAAACGC CGCTCCTGAG TTCATTCGTA TTGCATTAGA TCCACACGGT ACGAGAGCTT TGCAAAAGTT GGTGGAGTGC ATTTCTACCG AAGAGGAGTC GAAGTTGATT ATTGGTTCCT TGAGCCCTCA TATTGTATCT TTATCGAGAG ACTTGAACGG TAACCATGTA GTACAGAAGT GTTTGCAGAA GTTGAAGCCG GAAGAGAACC AGTTCATCTT CGAAACAGCA TCATTGCATT GCAACGAAAT CGCTACTCAC AGACACGGCT GTTGTGTCTT ACAGAGATGT TTGGACCACG GCAACTCGGA CCAACGTAGG CAGTTGTCGT TAAAGGTGGC TGAAAATGCC ACCAACTTAT CGCTTGACCC ATTCGGCAAC TACGTAGTTC AGTATGTCTT GTCCAGAGGT GACGAGGGAT CGATCCAGAT CATCATGGAT CACATCAAGT CCAACATCAT TTCTTTATCG CTCCACAAAT TCGGTTCCAA CGTCATTGAA AAGTCGTTGA GAATCAACAA GTTGACCAAC ACCTTGATCG ACGTCTTGTT GAAGCACCAG GACAGATTCT CGGACATGTT GAATGACGCC TTTGGTAACT ACGTGTTGCA AACCAGTTTG GATGTAGCCA ATCCGCAGGA CTTGAACAGC TTGTCTCAAG CGTTGCAGCC CTTGTTGCCC AATATCAAGA ATACTCCTCA TGGCAGAAGG ATTATGACCA AGATCCAGAG CATTATGTGA GGTTTGCTTG TTGTGCCATT TTTGTCTTGA CGAAAGTCTT GCTTGTTTAT ATAGCTGTAT TTTAGACTGT ACATTATACG GAAGCATACT ATGTATGATT AATATAATTA TGATAGCGT
|
Protein sequence | MTNLQSRSAS LSTTVDDSSI VSSPPAALPS FGTKPSMRSA SIGSSFFHKD TPNFFGSKSA AVADTSSGSI EEKESSNAVL PTAGSNDLDI VGAIGNLDLD DDFAVDSNEA LSQGHPSKST TPFSQQSTFL YGDNVNSYQF YHPHYPPPPH QGSLTPNPSL GGFAPTTFGA SNTPSSTWNN SFIPSSAPPF TYNNAGNNGN KDLAADLPSF VKSFELPSTD ESESGSGNDK NADKSSNASN SANVNSNVAL FSSPFGGILD SQLHFMPTAN LSGGHMVNDK NGESENDLGL SEKGSSNFQG SSPLDFNSHQ MNVKKGGYDL AGVPPFPTGL NDNLSMLPNV MGHPGAPGGP MPNMWNQQMM NSQLHPTNMR NTYDGRGMAP PPPPQSGMHP NHHMMGQFNN GANSGTRGNS NSSGNHHNGR HRGNYMNDGM NVHRKMHNSI NSSNGHGRRK GDDASKYANA KLSDFTGEIF SLCKDQHGCR FLQRQLDLGR EVAEGRNTDS SVLSNDIAAT MIFNEIYLKI VELMTDPFGN YLIQKLFENV SVDQRIILVK NAAPEFIRIA LDPHGTRALQ KLVECISTEE ESKLIIGSLS PHIVSLSRDL NGNHVVQKCL QKLKPEENQF IFETASLHCN EIATHRHGCC VLQRCLDHGN SDQRRQLSLK VAENATNLSL DPFGNYVVQY VLSRGDEGSI QIIMDHIKSN IISLSLHKFG SNVIEKSLRI NKLTNTLIDV LLKHQDRFSD MLNDAFGNYV LQTSLDVANP QDLNSLSQAL QPLLPNIKNT PHGRRIMTKI QSIM
|
| |