Gene PICST_66941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_66941 
Symbol 
ID4837292 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1231011 
End bp1234209 
Gene Length3199 bp 
Protein Length794 aa 
Translation table12 
GC content45% 
IMG OID640388607 
Productpredicted protein 
Protein accessionXP_001382453 
Protein GI150863841 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG5099] RNA-binding protein of the Puf family, translational repressor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.979611 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCACTAACAC CTTTGGTGCC GTTACTTGCC CTTGCCCGCC GTCTGCAGCA GACCTACGTT 
ACCACTCCAT CTTCCCCGCC TGACATCCCC CGCCAAAAGC GTGAAGTCTG TCTGTCTGAT
TATTTTGCAT CTACTCATCT TTTGATTTTC CAGCGAGCTT GACTTTGCCA GTTGGTTATT
GATCCGACCA AGACGTGAAT GTCTTTACAG TAATCTTTGT CTTTGTATAT TAACCTCGGA
TCCATATTCA CTGGATCTAT TCTTGTTCCA CGTCTCAAAC GCATACATCT GGCAATTCCA
GATCGTCCAA TCACATCATC CCTAGAGGAA TCTAACGCCA TACAAACCAG AACTTCATAC
TCTGGTTCAG CATAAAACAC ATAATACCTA GATCATAGAT ATCATTTCCA GATAGTTACC
GGCGTACTAA TTGCGAGCAG AATCTATCAT TTCTATTATC AGAATCTTCA ATATTTGATC
TGAATCCACA AATAAGAATC TTAATAAACC CTGATTGAAT ACTTGTCCGA TTCTAAAATA
TATTGGTAAC TACTATTGAT CTAATACTGT TAAAAGATAT CAATAATCTT CAACTATTCA
TCACCACTGT ACCATATCTG TACACTAGTA CTTGCATATT AATCATAATG ACAAACTTAC
AGTCGCGTTC TGCTTCTCTT TCTACTACCG TTGACGACTC GTCTATTGTG TCCAGTCCTC
CTGCTGCTCT TCCTTCGTTT GGTACCAAGC CAAGCATGAG ATCTGCCTCA ATCGGCTCGT
CTTTCTTCCA CAAAGACACG CCCAACTTCT TTGGTTCTAA ATCGGCTGCT GTTGCTGATA
CATCCTCTGG ATCTATTGAA GAAAAAGAAT CGTCGAGTTC TAACACCAAC ACCAATACTA
ACGTCAATTC CAACTCAAAA ACAGATGCCG TTCTTCCTAC GGCTGGTTCG AATGACTTGG
ATATCGTGGG TGCCATCGGA AATCTCGACT TGGATGACGA TTTTGCTGTA GACTCGAACG
AGGCTCTTTC CCAAGGTCAT CCTTCCAAGT CCACCACTCC TTTTCTGCAA CAATCAACCT
TTTTGTATGG GGACAACGTG AACAGTTACC AGTTCTACCA CCCTCATTAC CCTCCGCCAC
CTCACCAAGG CTCGTTGACA CCTAACCCCT CTCTTGGAGG CTTTGCTCCT ACTACCTTTG
GAGCTTCAAA CACGCCTTCC TCTACTTGGA ACAACTCGTT CATTCCTTCG TCTGCTCCTC
CATTTACTTA TAACAACGCT GGAAACAACG GTAACAAGGA TTTGGCTGCA GATTTGCCCT
CTTTTGTCAA GTCTTTTGAA CTTCCTTCGA CTGACGAATC TGAATCTGGT TCAGGCAATG
ATAAGAACGC CGACAAGAGC TCTAACGCAT CTAACTCTGC CAATGTCAAC TCCAATGTAG
CCCTTTTCTC CTCTCCTTTT GGCGGAATCT TGGACTCCCA ACTCCATTTT ATGCCTACTG
CCAATCTTTC TGGCGGCCAC ATGGTCAACG ACAAGAACGG TGAATCTGAA AACGACTTGG
GCTTGCTGGA GAAGGGCTCC AGCAACTTCC AGGGATCTTC TCCTCTTGAC TTCAATTCCC
ACCAGATGAA CGTTAAGAAA GGAGGCTACG ACTTAGCTGG TGTTCCTCCA TTTCCTACAG
GTTTGAACGA CAACTTATCG ATGCTTCCCA ACGTAATGGG TCATCCTGGA GCTCCGGGAG
GCCCCATGCC CAACATGTGG AACCAGCAGA TGATGAATTC TCAGTTGCAC CCCACAAATA
TGAGAAACAC CTATGACGGC AGAGGCATGG CTCCTCCACC TCCTCCTCAG AGCGGAATGC
ATCCTAACCA TCACATGATG GGCCAATTCA ACAACGGTGC CAACTCAGGA ACTCGTGGAA
ACAGCAATAG CAGCGGCAAC CACCACAATG GCCGCCACCG TGGCAATTAC ATGAACGATG
GTATGAATGT CCATCGCAAG ATGCACAACA GCATCAACAG TAGCAATGGC CACGGCAGAA
GAAAAGGCGA CGATGCTTCA AAGTATGCCA ATGCCAAATT GAGTGATTTT ACGGGCGAGA
TCTTCTCGTT GTGTAAAGAC CAGCATGGTT GTAGATTCTT GCAACGTCAG TTGGACTTGG
GCCGTGAAGT TGCAGAAGGC AGGAATACCG ATTCTTCTGT CTTGTCAAAC GACATTGCTG
CCACTATGAT ATTCAACGAA ATCTACTTGA AAATTGTTGA ATTGATGACG GATCCGTTCG
GCAACTACTT GATTCAGAAG TTATTTGAGA ACGTTTCCGT AGACCAGAGA ATTATCTTGG
TCAAAAACGC CGCTCCTGAG TTCATTCGTA TTGCATTAGA TCCACACGGT ACGAGAGCTT
TGCAAAAGTT GGTGGAGTGC ATTTCTACCG AAGAGGAGTC GAAGTTGATT ATTGGTTCCT
TGAGCCCTCA TATTGTATCT TTATCGAGAG ACTTGAACGG TAACCATGTA GTACAGAAGT
GTTTGCAGAA GTTGAAGCCG GAAGAGAACC AGTTCATCTT CGAAACAGCA TCATTGCATT
GCAACGAAAT CGCTACTCAC AGACACGGCT GTTGTGTCTT ACAGAGATGT TTGGACCACG
GCAACTCGGA CCAACGTAGG CAGTTGTCGT TAAAGGTGGC TGAAAATGCC ACCAACTTAT
CGCTTGACCC ATTCGGCAAC TACGTAGTTC AGTATGTCTT GTCCAGAGGT GACGAGGGAT
CGATCCAGAT CATCATGGAT CACATCAAGT CCAACATCAT TTCTTTATCG CTCCACAAAT
TCGGTTCCAA CGTCATTGAA AAGTCGTTGA GAATCAACAA GTTGACCAAC ACCTTGATCG
ACGTCTTGTT GAAGCACCAG GACAGATTCT CGGACATGTT GAATGACGCC TTTGGTAACT
ACGTGTTGCA AACCAGTTTG GATGTAGCCA ATCCGCAGGA CTTGAACAGC TTGTCTCAAG
CGTTGCAGCC CTTGTTGCCC AATATCAAGA ATACTCCTCA TGGCAGAAGG ATTATGACCA
AGATCCAGAG CATTATGTGA GGTTTGCTTG TTGTGCCATT TTTGTCTTGA CGAAAGTCTT
GCTTGTTTAT ATAGCTGTAT TTTAGACTGT ACATTATACG GAAGCATACT ATGTATGATT
AATATAATTA TGATAGCGT
 
Protein sequence
MTNLQSRSAS LSTTVDDSSI VSSPPAALPS FGTKPSMRSA SIGSSFFHKD TPNFFGSKSA 
AVADTSSGSI EEKESSNAVL PTAGSNDLDI VGAIGNLDLD DDFAVDSNEA LSQGHPSKST
TPFSQQSTFL YGDNVNSYQF YHPHYPPPPH QGSLTPNPSL GGFAPTTFGA SNTPSSTWNN
SFIPSSAPPF TYNNAGNNGN KDLAADLPSF VKSFELPSTD ESESGSGNDK NADKSSNASN
SANVNSNVAL FSSPFGGILD SQLHFMPTAN LSGGHMVNDK NGESENDLGL SEKGSSNFQG
SSPLDFNSHQ MNVKKGGYDL AGVPPFPTGL NDNLSMLPNV MGHPGAPGGP MPNMWNQQMM
NSQLHPTNMR NTYDGRGMAP PPPPQSGMHP NHHMMGQFNN GANSGTRGNS NSSGNHHNGR
HRGNYMNDGM NVHRKMHNSI NSSNGHGRRK GDDASKYANA KLSDFTGEIF SLCKDQHGCR
FLQRQLDLGR EVAEGRNTDS SVLSNDIAAT MIFNEIYLKI VELMTDPFGN YLIQKLFENV
SVDQRIILVK NAAPEFIRIA LDPHGTRALQ KLVECISTEE ESKLIIGSLS PHIVSLSRDL
NGNHVVQKCL QKLKPEENQF IFETASLHCN EIATHRHGCC VLQRCLDHGN SDQRRQLSLK
VAENATNLSL DPFGNYVVQY VLSRGDEGSI QIIMDHIKSN IISLSLHKFG SNVIEKSLRI
NKLTNTLIDV LLKHQDRFSD MLNDAFGNYV LQTSLDVANP QDLNSLSQAL QPLLPNIKNT
PHGRRIMTKI QSIM