Gene PICST_71938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_71938 
Symbol 
ID4838949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp330868 
End bp332156 
Gene Length1289 bp 
Protein Length202 aa 
Translation table12 
GC content39% 
IMG OID640390264 
Productpredicted protein 
Protein accessionXP_001384365 
Protein GI126135682 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2007] Ribosomal protein S8E 
TIGRFAM ID[TIGR00307] ribosomal protein S8.e 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CTTATACTTT TGTTCATCAG TGATATTCTA AAGTGTATGT GTTAACCATA GCAAATAAAC 
AGCAGGTAGT CGAGACACAT GTAGCTGCTG CAAATACGAA AGCCAAACCT AAGGGAATTC
CCATACAAGT CGATACCATT GTCGTCGTTG CAGAATCGAA ATTCATTAAA TGAACATACA
CGTATTCCAG TTTAAGACAT CCATCCAAAG TCCAATCAAA TGCAAACCAT TAGTGTAGAA
CAAGCTGAGA GCAGCTATGA TACTGTTCAA GTTTGAAGAG TTTTGACTGG CTGAATCGAG
AGATATTCTG GATCACCACA GCCATAAGAA TTTTCACCCA AATCATCGAA TTGACTAAAT
TTTCCAATAC TTGACAAATG ATATACAAGT ACCTTAAACA AAATCAAATC AAATTAGTGT
TAGGGATTTA AATACGTCCA CCTTGTTTTG GCTTTCATAT TGTTTTTATG TTTTTCAAAT
GTTCACTTAC TAACATCGTA TAGGTAAAAA TGGGTATTTC TAGAGATTCA CGTCACAAGA
GATCTGCTAC TGGTGCCAAG AGAGCCCAGT TCAGAAAGAA GAGAAAGTTC GAATTAGGTA
GACAATCTGC CAACACCAAG ATTGGTGCTA AGAGAATTCA CTCCGTCAGA ACTAGAGGTG
GTAACCAAAA GTTCAGAGCT TTGAGAGTTG AAACCGGTAA CTTCTCCTGG GGTTCTGAAG
GTGTTTCCAG AAAGACCAGA ATTGCCACTG TCGTCTACCA CCCATCTAAC AACGAATTGG
TCAGAACCAA CACCTTGACT AAGGCTGCTA TTGTCCAAAT CGATGCCACT CCATTCAGAC
AATGGTACGA AAACCACTAC GGTTCTACCT TAGGTAAGAA GAAGAACCAA CCTGCTGCTA
CCGAAGAAGA AGTCAAGAGA TCTAGAAAGG TCGAAAGAAA GTTGGCTTCC AGAGCCGGTC
AAGCTGCTAT TGAATCTGCT GTTGACGCCC AATTCGGTTC CGGTAAGTTG TACGCTGCCA
TCTCTTCCAG ACCAGGTCAA TCTGGTAGAT GTGATGGTTA CATCTTGGAA GGTGAAGAAT
TGGCCTTCTA CTTGAGAAGA TTGACTGCTA AGAAGTAAAC AGCTTAGATA TAAACCTTCC
TTCATATACA TGTACCATCA ATAAAAAGAA TCATCCGCTC CATTTCTCTA TGTCTAAATA
TTAATATAAC ATTTTTACAT TATTTAAATA GAAAGGGGTG GATGAAGGGT TAGTATTCTT
AGTTACTTCA ATAATATAAC GTGTAAATT
 
Protein sequence
MGISRDSRHK RSATGAKRAQ FRKKRKFELG RQSANTKIGA KRIHSVRTRG GNQKFRALRV 
ETGNFSWGSE GVSRKTRIAT VVYHPSNNEL VRTNTLTKAA IVQIDATPFR QWYENHYGST
LGKKKNQPAA TEEEVKRSRK VERKLASRAG QAAIESAVDA QFGSGKLYAA ISSRPGQSGR
CDGYILEGEE LAFYLRRLTA KK