Gene PICST_32903 details

Gene Information

Locus tag: PICST_32903
Symbol:
ID: 4840069
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 924779
End bp: 926044
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 12
GC content: 40%
IMG OID: 640391384
Product: predicted protein
Protein accession: XP_001385536
Protein GI: 150866063
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0052] Ribosomal protein S2
TIGRFAM ID: [TIGR01011] ribosomal protein S2, bacterial type
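
The coordinates and lengths in this record are mutually consistent, which the short sketch below checks. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 924779
end_bp = 926044

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon,
# so the protein is one residue shorter than the codon count.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints: 1266 421
```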


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.174506
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCAGA AAGAGTTACG TGAAAAGGCT ATTGCTGATG CTCTTGCTAG AGAAGAAGAG 
GAATACGCGA AGATAAAGGC CATGCGCGAG TTAAACAACC AGACATCCGA ATTAATAACT
TCTTTAAACA AAATTCCATC AAATTTGATC AATTCGAGAT TGCAAAAGTT GCAGCAAGAT
TTGCAAAACT TGCCAGAATA CAAAGTGAAG GAATTAGATG AAGAACTAGA AGAATTTATG
TTCTCCCACA TGAAATTGCC ATATGGTGAA GTGTACAACA GGCCATGGAG TGGAATCTCA
AACGAAGAAT CATCTTCTAT ATCCTCTAAC AAAGATGAAG TGATACAACA GAGACTTGTT
TCAACAACTT CGAGTTCATA TTCCACTCAA TTTCCAAACT TAAAGCCAAC GCCAGACTAT
AGAGGGTACT CTGAACAAGA GTTATTTTTG AGACAACTCG CACATTCTCG TCATTCTGGT
TCCTTGGGAT CTAAGTTATC AAATGTATAT AGACCACAAG ACGACATCAA GAACCCAACG
AAGTTAAAGG ATGTCTCTAT AGCTACATTA ATGGCAGCAG GTTGTCATTT GGGACACTCC
AAATCAAATT GGAGACCAAC AACCCAGCCA TTCATTTATG GTGAATATGA TGGTATTCAT
TTAATTGACT TGAATGAGAC CGCTGCCGCC CTTAAGAGAG CTAGTAATGT CATCAAGGGA
GTTTCTAAAA AGGGAGGTAT CATTCTTTAC GTTGGCACCT CAAAGAATGT ATTCCAAAAC
AGAGCGTTGG AAGAAGCTGC AATCCGTTCT AATGGTTACT ATGTTACCAA GAGATGGATT
CCAGGTACGA TTACCAACTA CACTGAAGTC ACTAAGCAAA TCCAGGGAAC ACAGAAGATC
GAAGTCGATA TGGAAAATAA ACCAACTGGA CGTAACTTAG GTGTTGAACA GGGCCAGCTT
GTGAAGCCAG ACTTAGTCGT GATCCTTAAC CCTGTTGAAA ACAGAAACTG TATCAACGAG
TGTATCTTGC TGAGAGTTCC CACTATTGGT TTGTGTGACA CTGACATGGA ACCATCCTTA
TTGACTTACC CAATTCCATG TAACGACGAT TCCGTCAGGT CAGTAACCTT TATGACGGGT
ATTTTGTCCA AGTCTGCTGA AGAAGGTTTG TTGGAAAGGT TGGAAGAAGT AAACAAGTAC
AACCAATCCA AAGTTAAAGG TCTGATTCAA AAGAGAGAAT TGAAACATAG CAGACGTTCC
AGCTAA
 
Protein sequence
MSQKELREKA IADALAREEE EYAKIKAMRE LNNQTSELIT SLNKIPSNLI NSRLQKLQQD 
LQNLPEYKVK ELDEELEEFM FSHMKLPYGE VYNRPWSGIS NEESSSISSN KDEVIQQRLV
STTSSSYSTQ FPNLKPTPDY RGYSEQELFL RQLAHSRHSG SLGSKLSNVY RPQDDIKNPT
KLKDVSIATL MAAGCHLGHS KSNWRPTTQP FIYGEYDGIH LIDLNETAAA LKRASNVIKG
VSKKGGIILY VGTSKNVFQN RALEEAAIRS NGYYVTKRWI PGTITNYTEV TKQIQGTQKI
EVDMENKPTG RNLGVEQGQL VKPDLVVILN PVENRNCINE CILSRVPTIG LCDTDMEPSL
LTYPIPCNDD SVRSVTFMTG ILSKSAEEGL LERLEEVNKY NQSKVKGSIQ KRELKHSRRS
S
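
Translation table 12 is the alternative yeast nuclear code, in which the codon CTG (CUG in mRNA) is read as serine rather than leucine. The minimal sketch below shows how that affects this gene. It builds the codon table from the standard code with that single change and translates two short fragments copied from the gene sequence above. The helper names `build_table` and `translate` are hypothetical and not part of any database tooling.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

def build_table(ctg_as_serine):
    """Return a codon -> amino acid dict; optionally read CTG as Ser."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AAS))
    if ctg_as_serine:
        table["CTG"] = "S"  # the one difference used here for table 12
    return table

def translate(dna, table):
    """Translate complete codons only; spaces in the input are ignored."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))

table_1 = build_table(ctg_as_serine=False)
table_12 = build_table(ctg_as_serine=True)

# First 30 bases of the gene sequence.
start_fragment = "ATGTCGCAGA AAGAGTTACG TGAAAAGGCT"
# In-frame fragment of the gene sequence containing a CTG codon.
ctg_fragment = "TGTATCTTGC TGAGAGTTCC"

print(translate(start_fragment, table_12))  # MSQKELREKA
print(translate(ctg_fragment, table_12))    # CILSRV
print(translate(ctg_fragment, table_1))     # CILLRV
```

The table 12 result, CILSRV, matches the listed protein sequence (CILSRVPTIG), whereas the standard code would place a leucine at that position.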