Gene PICST_77059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_77059 
Symbol 
ID4838231 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp907345 
End bp908459 
Gene Length1115 bp 
Protein Length189 aa 
Translation table12 
GC content41% 
IMG OID640389546 
Productpredicted protein 
Protein accessionXP_001383457 
Protein GI126133865 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0522] Ribosomal protein S4 and related proteins 
TIGRFAM ID[TIGR01018] ribosomal protein S4(archaeal type)/S9(eukaryote cytosolic type) 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.425894 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.528605 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AGTTAACATG CCACGTATGT AATACTAGAT GATACTAACA GAACAGCAAT AACTTCATAC 
AAATGATTCT GCACCAGCTC TCGAGTTGTT CGCTGTTTCA TTGAGGTTCG GGTCCTCGTT
TCTTTGAATT ACTTCAGAGA TAATTTTTTA TTTATTGGGG CTGAATTCCC GACACTTCAG
TCTCAAGCTA CATAACGCAA GTAACCTTGA GTTAGCATAC TTTGTATGTT ACTCTTGGGT
CTTCGACTTT GCCAAGCTAT ACCGTATCAC CTTCAGGGTT GATTTTTTGA ATGTAGAAGT
TCGAAAATTC TAGCACACCT AGACGGTGAA AACGGTATAT TGCCCAGCAA GTTACATTGA
GTGATGTGGT TTTAAAAAAT TAGATCTGTA GTAACTAAGT AACGTAGAGT GTACCCGTGT
TGTTTTGTGC CCAAATTCTT AAAAACAATA TGCTAACATG TAAATAGGTG CTCCAAGAAC
TTACTCTAAG ACTTACACCG TCCCAAAGCA ACCTTACGAA TCCACTCGTT TGGACGCTGA
ATTGAAGTTG GCCGGTGAAT ACGGTTTGAA GAACAAGAGA GAAATCTACA GAATTGGTTT
CCAATTGTCC AAGATCAGAA GAGCTGCTCG TGACCTTTTG ACCAGAGATG AAAAGGACCC
AAAGAGATTG TTCGAAGGTA ACGCTTTGAT CAGAAGATTG GTCAGAGTCG GTGTCTTGTC
CGAAGACAAG ATGAAGTTGG ATTACGTCTT GGCTTTGAGA ATCGAAGATT TCTTGGAAAG
AAGATTGCAA ACCCAAGTCT TCAAGTTGGG TTTGGCTAGA TCCATCCACC ACGCCAGAGT
CTTGATCACC CAAAGACACA TTGCCGTTGG TAAGCAAATC GTCAACATCC CATCTTTCAC
CGTCAGATTG GACTCCCAAA AGCACATTGA CTTTGCTCAC AACTCCCCAT ACGGAGGTGG
CAGAGCCGGT AGAGTTAAGA GAAAGAACCA AGGTAAGGGT GGTGACGAAG GTGCCGAAGA
CGAAGAATAA ATTACTTAAT CTACTAATAT CATTCTTGTA TTTAATAGTT TATCTTCCCC
TTTTCTCATC AATATAATTG GTTTTTATAT TTCCA
 
Protein sequence
MPRAPRTYSK TYTVPKQPYE STRLDAELKL AGEYGLKNKR EIYRIGFQLS KIRRAARDLL 
TRDEKDPKRL FEGNALIRRL VRVGVLSEDK MKLDYVLALR IEDFLERRLQ TQVFKLGLAR
SIHHARVLIT QRHIAVGKQI VNIPSFTVRL DSQKHIDFAH NSPYGGGRAG RVKRKNQGKG
GDEGAEDEE