Gene PICST_50481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_50481 
Symbol 
ID4840822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp572975 
End bp574390 
Gene Length1416 bp 
Protein Length471 aa 
Translation table12 
GC content45% 
IMG OID640392137 
Productpredicted protein 
Protein accessionXP_001386705 
Protein GI150866939 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0522] Ribosomal protein S4 and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.830079 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.176883 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAGAA AGACCCAGAA CTTGCACTCG TTGAGTAGGG GCCGTGTCCG TGCCTCTATG 
AACAAGTACA ATTTGTTCAA CTTATATAAA AAGGCTCCAG TGAGATATGA TGGAAAGACT
TTGTACCAAC AGAAGTGGAA CGCTAAAGCT GAAACCAGAG CTTATCACGG TGAACATTTG
ACTGAAAAGC GTTGGAAGGC CATCTTTGAT CCTTCATTAG AAACCGTAGC TCAATTGGAT
GCTTCATTGA AGGGTTCTAA AGTAGCACCT ACACCTATGA CTCTCCAGAC CTATGCCTCT
TTAGAGAAGA GATTGGAGTT GGCTGTCTTC AGATCGATGT TTGCTTCCTC TGTGAGACAA
GCTCGTGAGT TCATTCTTGG TGGAAGTGTT CTGGTCAACG GTGTTGTGAT AAAGCACCCT
TCATTCCCCT TGAAGAGTGG AGACATTTTC CATGTCAAGC CAGAGAAGGT CTTGTTAGCT
ATGGGCAGAA CCAAACCTTC GCTTGAAAAA GCCATTAAGG TCGACAATCA ACAGATCAGC
GCCTGGAACC GCTATGTGAA AGCAGCACAA GAAAATCCCC GCGAAGTATG GGAAGCCAAA
CAGAAGAAAC CAGCATCTTT AAATACCATC AGAAACATCA ACGGTTCCGA ATCGGCCGAG
GAATTCAACA AGAAAATCGA ACAAACTATG AAATCGCAAC AGAATGATGC TACGCGTGAG
TCTATTTTAT TGAAGATTAT CAGTTTGGGA AGAGGAATCG AAAGCAACGG CGGAGTCGTT
TCTGCAGAAA CGTTTAAGGA ATTCAACTAC GACAACGAAA GCAACAGCAA CAATGCCCAG
AAGGCTCATA ACGTGTACAA AAAGTTGTCT GATGCTAAAC ACAAATTGAT TGGCGAACAC
AATATCGAAA ATGCAGCTGA GTTTGTCAAT AAGAAAGCAG ATGATTCTGA ATCTGCCGCG
GACAAACAGT TAGCTCGTTC CGTCAAGCAG ATTCTCCGTG AGCTCCAGAA GTCTACCTGG
GAAGCCATAC GTGTTGGAGC CCAGCAGCAA CAATCTGGCA AGGTGCTCAC TGCCTCATTC
ACGTCTGATT TCGTTAAGCT GTTGGTGCCA CATCCAGCCT TGAACAAGGA ATCTATTCTT
GAAGACGAGA CTCTAGCCAA CATCAAGTTC CCATGGCAAA AATCGTTATT TGGCCGTCAA
GATCCTTCCA AGCCATACTT CACTCCATGG ACACCACGTC CTTTTATCGG TGCCTTTGCC
ATCTTGCCGT CGCACATCGA GGTATCATTC AGCACATGTC ATGCTGTTTA CTTAAGAGAC
CCTATCGCCA GACCAGGTCA TTCTGAGGTC ATTTCTCCTT TCCCCGACCA CACCCACGAA
AGAGCCTATA TGTTCTACGC CAGAAAGGGA TTGTAG
 
Protein sequence
MPRKTQNLHS LSRGRVRASM NKYNLFNLYK KAPVRYDGKT LYQQKWNAKA ETRAYHGEHL 
TEKRWKAIFD PSLETVAQLD ASLKGSKVAP TPMTLQTYAS LEKRLELAVF RSMFASSVRQ
AREFILGGSV SVNGVVIKHP SFPLKSGDIF HVKPEKVLLA MGRTKPSLEK AIKVDNQQIS
AWNRYVKAAQ ENPREVWEAK QKKPASLNTI RNINGSESAE EFNKKIEQTM KSQQNDATRE
SILLKIISLG RGIESNGGVV SAETFKEFNY DNESNSNNAQ KAHNVYKKLS DAKHKLIGEH
NIENAAEFVN KKADDSESAA DKQLARSVKQ ILRELQKSTW EAIRVGAQQQ QSGKVLTASF
TSDFVKSLVP HPALNKESIL EDETLANIKF PWQKSLFGRQ DPSKPYFTPW TPRPFIGAFA
ILPSHIEVSF STCHAVYLRD PIARPGHSEV ISPFPDHTHE RAYMFYARKG L