Gene PICST_68812 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_68812 
SymbolSIK1 
ID4851526 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2064760 
End bp2066440 
Gene Length1681 bp 
Protein Length499 aa 
Translation table 
GC content43% 
IMG OID640393234 
Productnucleolar protein involved in pre- rRNA processing 
Protein accessionXP_001387632 
Protein GI126274754 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1498] Protein implicated in ribosomal biogenesis, Nop56p homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.116382 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0643077 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGCAGAACTC GAAATCATTA AAGACAACAT GGCTGGTTTG GACTATCTTC TCTTTGAAGA 
AGCTACCGGT TACGGGATCT TCAAGGTCTT GATCCAGCAG GATGACATCG CTTCTAGACA
GAAGGAAGTA CAGGAAGCTT CCAACGACTT GGGCAAGTTC TCCAAGATGA TTGAATTGGT
CTCTTTTGCA CCATTCAAGG GTGCTGCTCA AGCTTTGGAA AACGCCAACG ACATCTCTGA
AGGTTTAGTA TCCGACTACT TAAAGTCAAT CTTGGAATTG AACTTACCAA AGGGTTCTTC
TAAGAACAAG ATTGCCTTGG GTGTATCTGA CAAGAACTTG GGTCCTTCTA TCAAGGAAAT
ATTCCCTTAC GTTGATTGTT TGTCCAACGA AATCGTCCAG GACTTCTTGA GAGGTATCAG
AGTCCACGGC GACAAGTTGT TCAAGGATTT GCACGAAGGT GATATTGAAA GAGCACAGTT
AGGTTTGGGT CATGCCTTCT CTAGAGCTAA GGTTAAGTTC TCAGTACAAA AGAATGACAA
CCACATCATT CAGGCTATTG CTTTGTTGGA CCAGTTGGAC AAGGATATCA ACACCTTCTC
CATGAGAGTC AAGGAATGGT ACGGATGGCA CTTTCCAGAG TTGGCCAAAA TTGTCCCAGA
CAATTACACT TTTGCCAAGT TGGCTCTTTT CATCAAAGAC AAGGCTTCTT TGACTGAAGA
CTCGTTGCAT GACATCGCTG CTTTGGTTAA CGAAGACTCT GGTGTTGCCC AGAGAATCAT
AGATAATGCC AGAATCTCTA TGGGACAAGA CATCTCGGAA CAGGACATGC AGAACGTTTC
AACTTTCGCT GAAAGAGTGG TAAACATCAG TGACTACCGT ACCAAGTTGT TCCAGTATTT
AACAGATAAG ATGCACACTG TTGCTCCTAA CTTGTCGACG TTGATTGGAG AAGTTGTTGG
TGCCAGATTG ATCTCTCACG CTGGTTCTTT GACCAACTTG TCTAAGCAAG CCGCCTCTAC
TGTTCAAATC TTGGGTGCTG AAAAGGCCTT GTTCAGAGCT TTGAAGACTA AGGGTAACAC
TCCTAAATAC GGGTTAATCT ATCACTCGTC TTTCATTGGT AAGGCTTCTG CCAAGAACAA
GGGTAGAATT TCCAGATACT TAGCTAACAA GTGTTCCATT GCTTCCAGAA TCGACAACTA
CTCGGATGAG CCATCTACTG CCTTTGGTGA AATATTAAAG AAGCAGGTGG AAGAAAGATT
GAACTTCTAC GACACCGGTG CCCCACCTAT GAAGAATTCC GATGCCATTA AAGCTGCTTT
GGCTTTAGGT GCTAGCGACT TGGCTGGAGT ACCAGCCTCC AACGAAGATG ACGAGCCTGA
AACTCCTAAG AAGGAAAAGA AGGAGAAGAA GGAAAAGAAG GAAAAGAAGG AAAAGAAGGA
AAAGAAGGAA AAGAAGGAAA AGAAGGAAAA GAAGCGTAAG GCTGAAGATG ATGAATCTCC
AAAGAAGAAG AAGAAGTCCA AGAACTAGAT TATCACCTCT TTTTAAACAC TCGGCATTTC
TCGACGACCT TCAATTCGTC AACCCCAGTT GCTTTTTTAT CCATTGTCTG GTCACGGCCT
GATCGACAAT ATTATTGTAA ACTATAGTAC TTTCTATTGC ATGTTAATAT ACTAGATACC
G
 
Protein sequence
MAGLDYLLFE EATGYGIFKV LIQQDDIASR QKEVQEASND LGKFSKMIEL VSFAPFKGAA 
QALENANDIS EGLVSDYLKS ILELNLPKGS SKNKIALGVS DKNLGPSIKE IFPYVDCLSN
EIVQDFLRGI RVHGDKLFKD LHEGDIERAQ LGLGHAFSRA KVKFSVQKND NHIIQAIALL
DQLDKDINTF SMRVKEWYGW HFPELAKIVP DNYTFAKLAL FIKDKASLTE DSLHDIAALV
NEDSGVAQRI IDNARISMGQ DISEQDMQNV STFAERVVNI SDYRTKLFQY LTDKMHTVAP
NLSTLIGEVV GARLISHAGS LTNLSKQAAS TVQILGAEKA LFRALKTKGN TPKYGLIYHS
SFIGKASAKN KGRISRYLAN KCSIASRIDN YSDEPSTAFG EILKKQVEER LNFYDTGAPP
MKNSDAIKAA LALGASDLAG VPASNEDDEP ETPKKEKKEK KEKKEKKEKK EKKEKKEKKE
KKRKAEDDES PKKKKKSKN