Gene PICST_81803 details


Gene Information

Locus tag: PICST_81803
Symbol: YAK1
ID: 4836709
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 1756658
End bp: 1759045
Gene Length: 2388 bp
Protein Length: 704 aa
Translation table: 12
GC content: 38%
IMG OID: 640388024
Product: serine-threonine protein kinase
Protein accession: XP_001383106
Protein GI: 150864334
COG category: [K] Transcription; [L] Replication, recombination and repair; [R] General function prediction only; [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
TIGRFAM ID:
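
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the reported gene length but is not stated on the page. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 1756658
end_bp = 1759045
protein_length_aa = 704

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Bases needed to encode the protein plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases in the gene span beyond that coding requirement.
remaining_bp = gene_length_bp - coding_bp

print(gene_length_bp)  # 2388
print(coding_bp)       # 2115
print(remaining_bp)    # 273
```

The 273 bp difference between the gene span and the coding requirement is consistent with the gene being flagged as spliced, although the page itself does not list exon or intron boundaries.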


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGAGG ATCTGGTCTG TGTCGAAGAT GAGTTCTCAC GTATAAACAT CAACAAGTCG 
CCCACAAGAC GAAAAAAAGT CTACCCACAA AACCTTTATC CTCCTCAGAA CCAGATATAT
TTCCCCCCTC CAGATCCGCC ATCCACGGCT ACCTCACCAT CCTCCACATC AGATGTTTCA
TCTTCGGATT CCTTAATCCA CCGATATGAT GGCAATGGCA TTGTCGATAC AAATATTGTC
GGAAAGACAA ACAATACAAG TAACACTATC AACAATAACA CCATCATTAC TAATAATGTT
ATTGCTAACA CTATTAGTAC TCCTTTGAGA AACCATTACG AGTCCAATTA CAATAGCAAT
AGCAATAACT GTGGTGCTCC CTCTACCCCT TCCACTGTAC CATCTCTAGA AAAGAAGTCA
TTATCCACTA CGAAAATCCA TCCGTCAGAC AAAAGACTAC TAAGCTATCC GATGTCTGAA
AAAGCAGCTT CGAAACAAGA CTACCAGTTT TCATTTATGA GAGAAACATC GTCTTCTATC
CAGAAGAAAC ACAGCACATA CTCGGGTGGA CTCTCCAAAC CAAATCAGCC GTATCTTACG
ACTGGTTCTG CCACTAAAAA ACGCCTCGAA TACTCAAGAC AACTTTCTCA TAATCTCCGA
TCAAGAAGAT TGTTGTCTTC GAATTTTCCT TCGAAAACAC TGCCTTCGGA CATCACCAAT
TTGCCCTACT CCTTACCCGC TAAGTACGGA GGAATGCCGG GGGAAGACTC GCCTACAAAG
CAAAAGGTAA TACGGAATGC GTCGCAGCCA CTTCCGTTAA AGTCAGTCAA TAACGTGTTC
AATAGACTCT ATCCTAATAG CTTGGAATCT GAAGTGAAAC CACGAGTAGC TCCTGTTTTG
CAAACAACAA GAAGAAGTAC ACCTTCGCAG ATCCTTAATC GCCAGGAGAA AGTAGGTATG
GAAAACAAAG ATATCAACAG TGTGAAAGTT AACGATATTA CGGAACTATT CTCCATATTA
TATGCGTTTG ATGAACAACT CTTTCCCAAC GAAGGAGATT CGGATTGTCC AGAAGCGAAG
CCAATACAGG CAGTTGAGCT TGCTAAGCTC AAGATGAATA TCTATGAAAG GGGAGAGATC
ATCCGGAAAC AGCAACTTTA TTTTGTGCCA CAAGGAATAG AGAGAAATTT GAATATCAAG
AACTATCAGA ATAACTTTGG TTTTGATGAT GCGAATGGAA ACTACATTAT CGTCGAAGGA
GATCACATCA ACTATCGTTT TGAAGTGTTG AAAATGTTGG GCAATGGATC ATTTGGAAAC
GTGATAATGA GCAAAGATCA CAAATATTCT AGTCGGCTTG TTGCTACTAA AATCATTAAG
AACGATCTTA ATTGGTCCTT GCAAGCTATC AACGAAATAA AAATGCTCAG ACTATTAAAT
GAAAAGGAAA CAAACGAAAA CATCCTTAAG TACTACGACC ATTTTAATTT CAGGAGTCAT
ATGTGCATTG TCACAGAATT ATTGTCTATA AACCTTTACT CATTGCTAGA AGTAAGTCAG
TTCAGAGGAT TCTCCTTGAA TATTGTTCAA TCAATAACAA AGCAAATCTT GAATGGTTTG
CAATATATGC ATAGACTCAA TGTTATACAT TGTGATATAA AGCCCGAAAA CATTATGATC
CAGTTGCCAC ATCTGCCACA AGTTGGTACT TTTGTCGTTA AGATAATTGA TTTTGGATCA
TCGTGCTTAA GTAATGAAAT TTCATTCACG TATATCCAGT CAAGATTCTA CAGAGCACCA
GAAGTCATCA TAGGTGCCAA TTATACCGAA GGAATTGATG TTTGGTCCTT AGGCTGTGTC
ATTGCTGAAC TTTTTACTGG TGTTCCTCTA TTGCCAGGCA AGAACGAAAT AGAACAGGTA
GGATTAATTC TAGAATTGTT CGGAGCACCA AAAAGTACAA CTATCTTAAG ATTGAGGAAG
TCTTTGACAA GGTCGGTTCA AAAGAAGTCT TTCGAAATGA ACGAAAATAG TCCCCATGTC
AATGAAAAGC TAATAAAGAA AACGCTATTG TTCAGGTTGT TTGACATTAA TGGAAAAATC
AATATGTCGT TGCTTAACTA CCATAATGCC AACTCAACAG CGTATTCGGC TAAGAAGCAG
TTTAAGTTAA ATTCCAGAAA CTTGGAAATA TATTTGAGTT TGAACAAATG TCCCTCGCAG
TTGAACAAGC TGTTCCTTCG TTTCCTTCAG AAGATATTTG TATGGGATCC CGTTGAACGT
TCTAGTATTT TGCAGTTGAC GGAAGAACCG TTTGTCACTT CTCAAGATTA ATTACATATT
TATTGCATTA TATTATACAC ATAAGGAATC TATCCGTTTG TCTATTAT
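
A GC percentage like the one in the Gene Information section could be computed from a blocked listing such as the one above roughly as follows. This is a minimal sketch: `clean_sequence` and `gc_content` are hypothetical helper names, and the page does not say exactly which sequence its GC content figure was calculated over.

```python
def clean_sequence(text: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listing, used here as a small sample.
sample = clean_sequence(
    "ATGTCCGAGG ATCTGGTCTG TGTCGAAGAT GAGTTCTCAC GTATAAACAT CAACAAGTCG"
)
print(len(sample))  # 60
print(round(gc_content(sample), 1))
```

Passing the full listing through `clean_sequence` before calling `gc_content` would give a figure to compare against the reported value.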
 
Protein sequence
MSEDSVCVED EFSRININKS PTRRKKVYPQ NLYPPQNQIY FPPPDPPSTA TSPSSTSDTN 
NTSNTINNNT IITNNVIANT IKKKSLSTTK IHPSDKRLLS YPMSEKAASK QDYQFSFMRE
TSSSIQKKHS TYSGGLSKPN QPYLTTGSAT KKRLEYSRQL SHNLRSRRLL SSNFPSKTSP
SDITNLPYSL PAKYGGMPGE DSPTKQKVIR NASQPLPLKS VNNVFNRLYP NSLESEVKPR
VAPEKVGMEN KDINSVKVND ITELFSILYA FDEQLFPNEG DSDCPEAKPI QAVELAKLKM
NIYERGEIIR KQQLYFVPQG IERNLNIKNY QNNFGFDDAN GNYIIVEGDH INYRFEVLKM
LGNGSFGNVI MSKDHKYSSR LVATKIIKND LNWSLQAINE IKMLRLLNEK ETNENILKYY
DHFNFRSHMC IVTELLSINL YSLLEVSQFR GFSLNIVQSI TKQILNGLQY MHRLNVIHCD
IKPENIMIQL PHSPQVGTFV VKIIDFGSSC LSNEISFTYI QSRFYRAPEV IIGANYTEGI
DVWSLGCVIA ELFTGVPLLP GKNEIEQVGL ILELFGAPKS TTILRLRKSL TRSVQKKSFE
MNENSPHVNE KLIKKTLLFR LFDINGKINM SLLNYHNANS TAYSAKKQFK LNSRNLEIYL
SLNKCPSQLN KSFLRFLQKI FVWDPVERSS ILQLTEEPFV TSQD
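
Translation table 12 is the NCBI alternative yeast nuclear code, in which the codon CTG encodes serine rather than leucine. The sketch below illustrates this on the first 30 bases of the gene sequence, which correspond to the first ten residues of the protein sequence. It builds the standard code with the Python standard library and applies the single CTG override. Because the gene is marked as spliced, translating the full genomic listing this way is not expected to reproduce the whole protein. The `translate` helper is illustrative and not part of any library.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

standard = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), STANDARD_AAS)
}

# Table 12 differs from the standard code in one codon: CTG -> Ser.
table12 = dict(standard)
table12["CTG"] = "S"


def translate(dna: str, table: dict) -> str:
    """Translate complete codons of dna in frame 0 using the given table."""
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence listing.
first_30 = "ATGTCCGAGGATCTGGTCTGTGTCGAAGAT"
print(translate(first_30, table12))   # MSEDSVCVED
print(translate(first_30, standard))  # MSEDLVCVED
```

The table 12 result matches the start of the listed protein sequence, while the standard code places a leucine at the fifth position.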