Gene PICST_83871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_83871 
SymbolPKH2 
ID4839581 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp113732 
End bp116531 
Gene Length2800 bp 
Protein Length861 aa 
Translation table12 
GC content44% 
IMG OID640390896 
Productaspartic proteinase precursor 
Protein accessionXP_001385043 
Protein GI150865712 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.925422 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACGTC CACCACTACC AACACAGCAA CAGCTGCAAC TGCATCTTCA ACAAAAACAA 
ATCTCTCCCA CACCCGTGAA GCGTACGGCT CGGGATTACC AGTTTGGAAC AAGGATTGGT
GAAGGTTCGT ACTCCACTGT GTTTTCTGCA ATGGATATCC ACAACTCAAA GACATATGCT
ATCAAGGTTC TTTCCAAACG ACATATTGTC AAGGAGGACA AGATCAAGTA TGTCAACATC
GAAAAGATAA CGTTGCATCG TCTTGGTCAA CAGCATCCTG GTATTGTTCA GTTGTACTAC
ACATTTCAAG ACGAAAAGAG TCTTTTCTTT GTGCTTGATT TTGCTGAATA CGGGGAGCTT
CTTTCAATTA TCCGTAAGTT CGGCTCGTTA TCAGAAGCTG TGCTGAAGTT CTACATGTGT
CAGATTGTTG ACGCTGTCAA ATTCATTCAT CTGAAGGGTG TGATCCATCG CGACTTGAAA
CCCGAAAACA TCCTTGTAGC ACACGATTTT AGTCTAAAGA TCACCGACTT TGGTGCAGCC
AAGCTTCTCG GAAACTCTGA CGACAACGAT GAGAAAATCG ATTACAACTC CGTAGACGAA
GCCCAGAACG TTCCGGTTAA GGTTAGTGAT GAAGATCGTA AGGGTTCCTT TGTAGGTACG
GCAGAGTACG TTTCGCCAGA GCTTCTCAAA CACAACATTT GTGGATTTGA AGCTGATGTA
TGGGCTCTTG GATGTATTTT GTACCAGTTT TTTCACGGAG TACCACCGTT CAAGGGCAAC
ACTGAGTACT TGACGTTCGA GAAAATTATC AATATCGATT ACTCGTACCG TCTGAAGTAC
CCACTTCCTC CAGATGTAAT CGAGATAATA GACAAAATCT TGTTGGCCGA TCCCCAACAA
CGGTCTACAA TACCTCAAAT CCAAAAGAGC CGTTGGTTCC AAGACGTTCC CTGGGACGAC
CTCAATTTCA TCTGGCATAG AAAAGTGCCC AGATTTGAGC CATTTGGCCC AGGCTCAAAC
AATGCACCTT CACCAGTAAT GTCGACATTC AAAACAGGCT CCAATAGAAA TATGAACAAG
TCTAACTCGT ACCAGCAATT GCATTCGCAA ATCCAGCATT CAGACTTTGC TTACATTCCC
TCTGTTGGTG TCAAGAAATC GTACCAGCCA GCTACTCGTA TCAAAAAGAA TATCGTTGCA
CCACAACAGC TTGGTCCACC AGCACAGATA TTACAACAAA CTCAGCAACA ACCTCCACCC
CCAGCACCAC TCGTACCTCC AATCTCACCA ACTCATCCTG CTTTTGTATC TGCACCATCT
CCTCCTAGAA CTCAAAACCA ACACCATGCG CATGCTTACA GGGCCCAAAT GCTCCAACAA
CCGAACATGA CACTTCCACC AAGTCAACAG CAACAGCAAC CACAGCAATC TCGTCTGGCA
GATAGCAGAA ACATAGCAAT GAATACAAGC TTGTCTGTAG ATAGCACTCC AAAGATATCT
CCACCTCCAC AAGCTCTGGA TTCTCCCTCA AAAAGCCCTC ACTATACAAA CTTGCGTACC
AATACAGCCT TTGCCATGAC TAATAGTACA TCTTCAAGCG ATAGCAATAA CCAGTCTTCT
GATGGTCAGC TAGCTAGCGG CTCTAGTTCC GGTAGCGCTT CCAGTTCCAG AAATGTCTCT
AGTTCTAAGC ACAAATCGCA GCCTCAGCCT CCTGCGATTC CAAATCTGTT GGTAAGTGCC
GCTGCTGCCG CTGCTGCTGG GAGTGGCATG AAGCAGGCGC AAAGACTGGC TCCTACTTTA
CCGCTGGCGA AGTTGGCAAT TGCTACCAAG TCTAAAGAAG ACTTGAAGCA GAAGACAACA
AAAACCAAAA TTATAGAAGC CAAGAATACT ATAAAGTTCA AGGAAATCTC GAATCTATTG
AGCCCCAATG AAAAAATCTT GAAGATGGAC ACGATATTGA AGCTGGAGTT GAGCAATAAG
ATCCTAAAGA GGCAACCAGC TGAACAGCTC GATGATTCCC TCATCGATGA TTTGATCACA
AAATACCTGA GGCAACTTGA GAAGAACGCT GAAGTTGTAG TCACTGTCAT TACCAATTTA
GCACGGGTCT TCTTTGTCAC AGCTAGTTTG GGTGTAATGC TCGTAGACCT CAAGGCAAAT
AACGGCGGAG ACTATCTGAT GTATGATTAT GAATTTGAGA GTCTAGCTGT TGACGATGAT
GGCAACGACA GCGAGGAAGT CTACGGCTAT TTGATTCTAG AATTGATTAG AAAAGGCGGA
GACTTGATCT TCTTGAAAAG AATCAGCGAT TTCGAAAGAT TATCGCTTGA AGATTCTGTC
AAAGTTGTGG ATAGAAGTGG GGATCAAGTT AAGCTAGGCA AGAACTATGG TTGGATCGAC
TGTTTGTTGA TGGCCAAAGA CATGGTTTCT CGAGAAAAAA GCCTGCCAGC CGTCCGAAAG
GAGAAGTCTC CTACTCCTAC ACTGTCTCCA TCTCTAAGTT CCAAATCAAG TTCAGTCCCT
ACTGCTGCAT CTAAGAAGAA ACCTACAAAG ACGACTGCAG TTCCAAAAAA GCCCAAGAAA
TCAACAGTAC AAACTAATAC TGGCAGAAGC AGTAGCACCA CCACTAACAA GACAATCATA
GCACAACCGA CTGCTGCCAA ACCGATGAGC AAGTTTGCCT ATGCCGCTGC TGCAGCCGCT
CACAAATAGA TTTCTTACAT TTTCTATAGA TATATAGACT GTAGAATAAA AGTGGAAATG
CATTATTTAC AGGGCGAGAA AGAGAGAGTT CCAAAGACCG
 
Protein sequence
MQRPPLPTQQ QSQSHLQQKQ ISPTPVKRTA RDYQFGTRIG EGSYSTVFSA MDIHNSKTYA 
IKVLSKRHIV KEDKIKYVNI EKITLHRLGQ QHPGIVQLYY TFQDEKSLFF VLDFAEYGEL
LSIIRKFGSL SEAVSKFYMC QIVDAVKFIH SKGVIHRDLK PENILVAHDF SLKITDFGAA
KLLGNSDDND EKIDYNSVDE AQNVSDEDRK GSFVGTAEYV SPELLKHNIC GFEADVWALG
CILYQFFHGV PPFKGNTEYL TFEKIINIDY SYRSKYPLPP DVIEIIDKIL LADPQQRSTI
PQIQKSRWFQ DVPWDDLNFI WHRKVPRFEP FGPGSNNAPS PVMSTFKTGS NRNMNKSNSY
QQLHSQIQHS DFAYIPSVGV KKSYQPATRI KKNIVAPQQL GPPAQILQQT QQQPPPPAPL
VPPISPTHPA FVSAPSPPRT QNQHHAHAYR AQMLQQPNMT LPPSQQQQQP QQSPSDSPSK
SPHYTNLRTN TAFAMTNSTS SSDSNNQSSD GQLASGSSSG SASSSRNVSS SKHKSQPQPP
AIPNSLVSAA AAAAAGSGMK QAQRSAPTLP SAKLAIATKS KEDLKQKTTK TKIIEAKNTI
KFKEISNLLS PNEKILKMDT ILKSELSNKI LKRQPAEQLD DSLIDDLITK YSRQLEKNAE
VVVTVITNLA RVFFVTASLG VMLVDLKANN GGDYSMYDYE FESLAVDDDG NDSEEVYGYL
ILELIRKGGD LIFLKRISDF ERLSLEDSVK VVDRSGDQVK LGKNYGWIDC LLMAKDMVSR
EKSSPAVRKE KSPTPTSSPS LSSKSSSVPT AASKKKPTKT TAVPKKPKKS TVQTNTGRST
QPTAAKPMSK FAYAAAAAAH K