Gene Information |
Locus tag | PICST_83871 |
Symbol | PKH2 |
ID | 4839581 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 113732 |
End bp | 116531 |
Gene Length | 2800 bp |
Protein Length | 861 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640390896 |
Product | aspartic proteinase precursor |
Protein accession | XP_001385043 |
Protein GI | 150865712 |
COG category | [K] Transcription; [L] Replication, recombination and repair; [R] General function prediction only; [T] Signal transduction mechanisms |
COG ID | [COG0515] Serine/threonine protein kinase |
TIGRFAM ID | |
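
The derived fields in the table above (Gene Length, GC content) can be cross-checked from the raw values. The following is a minimal sketch, assuming the Start bp and End bp coordinates are 1-based and inclusive (this is consistent with the reported 2800 bp, but the record does not state it). The helper names `gene_length` and `gc_percent` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace (such as the 10-base grouping used in this record)
    and any other characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Coordinates taken from the Gene Information table.
print(gene_length(113732, 116531))  # 2800
```

Passing the full Gene sequence field to `gc_percent` should give a value that can be compared with the reported GC content, keeping in mind that the reported figure is rounded to a whole percent.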
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.925422 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCAACGTC CACCACTACC AACACAGCAA CAGCTGCAAC TGCATCTTCA ACAAAAACAA ATCTCTCCCA CACCCGTGAA GCGTACGGCT CGGGATTACC AGTTTGGAAC AAGGATTGGT GAAGGTTCGT ACTCCACTGT GTTTTCTGCA ATGGATATCC ACAACTCAAA GACATATGCT ATCAAGGTTC TTTCCAAACG ACATATTGTC AAGGAGGACA AGATCAAGTA TGTCAACATC GAAAAGATAA CGTTGCATCG TCTTGGTCAA CAGCATCCTG GTATTGTTCA GTTGTACTAC ACATTTCAAG ACGAAAAGAG TCTTTTCTTT GTGCTTGATT TTGCTGAATA CGGGGAGCTT CTTTCAATTA TCCGTAAGTT CGGCTCGTTA TCAGAAGCTG TGCTGAAGTT CTACATGTGT CAGATTGTTG ACGCTGTCAA ATTCATTCAT CTGAAGGGTG TGATCCATCG CGACTTGAAA CCCGAAAACA TCCTTGTAGC ACACGATTTT AGTCTAAAGA TCACCGACTT TGGTGCAGCC AAGCTTCTCG GAAACTCTGA CGACAACGAT GAGAAAATCG ATTACAACTC CGTAGACGAA GCCCAGAACG TTCCGGTTAA GGTTAGTGAT GAAGATCGTA AGGGTTCCTT TGTAGGTACG GCAGAGTACG TTTCGCCAGA GCTTCTCAAA CACAACATTT GTGGATTTGA AGCTGATGTA TGGGCTCTTG GATGTATTTT GTACCAGTTT TTTCACGGAG TACCACCGTT CAAGGGCAAC ACTGAGTACT TGACGTTCGA GAAAATTATC AATATCGATT ACTCGTACCG TCTGAAGTAC CCACTTCCTC CAGATGTAAT CGAGATAATA GACAAAATCT TGTTGGCCGA TCCCCAACAA CGGTCTACAA TACCTCAAAT CCAAAAGAGC CGTTGGTTCC AAGACGTTCC CTGGGACGAC CTCAATTTCA TCTGGCATAG AAAAGTGCCC AGATTTGAGC CATTTGGCCC AGGCTCAAAC AATGCACCTT CACCAGTAAT GTCGACATTC AAAACAGGCT CCAATAGAAA TATGAACAAG TCTAACTCGT ACCAGCAATT GCATTCGCAA ATCCAGCATT CAGACTTTGC TTACATTCCC TCTGTTGGTG TCAAGAAATC GTACCAGCCA GCTACTCGTA TCAAAAAGAA TATCGTTGCA CCACAACAGC TTGGTCCACC AGCACAGATA TTACAACAAA CTCAGCAACA ACCTCCACCC CCAGCACCAC TCGTACCTCC AATCTCACCA ACTCATCCTG CTTTTGTATC TGCACCATCT CCTCCTAGAA CTCAAAACCA ACACCATGCG CATGCTTACA GGGCCCAAAT GCTCCAACAA CCGAACATGA CACTTCCACC AAGTCAACAG CAACAGCAAC CACAGCAATC TCGTCTGGCA GATAGCAGAA ACATAGCAAT GAATACAAGC TTGTCTGTAG ATAGCACTCC AAAGATATCT CCACCTCCAC AAGCTCTGGA TTCTCCCTCA AAAAGCCCTC ACTATACAAA CTTGCGTACC AATACAGCCT TTGCCATGAC TAATAGTACA TCTTCAAGCG ATAGCAATAA CCAGTCTTCT GATGGTCAGC TAGCTAGCGG CTCTAGTTCC GGTAGCGCTT CCAGTTCCAG AAATGTCTCT AGTTCTAAGC ACAAATCGCA GCCTCAGCCT CCTGCGATTC CAAATCTGTT GGTAAGTGCC GCTGCTGCCG CTGCTGCTGG GAGTGGCATG AAGCAGGCGC AAAGACTGGC TCCTACTTTA 
CCGCTGGCGA AGTTGGCAAT TGCTACCAAG TCTAAAGAAG ACTTGAAGCA GAAGACAACA AAAACCAAAA TTATAGAAGC CAAGAATACT ATAAAGTTCA AGGAAATCTC GAATCTATTG AGCCCCAATG AAAAAATCTT GAAGATGGAC ACGATATTGA AGCTGGAGTT GAGCAATAAG ATCCTAAAGA GGCAACCAGC TGAACAGCTC GATGATTCCC TCATCGATGA TTTGATCACA AAATACCTGA GGCAACTTGA GAAGAACGCT GAAGTTGTAG TCACTGTCAT TACCAATTTA GCACGGGTCT TCTTTGTCAC AGCTAGTTTG GGTGTAATGC TCGTAGACCT CAAGGCAAAT AACGGCGGAG ACTATCTGAT GTATGATTAT GAATTTGAGA GTCTAGCTGT TGACGATGAT GGCAACGACA GCGAGGAAGT CTACGGCTAT TTGATTCTAG AATTGATTAG AAAAGGCGGA GACTTGATCT TCTTGAAAAG AATCAGCGAT TTCGAAAGAT TATCGCTTGA AGATTCTGTC AAAGTTGTGG ATAGAAGTGG GGATCAAGTT AAGCTAGGCA AGAACTATGG TTGGATCGAC TGTTTGTTGA TGGCCAAAGA CATGGTTTCT CGAGAAAAAA GCCTGCCAGC CGTCCGAAAG GAGAAGTCTC CTACTCCTAC ACTGTCTCCA TCTCTAAGTT CCAAATCAAG TTCAGTCCCT ACTGCTGCAT CTAAGAAGAA ACCTACAAAG ACGACTGCAG TTCCAAAAAA GCCCAAGAAA TCAACAGTAC AAACTAATAC TGGCAGAAGC AGTAGCACCA CCACTAACAA GACAATCATA GCACAACCGA CTGCTGCCAA ACCGATGAGC AAGTTTGCCT ATGCCGCTGC TGCAGCCGCT CACAAATAGA TTTCTTACAT TTTCTATAGA TATATAGACT GTAGAATAAA AGTGGAAATG CATTATTTAC AGGGCGAGAA AGAGAGAGTT CCAAAGACCG
Protein sequence | MQRPPLPTQQ QSQSHLQQKQ ISPTPVKRTA RDYQFGTRIG EGSYSTVFSA MDIHNSKTYA IKVLSKRHIV KEDKIKYVNI EKITLHRLGQ QHPGIVQLYY TFQDEKSLFF VLDFAEYGEL LSIIRKFGSL SEAVSKFYMC QIVDAVKFIH SKGVIHRDLK PENILVAHDF SLKITDFGAA KLLGNSDDND EKIDYNSVDE AQNVSDEDRK GSFVGTAEYV SPELLKHNIC GFEADVWALG CILYQFFHGV PPFKGNTEYL TFEKIINIDY SYRSKYPLPP DVIEIIDKIL LADPQQRSTI PQIQKSRWFQ DVPWDDLNFI WHRKVPRFEP FGPGSNNAPS PVMSTFKTGS NRNMNKSNSY QQLHSQIQHS DFAYIPSVGV KKSYQPATRI KKNIVAPQQL GPPAQILQQT QQQPPPPAPL VPPISPTHPA FVSAPSPPRT QNQHHAHAYR AQMLQQPNMT LPPSQQQQQP QQSPSDSPSK SPHYTNLRTN TAFAMTNSTS SSDSNNQSSD GQLASGSSSG SASSSRNVSS SKHKSQPQPP AIPNSLVSAA AAAAAGSGMK QAQRSAPTLP SAKLAIATKS KEDLKQKTTK TKIIEAKNTI KFKEISNLLS PNEKILKMDT ILKSELSNKI LKRQPAEQLD DSLIDDLITK YSRQLEKNAE VVVTVITNLA RVFFVTASLG VMLVDLKANN GGDYSMYDYE FESLAVDDDG NDSEEVYGYL ILELIRKGGD LIFLKRISDF ERLSLEDSVK VVDRSGDQVK LGKNYGWIDC LLMAKDMVSR EKSSPAVRKE KSPTPTSSPS LSSKSSSVPT AASKKKPTKT TAVPKKPKKS TVQTNTGRST QPTAAKPMSK FAYAAAAAAH K
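
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on the first 60 bases of the Gene sequence field. It assumes those bases lie within the first coding exon; the gene is marked as spliced, so translating the full gene sequence directly is not expected to reproduce the whole protein. The codon table is built by hand from the standard code, and the names `build_table` and `translate` are hypothetical helpers.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def build_table(ctg_as_serine: bool) -> dict:
    """Return a codon -> amino acid map.

    With ctg_as_serine=True the map follows translation table 12,
    which differs from the standard code only in CTG -> S.
    """
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD_AAS))
    if ctg_as_serine:
        table["CTG"] = "S"
    return table


def translate(dna: str, table: dict) -> str:
    """Translate dna in frame 0, ignoring whitespace and trailing partial codons."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(table.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


# First 60 bases of the Gene sequence field, grouped as in the record.
first_60 = (
    "ATGCAACGTC CACCACTACC AACACAGCAA "
    "CAGCTGCAAC TGCATCTTCA ACAAAAACAA"
)

print(translate(first_60, build_table(ctg_as_serine=True)))   # MQRPPLPTQQQSQSHLQQKQ
print(translate(first_60, build_table(ctg_as_serine=False)))  # MQRPPLPTQQQLQLHLQQKQ
```

The table 12 result matches the first 20 residues of the Protein sequence field, while the standard code places leucine at the two CTG positions.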