Gene Information |
Locus tag | PICST_90238 |
Symbol | |
ID | 4840413 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | - |
Start bp | 1310674 |
End bp | 1313697 |
Gene Length | 3024 bp |
Protein Length | 821 aa |
Translation table | 12 |
GC content | 46% |
IMG OID | 640391728 |
Product | predicted protein |
Protein accession | XP_001385948 |
Protein GI | 150866371 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1928] Dolichyl-phosphate-mannose--protein O-mannosyl transferase |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AGAAAACATC CCCAAACTCC GTTTTACTTC TGCTAGTCTT GGTTCCTGTT CTCTGGTGTA GTTCCCCCGC TTTTCCTTTG CTCCAATGGC CAAGAAACAG TCCTCGGCTG CTAAAGCTGC TGCGCTCAAG GCCAAACAAG CTCCTTCCCT ACTTCTGGTG GATCCAGTAG TGGATCCGGT TTTCGCTCAA GGTGCTATCC GCTCGTATCT AGTGACCGAC CCGGCTCCAA CTTTGCTTAA AAAGCGTTCT TTGCTCGGTT CCAACGAATG GATTCTTATC GCTGTTCTAC TTGTGGCTTC TTACTACGTG AGAATGTACA ACTTGTCGTA TCCCAAATCT GTGGTGTTTG ACGAAGTACA CTTCGGAGGC TTTGCCAGAA AGTATATTCT CGGTAAATAC TTCATGGATG TGCACCCACC GTTGGCAAAG ATGCTTTTTG CAGCTGTTGC TTCCTACGGA GGCTTCAAGG GTGATTTTGA ATTCAAGAGT ATTGGTGACT ACTTCCCTGA AGGTACACCC TATGTTCTCA TGAGACAGTT CCCCGCGTTG TTGGGTATCG GCACGGTACT CTTAGCGTAC TTCACTTTGC GTAGTTCCGG CGTTAGACCT GTCATTGCTT TTGCTACCAG TTTTCTTTTG TTGGTAGAGA ACTCCAACGT AACAATCAGT AGATACATTC TCTTGGACTC TCCACTTCTT TTCTTCATTG CTGCATCCAT TTTTGCCTGG AAGAAGTTCG AGATTCAGAC CCCTTTCTCC GCTGGCTGGT TCAAGAGTTT GATTGCTACC GGTGTTGCTC TTGGTTTAGC TCTTAGTTCC AAGTGGGTGG GCCTTTTCAC TGTAGCTTGG GTAGGTCTCT CCTGCGTGTA TCAGATGTGG TTCATCGTTG GTGACTTGTC CGTCAGCGCT AAGAAGGTCG TTGCCCATGC TTTCTTCAGG GGCTCCATCT TGTTGGGTGT CCCTGCCTTG TTGTACCTCT TCTTCTTTGC AGTTCATTTC CAGGTGTTGA GCAAGGAAGG TGATGGTTCT GCTTTCATGT CCAGTGCCTT CAGAGCCGGC TTGGAAGGCA ACAGCATTCC CAAGAATATT ATTGCTCAAG TGGGTTTAGG TTCGACTGTA ACAATTCGTC ACATCGACAC TCAAGGTGGC TATTTGCACT CGCACGAACA CTACTATCCT GCTGGTTCCA AACAGCAACA GATTACGTTA TACCCCCATT TGGACTCCAA CAACAAATGG TTCATCGAAC CATACAGCAA CCTCACTGTG TACAATGAAA CGTTTGTGCC TTTGACAGAT GGTATGAAGG TAAGATTAAA GCACATCAAT TCAGGCAAGA GATTGCACTC TCACGATGAA AAGCCTCCTG TCAGTGAACG TGACTGGCAG AAGGAAGCTT CTTGTTACGG GTTCGAAGGA TTTGCCGGTG ACGCCAATGA CGACTGGGTT GTAGAAATCG TAAGCCACAG GACCCCAGAG GGCGAAGCCC GCAATAACGT GATCGCCTTA ACATCTGTAA TCAGATTCAG ACACGCCATG TCTGGACACT ATTTGTTCTC GTCGGAAGTC AAATTGCCAG AATGGGGTTT TGGCCAACAA GAAGTCTCTG CTGCCTCGTC TGGTAGAAGA GCATTGACCC ACTGGTATAT CGAGACTAAT GAAAATCCTA TGTTGAGCCA GAGTGAAGCC AGGATCATCA ACTATCCTAA GTTAACGTTG TTGCAGAAGT TCACTGAGTC CCACAAGCGC ATGTGGAAGA TCAACCAGGG CTTAACCGAT CACCATAACT GGCAGTCTGA 
ACCCCAAGAA TGGCCATTGA TGTTGAGAGG TATTAACTAC TGGGTCAGAG AACACAGACA AGTATATTTG ATGGGTAATG CCGTGACCTG GTGGACTGTC TCGTCTGTTA TTGCTGTTTT CTTCTTGTAC ACTGCCATTC AAGCTATCAG ATGGCACACT GGTAGTCAAA TTGCGACTGA CAAGAATGTT TACAACTTCA ATTTCCAAGC CTTTTCATAC ATATTGGGTT GGGGTCTTCA TTACCTCCCA TTCTTCATTA TGGGTAGACA ATTGTTCTTG CATCACTACT TGCCAGCTCT TTACTTTGGT ATCTTGGCTC TCGGACACTT CCTTGAACTT TTTACTGGTT ACTTCTTGGC CCGCTCCCAA TTCTTACAAA GATTCGGCTT GGGCTTGGTC ACCATCTTTG TTGCTCTCAG CGCTGTTTTC TACATCAATT ACTCTCCTTT GATTTATGCC ACCAACTGGA CCAAAGACCA GTGTAAGAGA TCTAAGGCCA TCAGTACTTG GGACTTTGAC TGTAACACCT TCCACGGAAA CATCAGCGAG TACAGCATCG ACGAAGCTGC TCTGCTTACA AGTGCTAGCA AGATTGCTGA CGATCTTCTC AAAAAGGTAG AAGAAGCTCA CGAGACGCCT GGTGAACTCT TGCAACAGAA GAAGCCAGTT GATCCAGAAG AAGAACAAAA GCATCTCTCT CAGGATGGAG AAAAGAAGGC TGAAGAATCT CCAAAGCAAG GTGATGACAG TTTGCCTGTA GTTGAAGATG TTGTTGTCGA AGACGTTGTA TTGGAAGAAG CTGTTCTTGT AGAGGAGCCA ATTGCTCCAC CTATTCTTGA CGATGAAACC CCTGTTGCCG AAGCACAAAT CCCAGAAGAA GTAGAAAGCT CTCAGGAAGC TGAACCTGTC CAAGATGTCG CTGAAGAAGT GGCTGAACCA GTTCAACAAG TCCTCAAGGT TGAAGAACCG GCTGCTGAAG ACCCAGTAGC AGAGCCAGTA GAGGAAGTTG TAGAAGTTAT TGATGTTTCT GCACAATAGA CTCCGTTCGA AGTTGATTTC TAGTTACTAC ACTGGTGCAA CGTTGATAGG TGTTTATAGA TGATAGCATT AGTGAGTATG CTCTCAAAAT CCAATTACTC TTATAAATAT AAGCAATTCG TAATTAGTGC ATTACAACGT AAAAACTGAC TAGCTATATA CACAAAAGGA ACCATTACAA CTTC
|
Protein sequence | MAKKQSSAAK AAALKAKQAP SLLSVDPVVD PVFAQGAIRS YLVTDPAPTL LKKRSLLGSN EWILIAVLLV ASYYVRMYNL SYPKSVVFDE VHFGGFARKY ILGKYFMDVH PPLAKMLFAA VASYGGFKGD FEFKSIGDYF PEGTPYVLMR QFPALLGIGT VLLAYFTLRS SGVRPVIAFA TSFLLLVENS NVTISRYILL DSPLLFFIAA SIFAWKKFEI QTPFSAGWFK SLIATGVALG LALSSKWVGL FTVAWVGLSC VYQMWFIVGD LSVSAKKVVA HAFFRGSILL GVPALLYLFF FAVHFQVLSK EGDGSAFMSS AFRAGLEGNS IPKNIIAQVG LGSTVTIRHI DTQGGYLHSH EHYYPAGSKQ QQITLYPHLD SNNKWFIEPY SNLTVYNETF VPLTDGMKVR LKHINSGKRL HSHDEKPPVS ERDWQKEASC YGFEGFAGDA NDDWVVEIVS HRTPEGEARN NVIALTSVIR FRHAMSGHYL FSSEVKLPEW GFGQQEVSAA SSGRRALTHW YIETNENPML SQSEARIINY PKLTLLQKFT ESHKRMWKIN QGLTDHHNWQ SEPQEWPLML RGINYWVREH RQVYLMGNAV TWWTVSSVIA VFFLYTAIQA IRWHTGSQIA TDKNVYNFNF QAFSYILGWG LHYLPFFIMG RQLFLHHYLP ALYFGILALG HFLELFTGYF LARSQFLQRF GLGLVTIFVA LSAVFYINYS PLIYATNWTK DQCKRSKAIS TWDFDCNTFH GNISEYSIDE AASLTSASKI ADDLLKKVEE AHETPVAEPV QQVLKVEEPA AEDPVAEPVE EVVEVIDVSA Q
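The record lists translation table 12, which in NCBI's numbering is the alternative yeast nuclear code: it matches the standard code except that the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on a 30-nucleotide stretch copied from the gene sequence above, which encodes residues 21 to 30 of the protein (SLLSVDPVVD). The helper names `codon_table` and `translate` are illustrative assumptions, not part of any database tooling.

```python
from itertools import product

# NCBI lists codons with bases in the order T, C, A, G (first base varies slowest).
BASES = "TCAG"
# Amino acids of the standard code (NCBI table 1) in that codon order.
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(ctg_as_serine: bool) -> dict:
    """Build a codon -> amino acid map; optionally apply the table 12 change."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD))
    if ctg_as_serine:
        table["CTG"] = "S"  # the single difference between table 12 and table 1
    return table


def translate(dna: str, table: dict) -> str:
    """Translate a DNA string in frame 1, ignoring spaces between blocks."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any incomplete trailing codon
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# Copied from the gene sequence in this record (spacing kept as displayed).
fragment = "TCCCT ACTTCTGGTG GATCCAGTAG TGGAT"

print(translate(fragment, codon_table(ctg_as_serine=True)))   # table 12
print(translate(fragment, codon_table(ctg_as_serine=False)))  # standard code
```

With the table 12 setting the fragment reproduces the record's protein residues; with the standard code the fourth residue becomes leucine, which is why the correct table has to be chosen when re-deriving the protein from the nucleotide sequence. Because the gene is marked as spliced, translating the full displayed sequence in one frame would not reproduce the whole protein.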