Gene Information |
Locus tag | PICST_55575 |
Symbol | |
ID | 4837011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 1479070 |
End bp | 1482060 |
Gene Length | 2991 bp |
Protein Length | 996 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640388326 |
Product | Protein required for cell viability |
Protein accession | XP_001382504 |
Protein GI | 150863879 |
COG category | |
COG ID | |
TIGRFAM ID | |
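
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, that the gene is unspliced (as the record states), and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon, which does not encode an amino acid.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the record: Start bp 1479070, End bp 1482060.
print(cds_lengths(1479070, 1482060))  # (2991, 996)
```

Under these assumptions the result agrees with the Gene Length (2991 bp) and Protein Length (996 aa) fields.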
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.154341 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.828998 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCCTA AGATCGAGGA ACTTCCTCCG AAAGTGGAGA ATGTGTCCAA GTCCAAGAAA AAGAATCCGT TTGCTACACC AAAAAAGACA ACTGTTCGCA GGAACGCTCT GGAAGTCTAT CCCACCCACA AAGGTTTGAA CAAACTACAA TACATAGGTG ATAAACCTAT AGACCTTCTA TTCCATGACC TCGAAGTAAA ACTCGAAGGA AATTACCAAG ATCTCACTAT TGATGTGTTA TACCAACGTC TTATTCAGGT GAAAGTGGAG GAAGCCGATG ATGTAGACTA TATGAAGAGA TTCCAAGTCT TGGAATATTT ACTTGACAAA TTGATTGAAA TCCAGAATTT ATCAAACGAG AACGATCTCA AAGACAAGAA TTTGATAAAA ATCTCACTTC ATGATATACG AACCTTCAGC AAGGTTGTGA ATTTGATTAT TGTGCATGGT GTCTATCCTG CTATTACTGC GTTTAAAATT GGGATACCGT TTGAAAAGAG AAGGTTGAAC CATTTCAATG TCAGCATGGG CAAGAACCCG GTAAAAATCG ATAAAATACC TATAAATTCG AAGCTGTCGA CTCCATTTGA ACGTATACAG AAACTATTGA TGTTGATGTA TACCAAATTG TACGTGGTCT TTCAAGTACA ATCGGACGTC AAGGACTTGC TTAGCAAAGG TACTGGAATA TCAGACTTTC TCACTATTGC AATTACATTG ATCACAGTTC CATATTTTTC GAAAGATGTG ATAGCGAAGG TTCTTTCTGA TTTTCCTAAC ATCATAAAAT TAGTAGAAAC ATACGAATTG TACCAGACAT ATACACTTCT TTTATCGACG CAGTCACCTT CATACTTTAA GCTGTTTGTG ATGCAGAAAC TTCTGACCAT CCACTACGAT ACACCCACTG GAGTGTTAAC TCTCATTGAA TTCGTTCTTG GATTACGTGA TAATGACGAA ATAGAAGTGG AAAAGTACGA ACATGTGTCG AATGTAGTCT TGCTGAAGCC GAAGAGCATA TCCACGGTAG ATTACTTTAC AAACATTGGG AACCAATGCT ATAATCTTCT TGTAAATATT AACAGACCAA TGGTTACCAG CTGTGTGGTT TTCATCTTGG AGAATCTCTG GAATCGGAAC CAAATGGTCA CCAGAGACTT CTTCTTGAAA CGGATTTGGA ACAATTTCAG CCCACCAAAC AGCAATTCAG ACGAGATATT GGTTACTGAA GCTCAACTCA ACAATAATGT CAATGTGTTG ATTTCATTGA CCAAGAAGGG CTTACCGGTG GAGTTGTTGA AGGTGGTTTT TGAGCCAATT ATTCTTTCAG TGTGGTCATA CTTGAATTTC CTTAAGAAAA ATAAAAAGTC CACTGAAATC ATAAGTGGTA TACTTGTAAG TTACTTCACG ATGGTAAAAG ACTCAGAAAT AGAAACAAAG GACGTTTATG GGTTGGATGC AATCGCAAAG AATTTATTGT ACGATGCTGA AGATCACGAA TTTGCCATTG GTCCGAATGA ATTAGTACAG ATTCAGAGAA AACAGAGGAA AATTGAAAAT TCAAGCAAGG ACCAAAAAGT GAACATGTTT ATTTCTGAAC TAGACATAAG TTGTGAAAAT TTTGTGGCTC TTTTGGATAA TTTGGATGAC GATTTAGTTC AAGCTATATT CCTCAGTACT CTCAAGCGGT GGTTACGTAG CGGAGACAGT TCAAATGGCA ACGAAAACCC CTTTATTGTT CTTATTGATT TACGATTGCT TGAGTCCATT GGAAACAAAT TCAAGGATAG TTTGGCCAAG 
ACTCCATTCG AAGTACTTCA GATAGTGCAA AACTTTTTAT CTCCCCAGGC AAGAGAAAAG TTGCAAGTTG AACATGTCAA CTTGGTTTCA CAGAGCGGTG ATGTGGATTC AGATGACGAG GATGATTTTG ATGAAAATGT CGAGGCTCAA GCACTTCCAA TAGTGTTGGA GCTTTTGTCA GCAATTCTTT CCGAGACTGA AGTTTCTTTG GACGAGAAGT CATTCGAAAG TTTGCGCGCT ATTCAGAAGT CGTTGGCAAG ACTCTCGGCT ACAGATGTTC CATCTAGTAT CAAAAGTGCG TCGACTTCAC TCAACGAAAG AATCGATGAC TTATTGAACG GAGACATACC AGTTCAGAGT GAAGAGGAAG CTGAAAAGGC TGACCTAAAG CGTGCAGTCA CCAGTCTCAA CGATCCGCTA GTCCCAATTA GAGCTCATGG GCTCTATTTG CTTAGGCAAC TTATTGCAAA TAGGAGTAGC GTGATATCTC TCGAGTTTGT AGTGGATTTG CATTTGGTTC AATTAAAAGA TCCTGATCCC TTCATCTTCT TGAACGTCAT AAAAGGACTT GAGAATTTGA TTGAGTGGGA CGAGAAGCGT ATGCTTCTGA TTTTGTGTGT TTTGTACTTG AATGAGTCCA AAGAAACCGA TCTTGACGAG AGATTAAAAA TAGGAGAGGT TTTGTTGAGA TACATACAAG GTGCCAACGA GATGTTTTCC GGAGAGTCTG CAAAGAGAAT TGTCAGTACG GCATTACACT TGATAAGAAG AAAAGTACCC GAGGAAGAAA ATGAAGACAA CCGATTGAGA ATGTCTAGTA TGTCGTTGTT GGGTACTTGT TGTAAGGTCA ATCCACTTGG TATTGTTGAC CAATTAGAGA ATGCATTGGA CTGCGCACTT GGAATTTTAC AATTTGAAAC TGATAAGGAT AGTGCCATCA TGCGTCGTGC TGGTATAGTA TTGATCCACG ATTTGATAAT TGGTACTTCC AACCAAAAGG AAGTACCATT TCCGGAAAGT TATAGATTCA AAGTGGTCAA TACTTTGCGC TACGTTAAAG ATACTGACAA TGATATTTTG GCTAGAGAGC AGGCAGAGAC GGTGTTGGAT TCTATAGAGG AATTGTCCAG TCTTGCCTTT GAGCAGCTCG AAGAAGACAG TGAAGATCAG TTCAAGTCCA TGAGAGTATA G
|
Protein sequence | MPPKIEELPP KVENVSKSKK KNPFATPKKT TVRRNASEVY PTHKGLNKLQ YIGDKPIDLL FHDLEVKLEG NYQDLTIDVL YQRLIQVKVE EADDVDYMKR FQVLEYLLDK LIEIQNLSNE NDLKDKNLIK ISLHDIRTFS KVVNLIIVHG VYPAITAFKI GIPFEKRRLN HFNVSMGKNP VKIDKIPINS KSSTPFERIQ KLLMLMYTKL YVVFQVQSDV KDLLSKGTGI SDFLTIAITL ITVPYFSKDV IAKVLSDFPN IIKLVETYEL YQTYTLLLST QSPSYFKSFV MQKLSTIHYD TPTGVLTLIE FVLGLRDNDE IEVEKYEHVS NVVLSKPKSI STVDYFTNIG NQCYNLLVNI NRPMVTSCVV FILENLWNRN QMVTRDFFLK RIWNNFSPPN SNSDEILVTE AQLNNNVNVL ISLTKKGLPV ELLKVVFEPI ILSVWSYLNF LKKNKKSTEI ISGILVSYFT MVKDSEIETK DVYGLDAIAK NLLYDAEDHE FAIGPNELVQ IQRKQRKIEN SSKDQKVNMF ISELDISCEN FVALLDNLDD DLVQAIFLST LKRWLRSGDS SNGNENPFIV LIDLRLLESI GNKFKDSLAK TPFEVLQIVQ NFLSPQAREK LQVEHVNLVS QSGDVDSDDE DDFDENVEAQ ALPIVLELLS AILSETEVSL DEKSFESLRA IQKSLARLSA TDVPSSIKSA STSLNERIDD LLNGDIPVQS EEEAEKADLK RAVTSLNDPL VPIRAHGLYL LRQLIANRSS VISLEFVVDL HLVQLKDPDP FIFLNVIKGL ENLIEWDEKR MLSILCVLYL NESKETDLDE RLKIGEVLLR YIQGANEMFS GESAKRIVST ALHLIRRKVP EEENEDNRLR MSSMSLLGTC CKVNPLGIVD QLENALDCAL GILQFETDKD SAIMRRAGIV LIHDLIIGTS NQKEVPFPES YRFKVVNTLR YVKDTDNDIL AREQAETVLD SIEELSSLAF EQLEEDSEDQ FKSMRV
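
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The following is a minimal sketch of how the gene sequence might be translated under that table. The codon table is built from the NCBI-style 64-letter amino acid string in TCAG base order; the function name `translate` and the choice to stop at the first stop codon are assumptions made for illustration, not part of the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 12, codons ordered TTT, TTC, TTA, ...
# It differs from the standard code at one codon: CTG -> S instead of L.
AMINO_ACIDS_TABLE_12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS_TABLE_12)
}


def translate(dna):
    """Translate a DNA string (whitespace allowed) using table 12.

    Translation stops at the first stop codon; a trailing partial
    codon is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCCTCCTA AGATCGAGGA ACTTCCTCCG"))  # MPPKIEELPP
# Fourth to sixth blocks; the CTG codon is read as S under table 12.
print(translate("ACTGTTCGCA GGAACGCTCT GGAAGTCTAT"))  # TVRRNASEVY
```

Both outputs match the corresponding residues of the protein sequence in the record, including the serine at the CTG codon, which the standard code would translate as leucine.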