Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_70098 |
Symbol | |
ID | 4837153 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 205470 |
End bp | 207461 |
Gene Length | 1992 bp |
Protein Length | 597 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640388468 |
Product | predicted protein |
Protein accession | XP_001382274 |
Protein GI | 150863712 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG5384] U3 small nucleolar ribonucleoprotein component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.417891 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.628267 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCAAG ATTTGTTGGA GACGTTGAGG AATAATCCTC AGGAAATCTT CGAGCTCTAC AAGCCCAAAG AGTCTGATGA GACCTCATCG CAGAATATCT TCAACGAGAT GACCAAGACC TTCTTGGACC CGTTCACCAA AAAGTATTCT GTTTTGGACG AAATATACGT TGATGGTTTA GATTCCAGTC AAGTTTTTGG CCAGACGAAA ATGGTTTTGG ACGGGGTTGG AGAAACACTT CTTGCTTCTG TGATTCCAGA GTTGAAGGAG AAGTATGGAG TAGCACAGGA AAGCGAGCCA GAAGACGAGG AAGACTCTTC CAGCGATGAA GAAGGAGAGT TTGGTATCCC AATCGAAGAA GAAGAAGACT ACTTGGATGA AGAAGAAGAA AAAGAAGAAG AAAATGACGA ACTTGAAGCT GAGCAGAGTC TTGATGAAGA TCAAAAGGAT GAAGATCAAG AAGATGAAGA TCAAAAAGAT GATGAAGAAG AAGAAGAGCA AGATATTCCT GTCAAGAAGG ATGTATTTGG ACTTAACGAC GAGTTCTTCG ACATCGATGA ATACAACAAA CAAGTGATGA AGTTAGAAGA AGCTGCCGAG AACGATGACT ACGACGAGAA GGAAGAAGAA ATCGACTATT TTGCAGCTTT GAGTGATGAA GATGAGGAGG AAGAAGAGGA GGAAATGGCA TACTATGATG ACTTCTACGA CAAACCCGGA AGTTCGAACA AGTTATCTAA TATTAAAGAC CATGAAACAA AAGAAGAAGA GGAGGAGGAG GAGGAAGAAG AAGAAGGAGA TTTCAGTGAA GGAGAAATAG ACAACGCCAT GGGTTCGGCA ATGTTGGACC TATTTGCTGA CGAAGTCGAT AATGAAGAGG TCTCTTCCAA GAATGAGAAA ACCATGTCCT CTTTCGAGAA ACAGCAGCAA CAGATTCAAG CAGAGATAGC TAAATTAGAG GCAGAACTTG TAGCAGATAA GAAATGGACT ATGAAGGGTG AAGTTGGCTC TAAAGACAGA CCACAGGATT CTCTTCTTGA TGATCCAGAG TCTGCAAATA TGGCTTTTGA CAGAACGTCA AAGCCTGTAC CTATTGTTAC ACAAGAGAGC ACAGAAGCAT TAGAAGATTT GATTAGACGC AGAATCAGAG AAGAACAATT CGACGAAGTT CCAAAGAGAT TAGTAGCCGA TGTGGCTAGA TTCCACAACA AGCAGAAATT TGAATTGTCT GAACAAAAAT CAAGTAAGTC ATTGTCTGAA ATGTATGAAG ATCAGTACAA AAATGTTGAT ACAGAAAAAG AAGTCAGTGA AGAAATCCAA AAGCAGCATG ACGAAATAAC AGAGCTATTT ACCAAGGTAA GCCACCGGCT AGACGCTCTT TGTTCGGCAC ATTTCATCCC CAAGCCTCAT CAATTTAAAA CTATTGAAAT CAAGGTCAGT GACAATGCCG CCTCAATTAA TATGGAAGAC GCTCAACCAT TGCATGTTTC GAGTGAATCC ACTTTGGCGC CTCAAGAAAT ATATAAGATT GGCGATGACA AGCCTGTTGC GAACGGAGCT AAGGGTAGAT CTGAAGTCCA ATTAAAATCT GGTTTGTCAT TCTCCAAGGA TGAGTTATCT AGAGAAGACA AGCAGAGATT GAGAAGAGCC AACAAAAGAA AGAGAGCTAA GGAGTTCAAC CAAAGAAAGG AATTACAAGA ACAGAAACAG AAGCAAACCG GTGCTGCTCC AGCCAATAAA CGCCAAAAAG TGGGCGAGGT TATCAATACA TTATCTAAGG CTAAGAATAT CACTGTTATT GGCAAGAAGG GAGAAATGAG AGATGTAAAG GGTAACGTGA AAAAGCTGCA AGGGGCACAA ACTTCGAACA ACTTTAAGTT GTAGAGAAAA CATGAATAAT TTAATGCATA ACTGGCAGCT GCCAGATATT TTATAGCGTG TATAATATTT GTATTCATAC AAAACAACTA TATTTGACAG AAGAATTACT TT
|
Protein sequence | MSQDLLETLR NNPQEIFELY KPKESDETSS QNIFNEMTKT FLDPFTKKYS VLDEIYVDGL DSSQVFGQTK MVLDGVGETL LASVIPELKE KYGVAQESEP EDEEDSSSDE EGEFEEKEEE NDELEAEQSL DEDQKDEDQE DEDQKDDEEE EEQDIPVKKD VFGLNDEFFD IDEYNKQVMK LEEAAENDDY DEKEEEIDYF AALSDEDEEE EEEEMAYYDD FYDKPGKEEE EEEEEGDFSE GEIDNAMGSA MLDLFADEVD NEEVSSKNEK TMSSFEKQQQ QIQAEIAKLE AELVADKKWT MKGEVGSKDR PQDSLLDDPE SANMAFDRTS KPVPIVTQES TEALEDLIRR RIREEQFDEV PKRLVADVAR FHNKQKFELS EQKSSKSLSE MYEDQYKNVD TEKEVSEEIQ KQHDEITELF TKVSHRLDAL CSAHFIPKPH QFKTIEIKVS DNAASINMED AQPLHVSSES TLAPQEIYKI GDDKPVANGA KGRSEVQLKS GLSFSKDELS REDKQRLRRA NKRKRAKEFN QRKELQEQKQ KQTGAAPANK RQKVGEVINT LSKAKNITVI GKKGEMRDVK GNVKKSQGAQ TSNNFKL
|
| |