Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_68309 |
Symbol | |
ID | 4840727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | - |
Start bp | 485918 |
End bp | 488831 |
Gene Length | 2914 bp |
Protein Length | 927 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640392042 |
Product | predicted protein |
Protein accession | XP_001386294 |
Protein GI | 150866630 |
COG category | [L] Replication, recombination and repair [R] General function prediction only |
COG ID | [COG0494] NTP pyrophosphohydrolases including oxidative damage repair enzymes |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00286141 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCAAAC TGAAAATCTA CAATTCTCGA TTCACGAATC ATATTCACTC TCTCAGCTAC CTATTCGCCA GCTTCACTAT ACATAGCAGT AGCTTTTTAT CCTTTTCTAC ATTACTGTCC ATCAGTAACT CACTCATGTC CATCCAATTG CGAGACGGCT TAGCCAACCA GCTGGTTGAT CTTGTGCTTG AAGACCTATT GGTGAGGTTT CTAGTCAATT GTCCCGAGGA AGATCTTTCG CTGATCGAAA GAGTGTTTTT CCAAGTTGAA GAAGCGCAAT GGTTCTATAC CGACTTTGTC CGAGTGCTAA ACCCGGCACT TCCTAACATG AAGATGAAGC TGTTCTGCTC CAAATTTTTG GAGAAGTGCC CTCTATTCTG GAAATGGGGA GACCCCAACG ATGCGCTATC CCGATTCGGC AAGTATAAAC TGACGATTCC CGTTCGTGGA GTCGCTCTTT TCAATAGAGA CTTGACGAAA GTGCTATTGG TGAAGGGGAC AGAGTCTAAC TCGTGGTCGT TTCCCCGAGG AAAAATTTCC AAAGACGAAT CTGACATCAA CTGTGCCATT CGAGAGGTCG AAGAAGAGAC TGGCTTCAAC GCCAAAGACC TAATCAATGA AAGCGACGTT ATTGAGAGAA CCTTCAAAGG CAAGAATTAT AAAATTTACT TGGTGAGAGA CGTGCCTGAA GATTACAACT TTCTGCCTGT AGCTCGTGGG GAAATCGCTA TGATTGAATG GCATGACATT AAAACTTTGC AAAAGAAGAT TAGAGCTTCT CCCAACAACT ACTTCATCGT CGAGACTGTT ATAAAACCAA TGATTCAATG GATAAATAAG AAGAAGGGTG GGTTAAACGA GGCAGAGTTG ATGCTAAAGG CAGAAATCCA GTTGAAGGCT TTATTGGGTG TAGGAAAGCG CGAAGAAAAC GTAGATGCTG GAAGAGAATT GTTGAACATT CTCCAGAAAG TCAGCCCCAG CCACTCAGCC AGCGACTCCA ATGTAGTACC TCTCTCATCT ACTGTGCCTG GTGCACCTCA ACAACAGAAC TATATCCAAT TCGCCTTACC GCAACACTTA CAGAACCAAA TCCCGTTCTT CTCTTCATCT TCAACCCACC ATCCACAGCC TATGTTGCCA TTCTTCAATC CCTTCGGCTT CTATCCTGGC GGATCTCCTT TACCTCCTCA TGCCATAACT CCTGTTCCAA TGCCTGTCCC ACCTCATCAG ATTCCTTTCC TGAATGTCTC CCCTCAAAAA CAGAGACATC CATATGAAAT ACACCAGCCA ACATCAGAGT CTCTTCAGAA ACCCACAGGT AAAAACTCCA AGGAATTCTT ATCGATTTTG AATACAAAGT CACTGAAAAT AACAGATGAG GCTACATCAA AAGATTATGT TAGATCTACG GAAGCAGAGA ATAACCGTAC AAAAGCTCAA GACTTGCTTA ACTTAGTAGG AAAGCAGAGA AAGGAGTCTG TTACATCTGA GTCAAGATCA ATTTTGGACC TTGTCAATAG GAAACAAGTA AGTTCATCTC CATCTCCAGA ACAAGACCCG GGCAAGACTT TATTGAACAT TCTCAACGAG AAGAAACATC CTGAGATTGT GCCTACTAGA GATTCGAGAT TTTTGGGTAT AAATTCACCT ATAGCATCCA ACGATCTAGA AGAATCGGTT ATTCACGGTG CTGGTCTTGG ATTACCGGCT CCAGAACCTG GCAAAATAAC TTTGCTAAAA AGACCAGACG ATGCAACAGA GGGTAGAAGA AAAAAGTCGG CTGATTTACT TAGTTTGTTG GGTAAAAAAC CTATCGTCCA ACCAAGACAG GAAGTTAAGT CATCGTCCAA CGAGATACTT GATCTCTTGA AGGGCCCAAA GAAACCGCTT TCAGAAACTA CAAGATCTCC CAATTCTTCT TCTAAAGAGT TGCTTGAATT ACTTAAACCA AAGAAGGAAC CAATAGCGGT CGACCATACT TCTTCAAATG AATTATTAGA CTTGCTCATC AAAAACAAGC CAAGCAGTGA ATCTGAGAAG AAACCAAATA ACCCATTATT GGATATGCTC CATAGTAAGG CACCACGTCA AGTATCACAA CCAAATGCCG AACATAATCA TACTTCAGCC AATGAACTTT TGGGTTTGTT GAACAAAAAA CCATCCATTC CATTAAATGA TGCTGACTCT AACTTCATTA GGGAAGAAAA GATTGAAGAG ACTCAATTCG ACAACTTTGA AGATTTTGAA GACTTTGAAG ACTTTGGAGT CATTGATAAC CAGCTATTGG GCAAGACCTC CTTCCGCAAC TTCGACATCG CAAGCGATGA AGAAGATGTA GACCATTTGA TAGATGATCT CGGAGATCCA TATTCCGCTC CAAATACAGC AGTAGAATCG TTTTCCAATC CACCAGATTT CTTCCTGGAT CCGCAGCCTT CTCTAGAACC AAAGAAGGGA AAGATCAGGC TTTTGAAGCC AGGAGAAGTA TTGAATGATA TCTTTTCTAC TAATCGTCCC AATGTTTCAT CGCCTCCTGT GCATGCTTCT AATGCTAATG GACAGAATCT TCTTGCATTG TTGAATGGAA AAAATCCCTC CAATTCAAAT GGTGCTATTC CTGTAAGCGA CAGCTTTCAA TCTATTTACG GGAATGCCAA CCCAACGGCG GGTCTTGATT CTAATACTCC TTCTTCTTTG ACAAATGCTC TTAGTGGTCA GAATAGCCCC AATAAGAATT CGGCTAATTT CCTTAAAGAC ATTCTTTGGA AACGCGAGCC ATAGTTAGCA CAGCATAAAT TAGCATAGCG TCTACGATAG TACCGGTGAA GTAGCACTAC AATAGTACGA ATTGCTATTT ATAGCAAATT TAGCATAGTG TCTAAGATAA CAAAACATAT AAATGATTTA GTCC
|
Protein sequence | MSKSKIYNSR FTNHIHSLSY LFASFTIHSS SFLSFSTLSS ISNSLMSIQL RDGLANQSVD LVLEDLLVRF LVNCPEEDLS SIERVFFQVE EAQWFYTDFV RVLNPALPNM KMKSFCSKFL EKCPLFWKWG DPNDALSRFG KYKSTIPVRG VALFNRDLTK VLLVKGTESN SWSFPRGKIS KDESDINCAI REVEEETGFN AKDLINESDV IERTFKGKNY KIYLVRDVPE DYNFSPVARG EIAMIEWHDI KTLQKKIRAS PNNYFIVETV IKPMIQWINK KKGGLNEAEL MLKAEIQLKA LLGVGKREEN VDAGRELLNI LQKVSPSHSA SDSNVVPLSS TVPGAPQQQN YIQFALPQHL QNQIPFFSSS STHHPQPMLP FFNPFGFYPG GSPLPPHAIT PVPMPVPPHQ IPFSNVSPQK QRHPYEIHQP TSESLQKPTG KNSKEFLSIL NTKSSKITDE ATSKDYVRST EAENNRTKAQ DLLNLVGKQR KESVTSESRS ILDLVNRKQV SSSPSPEQDP GKTLLNILNE KKHPEIVPTR DSRFLGINSP IASNDLEESV IHGAGLGLPA PEPGKITLLK RPDDATEGRR KKSADLLSLL GKKPIVQPRQ EVKSSSNEIL DLLKGPKKPL SETTRSPNSS SKELLELLKP KKEPIAVDHT SSNELLDLLI KNKPSSESEK KPNNPLLDML HSKAPRQVSQ PNAEHNHTSA NELLGLLNKK PSIPLNDADS NFIREEKIEE TQFDNFEDFE DFEDFGVIDN QLLGKTSFRN FDIASDEEDV DHLIDDLGDP YSAPNTAVES FSNPPDFFSD PQPSLEPKKG KIRLLKPGEV LNDIFSTNRP NVSSPPVHAS NANGQNLLAL LNGKNPSNSN GAIPVSDSFQ SIYGNANPTA GLDSNTPSSL TNALSGQNSP NKNSANFLKD ILWKREP
|
| |