Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_44719 |
Symbol | |
ID | 4838692 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 1534621 |
End bp | 1537806 |
Gene Length | 3186 bp |
Protein Length | 895 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640390007 |
Product | predicted protein |
Protein accession | XP_001384250 |
Protein GI | 150865150 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5096] Vesicle coat complex, various subunits |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00567496 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTTCC AATTGCAGAA CTCCGAGGTC TTGGCGCGGT TGAAGCCCTT CGGTATTCTG TTTGAGAAGT CGCTCTCCGA CTTGATCAAA GGGATTAGAC ATCAATCCAA GGAGTCTCCA GAGTCTTTAC TGAACTTTCT AGATGTCGTG ATCCAGGAGT GCAAAACCGA GCTTTCAACG ACGGATTTGG AGACAAAGGC TACGGCAGTG TTGAAGTTGG CATATTTGGA GATGTATGGC TTTGATATGG CTTGGTGCAA CTTCCAGATC TTGGAAGTGA TGTCTTCAGG CAAGTTCCAG CAGAAGAGAA TCGGATATTT GGCTGCGATC CAGCTGTTCA AAAACGAACA GGACTTGTTA ATCCTTGCTA CCAATCAGTT CAAAAAGGAC TTGAACTCGC ATAATCACAC CGAGATAGGT TTGGCACTTA GTGGCATTGC TACCATTGTT ACTCCCAATT TGGCGAGAGA CATCAACGAC GACGTGTTGA TGAAATTGAG CCATTCGAAA CCGTATATTC GTAAAAAGGC TATCTTGGCC ATGTACAAGA TCTTCTTACA ATATCCTGAA AGTTTGCGAG TTAATTTTAA TCGCGTTATC GCCATGTTGG ACGACGCAGA CATTTCCGTG GTTAGTGCTA CTGTCAATGT AATCTGTGAA ATTTCCAAGA AAAACCCGCA TATATTCATG ACAAGTTTGC CCAAATTCTT CTCCATCTTG GAGGACACCA AGAATAACTG GTTAATCATC AGAATATTGA AGTTGTTCCA GAGTTTGTCG CGTGTAGAAC CTCGTATGAA GAAGAAGATT CTTCCGACGA TCTTGGACTT GATCCTCAGA ACCCAAGCAT CGTCCTTGAT CTACGAGTGT ATCAACTGTA TCGTTAACGG CAACATGTTG AGTGCAGACT CTTCAAAGGA TAAGGAAACG GCAAAAATCT GCATTAAACA AATTATGGAG TTCTTCAAGA CAAAGGACTC CAACCTAAAA TTCGTGGGCT TAATTGCATT AATTAGCATC TTGAAGATAT TCCCCGTGTT TATGCACAAA GTTGATGGTG TTTCAACTAT CATAATGGAC TGTCTCACGG ATCCAGATCT TATCATAAAG AGAAAAGCAT TGGAAATCTG CCATTACTTG GTTCAAGAAG ATAATATAGC CGAAGTAGTA AAGGTCTTGT TGTTGCAGTT GATTCCAAGT GATACGAACG CTATTCCAGA GGCTTTAAAG CAGGAAGTCA CTTTGAAAAT CTTGTCAATA ACATCGAACG ACAAGTATGC GAATGTGCCC AACTTCAAAT GGTATGTGGC AGTATTGAAG GATATCATCA ATTTGACTTT ACTTCCGCTT CCTTCTTCTT CCAATGCTAG CACGATCTCT CCAGCAACAG CAAACGTCAT AGCTGCAGAA ATCGGTAAAG AATTCAAAGA GTTAGCCACC AAGGTGCCTT CTATTAGACC CACAATTCTC AACAAAGTGA TTGTGGAAGC TGTTCAGGAT GTAAGAATCT TGGACGTGTG TCCTTCATTG CTTAGGGACT TCTACTGGAT TATGGGAGAG TATATAGACG AGTTGAGATC TCCATCCGAA GAAGAAAGTG ACGTTGAAGA CGAGGATGAT ATTGAGGAAT CTTCTGTTTT GGACCTTGGC AAGAAGATCC AGATTTTCAA CGCGTTGGTA AACCACGATA TAGACAAGGT ACTTGGTTTA TCTGTAAATA CCCATTTTCC AATTTCATCC AAGTTGATTA CCTTATCTGA TTCTAATGTC CAAGTAGTGT TTATCCAGGC AATTGTTAAG TTGTACAATG GCATTGTGAC CGATTATTTG GTGCACTATT CAGTTCAAGG GAAATTCAAG CGGGAGCAAT TTAATCAATT GGCCCATTAT TTATACAAGT TGATCAACTT CCTTGGAAAC TGGGAGAACC ATAGGAACTA TGAAGTTCAG GAGAGAGCTT TGTCGTGGTT GGAATTCTTG AAGCTTTCAT TAGAAGCAAT GACACATGAA GATATTTCGG CTATCCAGAA ATTGGAAAAG GACGAGGTTG AGTATTACAG AAACTTACCG AGATCTGAAG AAGGTGAAGA TGAAGACGAT GAAGTTTATG ACGAAGAATC TTCGGAAGAA GAGGAAAGTG AAAATGACGA TACAGACAAC AGCATTAAGC CAGTCAGAGA CAATGAATAC GAAAACTTGA GCTCTTCATC TGAAGAGGAT AACGATGAAG ATGGAGATGA GAGTGAAGAA AATGGTAAAG ACGAAAACGT GGAATATCCT AATAACGAAG TCAATGGTGG GTTTGATGGC GTTGAACATA GTCCATTTCC TGAAACAGAT GACTTCCTTA CAGAACCTCT GAAAGAGAAT AGTTTGCCAA TGTTGCTAAC ACATATTCTT CCATCATTTT TCAAGAGTTA TCCTTTGAAC CCAATTGCAA AGAACTCACA GAAAAAGATT CCTATTCCAG AAGATTTGAA TCTTGACGAG CCAATCTACA CCATTCCATT TGATGTGTCT GCTGATGACG TTGACAGTTT TGTCAATGAT GAATATGATT TATTCATTGA AGACGAAGTT GATTTACATG CTGAAGAAGC ATCTTTGATC AGTTTGTCTA ACAGAGGCAG TGACGATGAC TTGAAGAAGA AACAGGAGAG ATTGGAGAAA TTGAGAGATG ATCCATACTA TCTTGGATCC AAGAAGTCTT CTAAGAAGAA GTCTATTAAC AGGAGAGTTC TCTTGGTCGA TGAAGACAAG ACCCCAAGCC CAGAAAACTT CAGTGAGAAG GGTTCGATTA ATTCAGGAGT TGCTCCTGTC AAGGAGAGAA AGAAGAAACC GTTGAAGATG AAGAAGGATA AAGTGGTCAT CTTGTCGGAA GAGACAATAG AAGGTGGTCC AGATGAAGAA GAAGATGAAG AAGCCACTGC TGTTAAAGCC AAGTCCAAAA AAAAGAAGAG TAACTTTATG ATTGATTCGT CGAATTTGGA TAATTTCGAT CTTACTTCTT CGGCCATGTC AGAGTCGGTC TCTGGTTTGG ACAAGGACTA CGAGTACAAC ATTGATTTGG ACGAGTTAAG AAAGAAATTG GCCCTGTCTT CGTTGAAAGA CAAGGAGAAG AAGGAAAAAA AGGAGAAAAA GAAAAAGAAA AAGAAGAGTT CCGCTTCTAA CGTTGAAAAA ATCAAA
|
Protein sequence | MSFQLQNSEV LARLKPFGIS FEKSLSDLIK GIRHQSKESP ESLSNFLDVV IQECKTELST TDLETKATAV LKLAYLEMYG FDMAWCNFQI LEVMSSGKFQ QKRIGYLAAI QSFKNEQDLL ILATNQFKKD LNSHNHTEIG LALSGIATIV TPNLARDIND DVLMKLSHSK PYIRKKAILA MYKIFLQYPE SLRVNFNRVI AMLDDADISV VSATVNVICE ISKKNPHIFM TSLPKFFSIL EDTKNNWLII RILKLFQSLS RVEPRMKKKI LPTILDLILR TQASSLIYEC INCIVNGNML SADSSKDKET AKICIKQIME FFKTKDSNLK FVGLIALISI LKIFPVFMHK VDGVSTIIMD CLTDPDLIIK RKALEICHYL VQEDNIAEVV KVLLLQLIPS DTNAIPEALK QEVTLKILSI TSNDKYANVP NFKWYVAVLK DIINLTLLPL PSSSNASTIS PATANVIAAE IGKEFKELAT KVPSIRPTIL NKVIVEAVQD VRILDVCPSL LRDFYWIMGE YIDELRSPSE EEMFIQAIVK LYNGIVTDYL VHYSVQGKFK REQFNQLAHY LYKLINFLGN WENHRNYEVQ ERALSWLEFL KLSLEAMTHE DISAIQKLEK DENSLPMLLT HILPSFFKSY PLNPIAKNSQ KKIPIPEDLN LDEPIYTIPF DVSADDVDSF VNDEYDLFIE DEVDLHAEEA SLISLSNRGS DDDLKKKQER LEKLRDDPYY LGSKKSSKKK SINRRVLLVD EDKTPSPENF SEKGSINSGV APVKERKKKP LKMKKDKVVI LSEETIEGGP DEEEDEEATA VKAKSKKKKS NFMIDSSNLD NFDLTSSAMS ESVSGLDKDY EYNIDLDELR KKLASSSLKD KEKKEKKEKK KKKKKSSASN VEKIK
|
| |