Gene Information |
Locus tag | PICST_66069 |
Symbol | ASH1 |
ID | 4840489 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | + |
Start bp | 532885 |
End bp | 534913 |
Gene Length | 2029 bp |
Protein Length | 466 aa |
Translation table | 12 |
GC content | 47% |
IMG OID | 640391804 |
Product | GATA-type transcription factor |
Protein accession | XP_001386121 |
Protein GI | 150866495 |
COG category | |
COG ID | |
TIGRFAM ID | |
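
The reported gene length is consistent with the start and end coordinates if both are read as 1-based and inclusive. That reading is an assumption, since the record does not state its coordinate convention. A minimal sketch of the arithmetic, using a hypothetical helper name `span_length`:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1

# Values copied from the "Start bp" and "End bp" rows of this record.
gene_length = span_length(532885, 534913)
print(gene_length)  # 2029, matching the "Gene Length" row
```
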
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.651427 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTTAG TGCAACTGCC AGCGATGTTC AACAATCACC ACCATAACGC CTCGTACCAA CAGGGCCACC TCCGGTCCCG CTCCTATAAC GAGATGTTGA CGCTGTTCAA CGCGTCCAGT GCGTCTTCTG CTTCCTCTCA TTCTGCTCCT GGTTCCACAC CAAGACCTCC TTTCTCCACT CCCGCCTACA TCAAGAAGAG ATCGCATTCC GATCTTGTCA AACAGCTGAC GGAAGGGCTT GCTTTTTATG ACCAGAAACG GTCTAGAACA GCCCCTCCTT CTCCCCCCTA CGAAATCAAC AATAAGTTCG GTCCTGGATC CGCACTCTCT GTATCCAGTT CCTCAACTGA TAAGCTCATC ACCAGTAATG TTATCAGACC CAACCCCAAC CTCCGCTCAC GTTCCTTGCT GCCCTCCAAG AAGTTCAGTA ACCCACTCTC GCCTTCCAAT TCACCCAACT CCTCGCCAGC ATCATCTCCT TCCAAGCCTG CAACTGGAGC TGCACCTTCT ACTCCAGACA CAAGCTTGGC TGCAAAAGCG CAACCGCAAA CGCAGACACA AGTGCAAACT CGATTGCAAT TGCGCGACGA AGACGAACAT ATCCACAAAC GCATTAGGCT TCCCAGCATT TCTGCAGCAT TGCAATCGAC CAAGTCCTCC TCGATTCGAT TGAAACCCGT CATCACTCCT CCCACTGTGT CTCTCGATTA CTTTGACACC TACAAACCCA ACGACGAAAA CTGGAGATAC GAATTGCTCG ACACCATTAA CAAGGATTCA AAATATTTCC ACTTGAACCA ATACAACTAC TTGAATAAAT ACGCCACTTC AGCCAAATAC CAGCTGCAAC TGCAACAGCA TTCCATTAAC TCGTCTTATG GTTACAAACC CAACTTCGAC TCAAGGATCA GCTCTAAGAT CGCTAACCAA CGTCCATCTC TTCCCAGTGT CAGCTCTATC TGCCACGAAA AAAAAATCAA CTTCCCTTTC GAATCCAACT ACACTTACTT GAACAAAACC TACATGAACG ACGTCGAAAA GTATCCCGAA TACTTGGAAT TGGCCCAGTC TTTGATCCAG TTGTCGCAGC CACGTAGGCG AAGTGATTAC CACGACCAAT CTGCGGCAGT TTCGTCCACT GCTCCAATAC CGACAGCTGT TACACCTCCA ACATCGTACT CGCCAAACTC TTCTGCTGCA CCTCCTCCAA CCCGCTTGCC AATAGTCAAA CCACTGCAAT ATTCCCATCA AACCTACACG GAAGCACCTG TACCTGTAGT ATATCATCAA AGCGGAACAA TAACTCCTCC AGCATCTAGA TTGCCTAGCA TCCAAGCCAA CAAGTACAAC ACTTATCCAG GTATGATTCA GGAAAACAGC ACTGCCAATA GCCATTCGCC TGAATTGGTT ACTTTGCACC ATGCTCCTGT CCAAGTCCAA CCTCTTGCCA CCCCTGCTGC TCAGCAGAGT CACAAATTCA TCCCCATCAC CCCACCATCT TCCAAGTCCA AGTCGAGAAC TGAGTTGTTG AAGTCTCCAC CCAAACATCA TTACAACCAC CACAGCCCCA GAGTATGTAT TTCTTGTGGA TCGGATCAAT CGCCATGCTG GAGACCTTCA TGGTCGATCA AGGAAGGACA ATTGTGTAAC TCTTGTGGAC TCAGGTACAA GAAGACATCT GCCAGATGTC TCAATAACAA CTGTAAGAAG ATCCCAGCCA AAGGCGAGTG GTCGCTTATG CAAAGCAAAG GCAAGACCAT GTTTGATGAT GGCCACGACG GCTACAGCTG CTTAGAGTGT 
GGCTGGAGAG TCGAAATCAA GACTTAAACT AAATATCTAA TACTGTCGCG CCATCTTAAG ATTACAAGTA TGGCCAGTAC TAGACGACAC AAGCGTCCAG GGTTTCTCCT AAGGTTTATT GGGGAGCCAA CAACATGCAT TCGTCCGTTG AAGGACTAAA CCTGTACTAT GATGTATATT TAATTCTTTT AGTTTCTAGA AAACAGTTTA ATGGTAATTA ATAGTTATG
|
Protein sequence | MSLVQSPAMF NNHHHNASYQ QGHLRSRSYN EMLTSFNASH SDLVKQSTEG LAFYDQKRSR TAPPSPPYEI NNKFGPGSAL SVSSSSTDKL ITSNVIRPNP NLRSRSLSPS KKLPSISAAL QSTKSSSIRL KPVITPPTVS LDYFDTYKPN DENWRYELLD TINKDSKYFH LNQYNYLNKY ATSAKYQSQS QQHSINSISS KIANQRPSLP SVSSICHEKK INFPFESNYT YLNKTYMNDV EKYPEYLELA QSLIQLSQPP VTPPTSYSPN SSAAPPPTRL PIVKPSQYSH QTYTEAPVPV VYHQSGTITP PASRLPSIQA NNHSPELVTL HHAPVQVQPL ATPAAQQSHK FIPITPPSSK SKSRTELLKS PPKHHYNHHS PRVCISCGSD QSPCWRPSWS IKEGQLCNSC GLRYKKTSAR CLNNNCKKIP AKGEWSLMQS KGKTMFDDGH DGYSCLECGW RVEIKT
|
| |
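
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on the first ten codons of the gene sequence above. The helper names `codon_table` and `translate` are hypothetical, and the table is built from the standard code with only the CTG entry changed. Because the gene is marked as spliced, translating the full genomic sequence this way would not be expected to reproduce the listed protein, so only the opening 30 bases are used.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

def codon_table(ctg_as_serine: bool) -> dict:
    """Codon-to-amino-acid map; optionally reassign CTG to serine (table 12)."""
    codons = ["".join(c) for c in product(BASES, repeat=3)]
    table = dict(zip(codons, STANDARD))
    if ctg_as_serine:
        table["CTG"] = "S"
    return table

def translate(dna: str, table: dict) -> str:
    """Translate in frame 1, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))

# First 30 bases of the gene sequence in this record.
first_30 = "ATGAGTTTAG TGCAACTGCC AGCGATGTTC"

print(translate(first_30, codon_table(ctg_as_serine=True)))   # MSLVQSPAMF
print(translate(first_30, codon_table(ctg_as_serine=False)))  # MSLVQLPAMF
```

With CTG read as serine, the result matches the first ten residues of the listed protein sequence. The standard code differs at the sixth residue.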