Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_72552 |
Symbol | PST1 |
ID | 4839588 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 944927 |
End bp | 946231 |
Gene Length | 1305 bp |
Protein Length | 417 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640390903 |
Product | hypothetical protoplast-secreted beta-1,6-N-acetylglucosaminyltransferase, contains WSC domain (ECM33) |
Protein accession | XP_001385189 |
Protein GI | 126137331 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.134169 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATTGA AGAGCCTCTT CACAGTCTTA GCTGCTGGTG CCGTAGCCCA CGCTGCCACC TCCACTTCTA AAGATCCATG TTCGTTATCA ACCACCATTA CGGCCGTGGG TGAACTTGAA ACCTTGAATG CCTGTTCTAC CTTGGACGGT TCCATCACCA TCACTGGTCA AGAAATCATC AACGCCGACT TGAGTGGTGT CAGAGAAATC AAGGGTGACC TCAAGTTCTT CAACTCCACT TCCATCGTCT CTCTTAACTT GAACCAGTTA CAGAACATCA CCGAAGGTGG TTCTCTTTCT GTTGTGTCAT TGACCACTCT TGCTTCCATT GACTTCACTT CATTGACCAA TGTCGACCAA GTTCTCTTGA CTTCGTTGCC ATCTCTCGGA AACCTTGTAA TGGGTTTTGG TGTCGTTCAC GCTGGCCACA TTGAAATTTC CGACACCGCC ATTAACTCGT TGAGTCGTTT CGTCAGTTTC CTTAACACCG TGCGTCACTT GGAATTGAAC TCGAACAAGA ACATCACTTC CATCGACTTA ACCAACTTGA ACACTGTCAC TGAAAACTTG ATCTTGCGTT TCAACGGTGA CGACTGTGAT GTCAAGTTGG ACACTTTGGC TTGGGCTTCC AATATTACCA TTCAAGATGT CGGTGACATC GAAATTTCTA ACATCACCGC TATCAACGGT TCTCTTGTTC TTGCCTACAA CACCTTTGAC TCGTTCAACC TTGACTCGTT GAAGACCATC GGCGGTTCCA TCGAAATCTT CGCCAACGAC GAATTGACTG AATTCTCGTT CCACGACTTG GAAACCATTG GTGGTGAACT TAGTCTTAGC AACAACACCA ACTTGGAAAA CGTCACTGAT TCATTCCCCA ACTTGAACAG AATCAAGGGT GCTGTAAACA TTGACGGTGG TTTCGCAAAC TTCTCTACTC CAAAGTTGGC AAGGGTTAAC GGTGACTTCA GCTTCAACTC TACTAACGAA GACTTCAGCT GTGACTTCTT CAATAAATTG CGTGACAACA AGGACATTGA AGGTCACAAC TACGAATGTA CTGCTCCAAA GAAGTCATCG TCCTCTACCG CTAAGTCCAA GTCCACCAGT ACTTCGGAAA GTTCTTCCGA CTCAACCAGC GATGATTCTG GCTCTTCTTC TACCACGTCC AAGAAGTCTG ACGGTTATAT CTTGGTTCCA GCTTCGATGG CCTTGACCAC CATCATCGGC TCGTTCTTAG CCTTCATTCT TTAGATACTT TCGCATGTTA CATAACTACA AAACAATAAT GTATAACTAC AAGGG
|
Protein sequence | MQLKSLFTVL AAGAVAHAAT STSKDPCSLS TTITAVGELE TLNACSTLDG SITITGQEII NADLSGVREI KGDLKFFNST SIVSLNLNQL QNITEGGSLS VVSLTTLASI DFTSLTNVDQ VLLTSLPSLG NLVMGFGVVH AGHIEISDTA INSLSRFVSF LNTVRHLELN SNKNITSIDL TNLNTVTENL ILRFNGDDCD VKLDTLAWAS NITIQDVGDI EISNITAING SLVLAYNTFD SFNLDSLKTI GGSIEIFAND ELTEFSFHDL ETIGGELSLS NNTNLENVTD SFPNLNRIKG AVNIDGGFAN FSTPKLARVN GDFSFNSTNE DFSCDFFNKL RDNKDIEGHN YECTAPKKSS SSTAKSKSTS TSESSSDSTS DDSGSSSTTS KKSDGYILVP ASMALTTIIG SFLAFIL
|
| |