Gene PICST_72552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_72552 
SymbolPST1 
ID4839588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp944927 
End bp946231 
Gene Length1305 bp 
Protein Length417 aa 
Translation table12 
GC content45% 
IMG OID640390903 
Producthypothetical protoplast-secreted beta-1,6-N-acetylglucosaminyltransferase, contains WSC domain (ECM33) 
Protein accessionXP_001385189 
Protein GI126137331 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.134169 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATTGA AGAGCCTCTT CACAGTCTTA GCTGCTGGTG CCGTAGCCCA CGCTGCCACC 
TCCACTTCTA AAGATCCATG TTCGTTATCA ACCACCATTA CGGCCGTGGG TGAACTTGAA
ACCTTGAATG CCTGTTCTAC CTTGGACGGT TCCATCACCA TCACTGGTCA AGAAATCATC
AACGCCGACT TGAGTGGTGT CAGAGAAATC AAGGGTGACC TCAAGTTCTT CAACTCCACT
TCCATCGTCT CTCTTAACTT GAACCAGTTA CAGAACATCA CCGAAGGTGG TTCTCTTTCT
GTTGTGTCAT TGACCACTCT TGCTTCCATT GACTTCACTT CATTGACCAA TGTCGACCAA
GTTCTCTTGA CTTCGTTGCC ATCTCTCGGA AACCTTGTAA TGGGTTTTGG TGTCGTTCAC
GCTGGCCACA TTGAAATTTC CGACACCGCC ATTAACTCGT TGAGTCGTTT CGTCAGTTTC
CTTAACACCG TGCGTCACTT GGAATTGAAC TCGAACAAGA ACATCACTTC CATCGACTTA
ACCAACTTGA ACACTGTCAC TGAAAACTTG ATCTTGCGTT TCAACGGTGA CGACTGTGAT
GTCAAGTTGG ACACTTTGGC TTGGGCTTCC AATATTACCA TTCAAGATGT CGGTGACATC
GAAATTTCTA ACATCACCGC TATCAACGGT TCTCTTGTTC TTGCCTACAA CACCTTTGAC
TCGTTCAACC TTGACTCGTT GAAGACCATC GGCGGTTCCA TCGAAATCTT CGCCAACGAC
GAATTGACTG AATTCTCGTT CCACGACTTG GAAACCATTG GTGGTGAACT TAGTCTTAGC
AACAACACCA ACTTGGAAAA CGTCACTGAT TCATTCCCCA ACTTGAACAG AATCAAGGGT
GCTGTAAACA TTGACGGTGG TTTCGCAAAC TTCTCTACTC CAAAGTTGGC AAGGGTTAAC
GGTGACTTCA GCTTCAACTC TACTAACGAA GACTTCAGCT GTGACTTCTT CAATAAATTG
CGTGACAACA AGGACATTGA AGGTCACAAC TACGAATGTA CTGCTCCAAA GAAGTCATCG
TCCTCTACCG CTAAGTCCAA GTCCACCAGT ACTTCGGAAA GTTCTTCCGA CTCAACCAGC
GATGATTCTG GCTCTTCTTC TACCACGTCC AAGAAGTCTG ACGGTTATAT CTTGGTTCCA
GCTTCGATGG CCTTGACCAC CATCATCGGC TCGTTCTTAG CCTTCATTCT TTAGATACTT
TCGCATGTTA CATAACTACA AAACAATAAT GTATAACTAC AAGGG
 
Protein sequence
MQLKSLFTVL AAGAVAHAAT STSKDPCSLS TTITAVGELE TLNACSTLDG SITITGQEII 
NADLSGVREI KGDLKFFNST SIVSLNLNQL QNITEGGSLS VVSLTTLASI DFTSLTNVDQ
VLLTSLPSLG NLVMGFGVVH AGHIEISDTA INSLSRFVSF LNTVRHLELN SNKNITSIDL
TNLNTVTENL ILRFNGDDCD VKLDTLAWAS NITIQDVGDI EISNITAING SLVLAYNTFD
SFNLDSLKTI GGSIEIFAND ELTEFSFHDL ETIGGELSLS NNTNLENVTD SFPNLNRIKG
AVNIDGGFAN FSTPKLARVN GDFSFNSTNE DFSCDFFNKL RDNKDIEGHN YECTAPKKSS
SSTAKSKSTS TSESSSDSTS DDSGSSSTTS KKSDGYILVP ASMALTTIIG SFLAFIL