Gene PICST_67502 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_67502 
SymbolWSC1 
ID4838873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp463921 
End bp465594 
Gene Length1674 bp 
Protein Length423 aa 
Translation table12 
GC content47% 
IMG OID640390188 
Productbeta-1,6-N-acetylglucosaminyltransferase, contains WSC domain 
Protein accessionXP_001384384 
Protein GI150865245 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.794835 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCCAACCATA GCATAGGACG AGTCCCCAGA TCTTGCTCCG TTACCTGTAA ATCAGACATC 
ACCACTGTTG AAAAAGAATC TGATCTGCTC ATCGAACCAG AATCAGAGTT TGACAGAACA
GTGCCAAACC ACCAAAACCC ACCCAAGTTT ATCGCAACAA AAGCTGAATA AGTCACAGCA
GCTCCCAGTG TGACCCATAA CTCCAGATAA GTGCATAGTA CTTGTCGTTT GTATTTTTTT
CATTATTCGC ATCGTTTCAT TTCCTCACCT TCCACTCCCA CTTCGCGGCC CCTTAATATC
CACGCCCTCT AAGCATCACA AGCTGCACCA TCTTGTAACG TCATAAATTC ACAACAATTG
CATTTACAAC CATGTTCAAA ACACTAATAC CACTAGTGCT AGCCATGGCC CTAGCCGCCA
AACCTGCCAA TGCCGACTCA TGGTCCAGCG ACGGTTGCTA CGCCCAGAAG TCGTCGCTAG
GCTTCTCTCT TCTGGACCAA AACATCTACC AGTCGTCGGG CCACTGTGAG CTACAATGTG
AAGGCAAAAG AGTTGTTGCT CTCTTGAGCG GTAAATATTG CTACTGCGGA GACACAGCTC
CGGACTCCTC AAATCAGGTT TCTTCTTCCA ATTGTAATGT TCCTTGTCCT GGCTACCCTG
AAGAAGAATG TGGGGGAGAT AACTATTTCT TGGTCTACGT AAATGCCGAT GTCGAAGATG
CTACAACGTC AACCAACACC AAGACGACGA CGACGACGTC TGTGACGTCT TCAACATCCA
AATCTACTTC GGTTTCTACC TCAACTTCTA CCAAAGTATC ATCTTCTACA GTAGAGCCAT
CTTCATCTTC TTCGCAAGAG GAAGAAACCT CTTCCTCTGC ACAAGAAACG TCCACAGATT
CACCAGAAAC TTCAAGTTCA ACTTCAGAAT CGTCTTCGCC CAAACACAAT GCTGTCACCA
CTATTGTGCT GACACTCACT ACAAACCCAG CAGGCACCAG TCCCTCTATC ATTTACAAAA
CCATCGTCAA CACCCCTTCA TCTACAGCCA GTGGCCCTAC CAACACATCT GATGTAGACG
AGACTGAAGA TAAAACGTCA TCTTCCAACA AGTCGTTGTC AGCCGGAGGT ATAGCTGGAG
CAGTTGTAGG ATCCATTGCT GGCATTGGGC TCATTGCGGG GCTTATCTTC GCCTTTATGT
GGTGGAGACG TAAGAGGAGC GACGATGAAG ACTACGATGA CGAGTTTACG CTTTCTGGAC
CTGAAAAAAC AGGATTCCCA GCACCTCTGC CACCTCCCCA GTTAACAGCC AATCCGTTCT
TGATAGCTGG AGGCTACAAC TTCGACGTGG ACCAGAACGG TCATGCCAAC GGTAGTCCAA
ATTCAACGAG CATGGGGCAC AGCCGCGAAG CATCGCAAGC TGCTGCTGGA TTCGGCCATG
GATACCATAA TCTGAACGAA GGACATAGTG GAGGAGAGCA TTCGTTCAAT AGCGATAGTA
ACAATAACAA CACTAACGAC TTCACATTCT TGGATCCACC GGCAGACTCT CCACCAGAGC
TCGGAAGACG GAGATTGAGT GTGGGATCGC TTCCAGACAT CATAGCCCGT CAACCAGGAT
CTTTGAAGGT TGTTAATAAC TAATTAATAT ATACACTGCT GGAAGCAGTT AAGG
 
Protein sequence
MFKTLIPLVL AMALAAKPAN ADSWSSDGCY AQKSSLGFSL SDQNIYQSSG HCELQCEGKR 
VVALLSGKYC YCGDTAPDSS NQVSSSNCNV PCPGYPEEEC GGDNYFLVYV NADVEDATTS
TNTKTTTTTS VTSSTSKSTS VSTSTSTKVS SSTVEPSSSS SQEEETSSSA QETSTDSPET
SSSTSESSSP KHNAVTTIVS TLTTNPAGTS PSIIYKTIVN TPSSTASGPT NTSDVDETED
KTSSSNKSLS AGGIAGAVVG SIAGIGLIAG LIFAFMWWRR KRSDDEDYDD EFTLSGPEKT
GFPAPSPPPQ LTANPFLIAG GYNFDVDQNG HANGSPNSTS MGHSREASQA AAGFGHGYHN
SNEGHSGGEH SFNSDSNNNN TNDFTFLDPP ADSPPELGRR RLSVGSLPDI IARQPGSLKV
VNN