Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_67502 |
Symbol | WSC1 |
ID | 4838873 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 463921 |
End bp | 465594 |
Gene Length | 1674 bp |
Protein Length | 423 aa |
Translation table | 12 |
GC content | 47% |
IMG OID | 640390188 |
Product | beta-1,6-N-acetylglucosaminyltransferase, contains WSC domain |
Protein accession | XP_001384384 |
Protein GI | 150865245 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.794835 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCCAACCATA GCATAGGACG AGTCCCCAGA TCTTGCTCCG TTACCTGTAA ATCAGACATC ACCACTGTTG AAAAAGAATC TGATCTGCTC ATCGAACCAG AATCAGAGTT TGACAGAACA GTGCCAAACC ACCAAAACCC ACCCAAGTTT ATCGCAACAA AAGCTGAATA AGTCACAGCA GCTCCCAGTG TGACCCATAA CTCCAGATAA GTGCATAGTA CTTGTCGTTT GTATTTTTTT CATTATTCGC ATCGTTTCAT TTCCTCACCT TCCACTCCCA CTTCGCGGCC CCTTAATATC CACGCCCTCT AAGCATCACA AGCTGCACCA TCTTGTAACG TCATAAATTC ACAACAATTG CATTTACAAC CATGTTCAAA ACACTAATAC CACTAGTGCT AGCCATGGCC CTAGCCGCCA AACCTGCCAA TGCCGACTCA TGGTCCAGCG ACGGTTGCTA CGCCCAGAAG TCGTCGCTAG GCTTCTCTCT TCTGGACCAA AACATCTACC AGTCGTCGGG CCACTGTGAG CTACAATGTG AAGGCAAAAG AGTTGTTGCT CTCTTGAGCG GTAAATATTG CTACTGCGGA GACACAGCTC CGGACTCCTC AAATCAGGTT TCTTCTTCCA ATTGTAATGT TCCTTGTCCT GGCTACCCTG AAGAAGAATG TGGGGGAGAT AACTATTTCT TGGTCTACGT AAATGCCGAT GTCGAAGATG CTACAACGTC AACCAACACC AAGACGACGA CGACGACGTC TGTGACGTCT TCAACATCCA AATCTACTTC GGTTTCTACC TCAACTTCTA CCAAAGTATC ATCTTCTACA GTAGAGCCAT CTTCATCTTC TTCGCAAGAG GAAGAAACCT CTTCCTCTGC ACAAGAAACG TCCACAGATT CACCAGAAAC TTCAAGTTCA ACTTCAGAAT CGTCTTCGCC CAAACACAAT GCTGTCACCA CTATTGTGCT GACACTCACT ACAAACCCAG CAGGCACCAG TCCCTCTATC ATTTACAAAA CCATCGTCAA CACCCCTTCA TCTACAGCCA GTGGCCCTAC CAACACATCT GATGTAGACG AGACTGAAGA TAAAACGTCA TCTTCCAACA AGTCGTTGTC AGCCGGAGGT ATAGCTGGAG CAGTTGTAGG ATCCATTGCT GGCATTGGGC TCATTGCGGG GCTTATCTTC GCCTTTATGT GGTGGAGACG TAAGAGGAGC GACGATGAAG ACTACGATGA CGAGTTTACG CTTTCTGGAC CTGAAAAAAC AGGATTCCCA GCACCTCTGC CACCTCCCCA GTTAACAGCC AATCCGTTCT TGATAGCTGG AGGCTACAAC TTCGACGTGG ACCAGAACGG TCATGCCAAC GGTAGTCCAA ATTCAACGAG CATGGGGCAC AGCCGCGAAG CATCGCAAGC TGCTGCTGGA TTCGGCCATG GATACCATAA TCTGAACGAA GGACATAGTG GAGGAGAGCA TTCGTTCAAT AGCGATAGTA ACAATAACAA CACTAACGAC TTCACATTCT TGGATCCACC GGCAGACTCT CCACCAGAGC TCGGAAGACG GAGATTGAGT GTGGGATCGC TTCCAGACAT CATAGCCCGT CAACCAGGAT CTTTGAAGGT TGTTAATAAC TAATTAATAT ATACACTGCT GGAAGCAGTT AAGG
|
Protein sequence | MFKTLIPLVL AMALAAKPAN ADSWSSDGCY AQKSSLGFSL SDQNIYQSSG HCELQCEGKR VVALLSGKYC YCGDTAPDSS NQVSSSNCNV PCPGYPEEEC GGDNYFLVYV NADVEDATTS TNTKTTTTTS VTSSTSKSTS VSTSTSTKVS SSTVEPSSSS SQEEETSSSA QETSTDSPET SSSTSESSSP KHNAVTTIVS TLTTNPAGTS PSIIYKTIVN TPSSTASGPT NTSDVDETED KTSSSNKSLS AGGIAGAVVG SIAGIGLIAG LIFAFMWWRR KRSDDEDYDD EFTLSGPEKT GFPAPSPPPQ LTANPFLIAG GYNFDVDQNG HANGSPNSTS MGHSREASQA AAGFGHGYHN SNEGHSGGEH SFNSDSNNNN TNDFTFLDPP ADSPPELGRR RLSVGSLPDI IARQPGSLKV VNN
|
| |