Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_29105 |
Symbol | CHS3 |
ID | 4851841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 2975863 |
End bp | 2979471 |
Gene Length | 3609 bp |
Protein Length | 1202 aa |
Translation table | |
GC content | 43% |
IMG OID | 640393549 |
Product | chitin-UDP acetyl-glucosaminyl transferase 3 |
Protein accession | XP_001387135 |
Protein GI | 126275759 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.196175 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000301151 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCTTCC ATAGAAAGAG CGACAGCTCG AACAGATCCA AGTACCAGGA GTTCGATCCT GAGTCTGGCG AATTGGGCAG AAAGAGATCG CTTGTCAGAC CTGAGAGATC TCGTATAGAC CCTGACCATC CCAGATACCA CTACACCCAG GTCACCAATC AGGAAGCTGG CCACTTGAAG GTGCTTCCTT CTTCAACTGG CCTAGACCCG CACCTGCAAA CAGACCACTT GAGTCCCACC AGGTCGTACC AGCCTGCCAA CTTAAACATA TATAAGACGA CTGGTCAGAG CGATGAAGAC GAGGGCATAC CCCTCATGGA TATCCACGAC TCATCGCCAG GTGGCAATGA CTTGAAGGGT GGTGTAGAAG TGTTTGGTTT GAACGATGAA ATCAACGACG AATACAACTC TCCGTCTAAG AACAAGGTAA TTTCAAAACC AATCAGAGCT CCATATGTAG AGGAAGATGA CGACAAGAAG AGTAACATCT ATTTCTGGAA GGTGTACTGC TATGTCATCA CGTTCTGGGC CCCAGCCCCG TTGTTGAAGT TGTTTGGATT GAAAACCAAG GATCGTCAAT TTGCCTGGAG AGAAAAAATC GGTTTGATCT CGTGTATTCT CTACATCGGT ACTATTGTTG CATATTTAAC CTTTGGTTTC ACCAGGACAG TTTGTTCTAA CAAGCAAATC AGAACCAGAT ACAATGAAGT CAACAGCGGT ACCTTGGTCA TCAACGGTAG AGCCTTTGAT TTATCGAATT CGCAACATCC AAAAGCCGCA GGTATTTCAG CAGGCTCTAA TGTCCTTTAT CCACCCATTA ATGCCGGCGG TATGGATGCT TCGTTCCTTT TCCAGAATGT TAATGGTAAC TGTAAGAACT TAATTGTTCC TAAAGAGAAC TGTTCGATTC CAACCAATGA CGATAAAGAG TTAGCTTGGT ACATGCCTTG CCGTTTACTT AACCAGGATG GCTCTTCTCA GCCTAACTTC ACCACAGAAT ACTACAAAGG TTGGGCTTGT CACACAAGTT CTTCTGCTAG AGAAGCATTC TACAGTTTGG ATGTCAGCGG TGACGTCTAC TTCACCTGGG ATGACATCAG AAACAATTCC AGAAACTTGG TTGTCTACTC CGGAAATGTT CTAGATCTTG ATTTGCTCAA TTGGATTCAA ACAGACGACG TCGATTATCC AGATTTGTTC AACAAGTTGC GTGAAGACTC TACTTTCCGT GGCCATGATA TATCTCTTGT CTTAACGAAC CCGGAAGAAA GACAGGCCGC AAGATGTTTG AGTGAAGTCA TCAAGGTTGG TACTATCGAT TCCGATACCA TTGGCTGTAT TGCGTCCAGT ATCGTTTTGG TGGTTTCGTT GATTTTCATT TTGTCTGTCG TAGTTGTCAA GTTCATTATG GCTTGTTATT TCAGGTGGGT AATATCCAGA AAGCAAGGTG CTACTGAAAT CGACAACAAG TCTATGGCTC AAAGAGAGAA GGAAATCGAA AACTGGGTGG ACAACCCAGA CACCGCTATC GGTAATACAA TCAAAACTGT TCCTATTAAA GCAAGAGCTA ACTACAAGAG TGCTAAGACT AACAGACAAT CAGTCTTTTT CAAGAACAGC AACAGATTAT CGTTGGCTCC CACTGCTGAA TTGGCTCAAT ACTACGATAA TTCTGATAAG TTGTCCAAAT CTTTCAAGTA CACTACCATG ACTACGCAGG CTGCTTTGTT GGGACTGTCT TCAAAGAAAA ATGGATTGAA GACTGCCGGT ACAAGACAGT CTACTCTCTA CTTGTCGGAG CATGGTTCTT CTACAGACTT GTTGTCCAGA CCAGTATCTG CTTACAACCC ATTTGAAACC TTTGAAGATG CTTATCCAAT GAAGACTTTA TCTCCCGATC TCATTCACCC TGATGTTGTT CCTCAGCCTC CAGTAGAGTA CCAACCATTT GGATATCCTT TGGCCCATGC CATAGTTTTC GTAACGTGTT ACTCTGAAGA TGAGGATGGT ATCCGTACCA CTTTGGACTC TATTGCAACT ACCGACTACC CCAACTCTCA CAAGGTGATA GTGGTTGTCT GTGATGGTAT AATTAAGGGT TCTGGAAATG ACAGAACAAC TCCTGATATA TTGTTGGACA TGATGTCTGA ATTTGCTATT CCAAAGGATG AAGTTCAGCC ATACTCTTAC GTAGCAGTTG CCCAAGGTAG TAAGCGTCAT AACATGGCCA GGATCTATTC CGGTTTCTAT AAGTATAACG ACGATACTGT CCCAGTTGAA AAGCAACAAA GAGTTCCTAT AGTTACAATT GTTAAGTGTG GAACCCCAGA TGAAGCAGGC TCTCCAAAGC CTGGTAACAG GGGTAAGCGT GATTCCCAAA TCATTTTGAT GTCTTTCTTG CAGAAAATAA TGTTTGATGA AAGAATGACT GCCTTGGAAT ACCAAATATT GATGAGCATC TGGAGAATAA CAGGGTTGAT GTCTGAATTG TATGAAGTAG TGTTGATGGT TGACGCTGAT ACCAGGGTTT TCCCAGACAG TTTAACACAT ATGTGTGCTG AAATGATCAA GGATCCTTCT ATTATGGGAT TGTGTGGGGA AACTAAGATT GCTAACAAGA AACAATCTTG GGTTACGGCA ATTCAAGTCT TTGAGTACTA TATTTCTCAT CACCAAGCCA AGGCTTTTGA ATCTGTGTTT GGTGGTGTCA CATGTTTGCC TGGTTGTTTC TCGATGTACA GAATCAAGAC TCCAAAGGGT TCTGACGGTT ATTGGGTTCC GATTTTGGCT AATCCTGATA TTGTTGAAAG ATATGCTGAT AACGTTGTAG ATACATTGCA TAGGAAGAAC TTGTTGTTGT TAGGTGAGGA TCGTTTCCTT ACCTCTTTGA TGTTGAGAAC TTTCCCCAAG AGAAAACAAG TCTTCGTTCC AAAAGCAGCG TGTAAAACTG TTGTTCCTGA CAAATTTAAA GTTTTGCTTT CTCAGCGTCG TCGTTGGATT AACTCTACTG TTCATAACTT GTTGGAATTG GTTTTGGTTA AGGACTTGTG TGGTACCTTC TGTTTCTCGA TGCAGTTTGT TATTTTCATT GAATTGATTG GTACATTAGT TTTGCCAGCA GCTATCTCCT TCACAGTTTA TGTCATTGTC GTTGCTATTA TTTCGAAGCC TACTCCAATC TTGTCGTTGG TTCTTTTGGC CATTATCTTT GGGTTGCCGG GTTGTTTAAT TGTCATTACG GTGTCTTCTC TTTCATACAT TATCTATTTC TTCATCTACC TTTTTGCATT GCCAATCTGG AATTTTGTCT TGCCAACCTA CGCTTATTGG AAATTTGATG ATTTCAGTTG GGGTGAAACA AGATCAGTTG CGGGAGGCGA TAAGGGAGAC CACGGTAGCT CTACGGGTAA GTTCGACTCT TCCATGATCA CAATGAAACG TTGGAAGGAA TTTGAAAGAG ACAGAAGAAA CAAAGAATCG GTTGGTCACA CCAGCAACAT ATTGCCATTG CCAGGTGCCA CATGGGATCC ATCCAATTCG GAGAAGTTGT TGGATGAAAC CTACTCTGAA GGGTCAGGAT CGGGTTCCTT GCCTGGAGTA CCCCTCTAG
|
Protein sequence | MSFHRKSDSS NRSKYQEFDP ESGELGRKRS LVRPERSRID PDHPRYHYTQ VTNQEAGHLK VLPSSTGLDP HLQTDHLSPT RSYQPANLNI YKTTGQSDED EGIPLMDIHD SSPGGNDLKG GVEVFGLNDE INDEYNSPSK NKVISKPIRA PYVEEDDDKK SNIYFWKVYC YVITFWAPAP LLKLFGLKTK DRQFAWREKI GLISCILYIG TIVAYLTFGF TRTVCSNKQI RTRYNEVNSG TLVINGRAFD LSNSQHPKAA GISAGSNVLY PPINAGGMDA SFLFQNVNGN CKNLIVPKEN CSIPTNDDKE LAWYMPCRLL NQDGSSQPNF TTEYYKGWAC HTSSSAREAF YSLDVSGDVY FTWDDIRNNS RNLVVYSGNV LDLDLLNWIQ TDDVDYPDLF NKLREDSTFR GHDISLVLTN PEERQAARCL SEVIKVGTID SDTIGCIASS IVLVVSLIFI LSVVVVKFIM ACYFRWVISR KQGATEIDNK SMAQREKEIE NWVDNPDTAI GNTIKTVPIK ARANYKSAKT NRQSVFFKNS NRLSLAPTAE LAQYYDNSDK LSKSFKYTTM TTQAALLGLS SKKNGLKTAG TRQSTLYLSE HGSSTDLLSR PVSAYNPFET FEDAYPMKTL SPDLIHPDVV PQPPVEYQPF GYPLAHAIVF VTCYSEDEDG IRTTLDSIAT TDYPNSHKVI VVVCDGIIKG SGNDRTTPDI LLDMMSEFAI PKDEVQPYSY VAVAQGSKRH NMARIYSGFY KYNDDTVPVE KQQRVPIVTI VKCGTPDEAG SPKPGNRGKR DSQIILMSFL QKIMFDERMT ALEYQILMSI WRITGLMSEL YEVVLMVDAD TRVFPDSLTH MCAEMIKDPS IMGLCGETKI ANKKQSWVTA IQVFEYYISH HQAKAFESVF GGVTCLPGCF SMYRIKTPKG SDGYWVPILA NPDIVERYAD NVVDTLHRKN LLLLGEDRFL TSLMLRTFPK RKQVFVPKAA CKTVVPDKFK VLLSQRRRWI NSTVHNLLEL VLVKDLCGTF CFSMQFVIFI ELIGTLVLPA AISFTVYVIV VAIISKPTPI LSLVLLAIIF GLPGCLIVIT VSSLSYIIYF FIYLFALPIW NFVLPTYAYW KFDDFSWGET RSVAGGDKGD HGSSTGKFDS SMITMKRWKE FERDRRNKES VGHTSNILPL PGATWDPSNS EKLLDETYSE GSGSGSLPGV PL
|
| |