Gene Information |
Locus tag | PICST_42812 |
Symbol | CHS1 |
ID | 4838262 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 1462885 |
End bp | 1465878 |
Gene Length | 2994 bp |
Protein Length | 997 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640389577 |
Product | chitin synthase |
Protein accession | XP_001383564 |
Protein GI | 150864649 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
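The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable and function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 1462885
END_BP = 1465878
GENE_LENGTH_BP = 2994
PROTEIN_LENGTH_AA = 997


def span_length(start: int, end: int) -> int:
    """Length of an inclusive coordinate range (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    codons, remainder = divmod(gene_length, 3)
    if remainder != 0:
        raise ValueError("gene length is not a multiple of 3")
    return codons - 1  # drop the stop codon


span = span_length(START_BP, END_BP)
print(span, expected_protein_length(span))
```

Under these assumptions the span comes out at 2994 bp and the expected protein length at 997 aa, which agrees with the reported Gene Length and Protein Length.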
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.676694 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAATCCTT TCGAAGATGA GGATAACCCG TTTAGGGAGA ATCAGCCTGC CTTACCCAAT GCTGCCTTTA CAAGTGACAG AACTTCTCCC AAAAGATACC ACCAATTACA AGACGAAGAG GATGCGGACG ACGATTTCAT GGCTTTTCCA CTGTCTACAC ATCAGAGACA ATCTCCAAAT ATCTTTACCA GAAGTACTGC TAGAGTAAAT TCTAAATTAA AAGGAAATAA CTTAGGCGAT ACTCCAAACT TGCTGTTCGC TAACAATTCT ACCCCTAGAG CTCAATTTAC TACAAGGGAA TCTCCAAAGA GGCAGAAAAA GAGGGAAGTG CTAATTGTTG ATCATGAAGA TGATGAAAAT TATGGATCCC CTACGAGGTC ACAGTTCGGC GGGAGCTCTA TTGCCAGTGA AAGGTTTTTA GAACCACCGC AACCGATTTT CTCAAGAGAA ACATTCGCTG AAGCCAACAA CGCAGAAGAC GAAAACTCAA CAATAACAGA TGAAAAGGAT AATTATGATT ATAATTCCTA TCAGAAAGCA CATGAACTCG AAGAGAGGTC ATTGTATTCA GAGTCTACTG CTTATAGTGG GTCGTCATAT TTATCACAAC CAACACACGC AGAGAGTGAC TATTTTGGAG CATCAATTGA TGGCAATATC ATGAGTAATA TCAACAATGG CTATGTTCCT AAAAGAGAGA AGACATTAAC CAAGAGAAAA GTTAGATTGG TCGGTGGTAA GACAGGCAAT CTAGTGTTGG AAAATCCAGT TCCTGAAGAA CTTCGAAAGG TGCTAACACG AACTGAATCG CCGTTTGGAG AATTCACAAA CATGACTTAT ACGGCTTGTA CGTCTGATCC TGATAATTTC ATCAGCGACG GATTCAGTCT CAGGGCTGCC AAATTCGAAA GAGAAACCGA GATTGTCATC TGTATCACTA TGTACAACGA AGATGAGCAT GCATTTGCCA GAACCATGCA TGGGGTTATG AAAAATGTTG CCCATTTATG TTCGAGGCAT AAGTCCAAAC TATGGGGCAA GGAAGCATGG AAGAAAATTC AAGTCATTAT TGTAGCAGAC GGAAGAAACA AAGTTAACGA ATCTGTTCTC CAATTATTGA CAGCCACAGG ATGTTATCAA GACAATTTGG CTAGACCATT TGTAAACAAC AAGAAGGTAA ATGCACATTT ATTTGAATAT ACCACACAAA TATCAATCGA TGAAAACTTG AAATTCAAGG GAGATGAAAA GAACTTGGCA CCAGTTCAAG TATTATTCTG CCTTAAAGAA CAGAATCAGA AGAAAATTAA TTCTCATAGA TGGTTGTTCA ATGCCTTCTG TCCAATTTTG GATCCAAACG TTGTGGTTTT GCTTGATGTG GGAACTAAAC CAGATAACCA CGCTATCTAT AATCTATGGA AAGCTTTTGA TAGGGATTCC AACGTTGCTG GCGCTGCTGG TGAAATCAAA GCAATGAAAG GAAAGGGGTG GATCAACTTG ACGAATCCAT TAGTAGCTTC TCAAAATTTT GAGTACAAAA TGTCCAACAT ATTGGATAAA CCTCTTGAAT CATTGTTTGG TTACATCTCT GTGTTACCTG GGGCATTGTC TGCATACAGA TATAAAGCAT TGACAAACCA TGAAGACGGT ACCGGTCCTT TGGCAGCATA TTTCAAGGGA GAAGACCTTT TAAATGACCA TCATTCGGAC AAGCAGAACA GTAAAACAAA TTTCTTTGAG GCAAATATGT ATCTTGCAGA AGACAGAATT TTATGTTGGG AGTTAGTCGC AAAAAGAAAG 
GAGAATTGGG TTTTAAAATT TGTCAAGCTG GCTACTGGTG AAACAGATGT TCCCGAAAAT CTACCTGAAT TTATATCACA AAGAAGGCGT TGGATCAATG GTGCTTTCTT TGCAGCGCTT TATGCATTAC GACACAGCAA TAGAATTTGG GCGACCGATC ATTCCTTCGC TCGAAAATTT TGGTTCCAGA TTGAATTCGC TTATCAATTC GTTACACTTG TCTTTTCATT CTTTTCATTG AGTAACTTCT ATTTGACGTT TTACTTCTTG ACTGGTTCAT TAGTGTCAAA CAAGAACTTT GGTCATAATG GTGGATTCTG GATTTTTACA CTCTTCAACT ATTTGTGTAT TTGTATTTTG ACTTCGTTGT TTATTGTATC GATAGGAAAC AGACCACAAG CTTCCAAGAA TATTTTCAAA ACTTTGATTA TCCTTTTGAC CATTTGTGCC TTATACGCAT TGATAGTCGG GTTTTATTTT GTTTTCAACA CGATCACAGA ATTTGGAATG GGAGATTCGT CCACCTATGT CCTTGTGAGC ATTGTTGTTT CTTTGTTAGC AACGTATGGC CTATATTTCT TGATGTCTTT CCTATACATG GATCCTTGGC ACATGTTTAC ATGTTCTATC CAATACTTCT TGATGATACC TTCATACACC TGCACATTGC AGATATTTGC TTTCTGTAAC ACCCATGATG TTTCCTGGGG TACAAAAGGT GACAATAACC CCAAGATTGA CAAGAGTAAC CAATACATTA TTGAGAAAAA TGACAAAGGA GAATTTGAAG CTGTCGTAGT TGATGTGAAT ATAGACGAAG TTTACTTGGA GACTTTGTAC AACATCAGAG CTAAAAGATC AAATAGAAAG GTTGCTCATG TACATAAGGA AAAGGCTCCT ATGGAAGGAG AAGACTATGC AAAGGATGTT CGTACCAGGG TTGTTTTAAT TTGGATGATA GCAAACCTAG TATTCATCAT GACAATGCTT CAAGTTTATT CTCCTGGCGA GACTACAAAG AATATTTACC TTGCCTTCAT CTTATGGACT GTCGCTCTAT TGTCACTTTT CAGAGCAATT GGCTCCTTGG GATACCTTTT ACAAAACCTT GCAAGATTTG TCGTTGAGAG CAGAAGCAAA TGGTTACACA GGAGAGAAGG TTATGCTGTA CCCTCTCATA ATCCTTTGAA TTGA
Protein sequence | MNPFEDEDNP FRENQPALPN AAFTSDRTSP KRYHQLQDEE DADDDFMAFP SSTHQRQSPN IFTRSTARVN SKLKGNNLGD TPNLSFANNS TPRAQFTTRE SPKRQKKREV LIVDHEDDEN YGSPTRSQFG GSSIASERFL EPPQPIFSRE TFAEANNAED ENSTITDEKD NYDYNSYQKA HELEERSLYS ESTAYSGSSY LSQPTHAESD YFGASIDGNI MSNINNGYVP KREKTLTKRK VRLVGGKTGN LVLENPVPEE LRKVLTRTES PFGEFTNMTY TACTSDPDNF ISDGFSLRAA KFERETEIVI CITMYNEDEH AFARTMHGVM KNVAHLCSRH KSKLWGKEAW KKIQVIIVAD GRNKVNESVL QLLTATGCYQ DNLARPFVNN KKVNAHLFEY TTQISIDENL KFKGDEKNLA PVQVLFCLKE QNQKKINSHR WLFNAFCPIL DPNVVVLLDV GTKPDNHAIY NLWKAFDRDS NVAGAAGEIK AMKGKGWINL TNPLVASQNF EYKMSNILDK PLESLFGYIS VLPGALSAYR YKALTNHEDG TGPLAAYFKG EDLLNDHHSD KQNSKTNFFE ANMYLAEDRI LCWELVAKRK ENWVLKFVKS ATGETDVPEN LPEFISQRRR WINGAFFAAL YALRHSNRIW ATDHSFARKF WFQIEFAYQF VTLVFSFFSL SNFYLTFYFL TGSLVSNKNF GHNGGFWIFT LFNYLCICIL TSLFIVSIGN RPQASKNIFK TLIILLTICA LYALIVGFYF VFNTITEFGM GDSSTYVLVS IVVSLLATYG LYFLMSFLYM DPWHMFTCSI QYFLMIPSYT CTLQIFAFCN THDVSWGTKG DNNPKIDKSN QYIIEKNDKG EFEAVVVDVN IDEVYLETLY NIRAKRSNRK VAHVHKEKAP MEGEDYAKDV RTRVVLIWMI ANLVFIMTML QVYSPGETTK NIYLAFILWT VALLSLFRAI GSLGYLLQNL ARFVVESRSK WLHRREGYAV PSHNPLN
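The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on two short fragments taken from the gene sequence above. The helper names (`build_table`, `translate`, `TABLE_1`, `TABLE_12`) are hypothetical. The codon tables are built by hand here rather than taken from a library, and only the CTG reassignment is modelled.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the standard code (table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def build_table(amino_acids: str) -> dict:
    """Map each of the 64 codons to a one-letter amino acid ('*' marks a stop)."""
    codons = ["".join(triplet) for triplet in product(BASES, repeat=3)]
    return dict(zip(codons, amino_acids))


TABLE_1 = build_table(STANDARD_AA)
# Table 12 as modelled here: identical to table 1 except CTG -> Ser.
TABLE_12 = dict(TABLE_1, CTG="S")


def translate(dna: str, table: dict) -> str:
    """Translate an in-frame DNA string codon by codon, ignoring spaces."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any incomplete trailing codon
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence (codons 1-10).
print(translate("ATGAATCCTT TCGAAGATGA GGATAACCCG", TABLE_12))
# In-frame fragment spanning codons 49-55, which contains a CTG codon.
fragment = "TTTCCACTGTCTACACATCAG"
print(translate(fragment, TABLE_12), translate(fragment, TABLE_1))
```

With table 12 the second fragment translates to FPSSTHQ, matching residues 49-55 of the protein sequence above, whereas the standard code would give FPLSTHQ.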