Gene PICST_42812 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_42812 
SymbolCHS1 
ID4838262 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp1462885 
End bp1465878 
Gene Length2994 bp 
Protein Length997 aa 
Translation table12 
GC content39% 
IMG OID640389577 
Productchitin synthase 
Protein accessionXP_001383564 
Protein GI150864649 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.676694 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCCTT TCGAAGATGA GGATAACCCG TTTAGGGAGA ATCAGCCTGC CTTACCCAAT 
GCTGCCTTTA CAAGTGACAG AACTTCTCCC AAAAGATACC ACCAATTACA AGACGAAGAG
GATGCGGACG ACGATTTCAT GGCTTTTCCA CTGTCTACAC ATCAGAGACA ATCTCCAAAT
ATCTTTACCA GAAGTACTGC TAGAGTAAAT TCTAAATTAA AAGGAAATAA CTTAGGCGAT
ACTCCAAACT TGCTGTTCGC TAACAATTCT ACCCCTAGAG CTCAATTTAC TACAAGGGAA
TCTCCAAAGA GGCAGAAAAA GAGGGAAGTG CTAATTGTTG ATCATGAAGA TGATGAAAAT
TATGGATCCC CTACGAGGTC ACAGTTCGGC GGGAGCTCTA TTGCCAGTGA AAGGTTTTTA
GAACCACCGC AACCGATTTT CTCAAGAGAA ACATTCGCTG AAGCCAACAA CGCAGAAGAC
GAAAACTCAA CAATAACAGA TGAAAAGGAT AATTATGATT ATAATTCCTA TCAGAAAGCA
CATGAACTCG AAGAGAGGTC ATTGTATTCA GAGTCTACTG CTTATAGTGG GTCGTCATAT
TTATCACAAC CAACACACGC AGAGAGTGAC TATTTTGGAG CATCAATTGA TGGCAATATC
ATGAGTAATA TCAACAATGG CTATGTTCCT AAAAGAGAGA AGACATTAAC CAAGAGAAAA
GTTAGATTGG TCGGTGGTAA GACAGGCAAT CTAGTGTTGG AAAATCCAGT TCCTGAAGAA
CTTCGAAAGG TGCTAACACG AACTGAATCG CCGTTTGGAG AATTCACAAA CATGACTTAT
ACGGCTTGTA CGTCTGATCC TGATAATTTC ATCAGCGACG GATTCAGTCT CAGGGCTGCC
AAATTCGAAA GAGAAACCGA GATTGTCATC TGTATCACTA TGTACAACGA AGATGAGCAT
GCATTTGCCA GAACCATGCA TGGGGTTATG AAAAATGTTG CCCATTTATG TTCGAGGCAT
AAGTCCAAAC TATGGGGCAA GGAAGCATGG AAGAAAATTC AAGTCATTAT TGTAGCAGAC
GGAAGAAACA AAGTTAACGA ATCTGTTCTC CAATTATTGA CAGCCACAGG ATGTTATCAA
GACAATTTGG CTAGACCATT TGTAAACAAC AAGAAGGTAA ATGCACATTT ATTTGAATAT
ACCACACAAA TATCAATCGA TGAAAACTTG AAATTCAAGG GAGATGAAAA GAACTTGGCA
CCAGTTCAAG TATTATTCTG CCTTAAAGAA CAGAATCAGA AGAAAATTAA TTCTCATAGA
TGGTTGTTCA ATGCCTTCTG TCCAATTTTG GATCCAAACG TTGTGGTTTT GCTTGATGTG
GGAACTAAAC CAGATAACCA CGCTATCTAT AATCTATGGA AAGCTTTTGA TAGGGATTCC
AACGTTGCTG GCGCTGCTGG TGAAATCAAA GCAATGAAAG GAAAGGGGTG GATCAACTTG
ACGAATCCAT TAGTAGCTTC TCAAAATTTT GAGTACAAAA TGTCCAACAT ATTGGATAAA
CCTCTTGAAT CATTGTTTGG TTACATCTCT GTGTTACCTG GGGCATTGTC TGCATACAGA
TATAAAGCAT TGACAAACCA TGAAGACGGT ACCGGTCCTT TGGCAGCATA TTTCAAGGGA
GAAGACCTTT TAAATGACCA TCATTCGGAC AAGCAGAACA GTAAAACAAA TTTCTTTGAG
GCAAATATGT ATCTTGCAGA AGACAGAATT TTATGTTGGG AGTTAGTCGC AAAAAGAAAG
GAGAATTGGG TTTTAAAATT TGTCAAGCTG GCTACTGGTG AAACAGATGT TCCCGAAAAT
CTACCTGAAT TTATATCACA AAGAAGGCGT TGGATCAATG GTGCTTTCTT TGCAGCGCTT
TATGCATTAC GACACAGCAA TAGAATTTGG GCGACCGATC ATTCCTTCGC TCGAAAATTT
TGGTTCCAGA TTGAATTCGC TTATCAATTC GTTACACTTG TCTTTTCATT CTTTTCATTG
AGTAACTTCT ATTTGACGTT TTACTTCTTG ACTGGTTCAT TAGTGTCAAA CAAGAACTTT
GGTCATAATG GTGGATTCTG GATTTTTACA CTCTTCAACT ATTTGTGTAT TTGTATTTTG
ACTTCGTTGT TTATTGTATC GATAGGAAAC AGACCACAAG CTTCCAAGAA TATTTTCAAA
ACTTTGATTA TCCTTTTGAC CATTTGTGCC TTATACGCAT TGATAGTCGG GTTTTATTTT
GTTTTCAACA CGATCACAGA ATTTGGAATG GGAGATTCGT CCACCTATGT CCTTGTGAGC
ATTGTTGTTT CTTTGTTAGC AACGTATGGC CTATATTTCT TGATGTCTTT CCTATACATG
GATCCTTGGC ACATGTTTAC ATGTTCTATC CAATACTTCT TGATGATACC TTCATACACC
TGCACATTGC AGATATTTGC TTTCTGTAAC ACCCATGATG TTTCCTGGGG TACAAAAGGT
GACAATAACC CCAAGATTGA CAAGAGTAAC CAATACATTA TTGAGAAAAA TGACAAAGGA
GAATTTGAAG CTGTCGTAGT TGATGTGAAT ATAGACGAAG TTTACTTGGA GACTTTGTAC
AACATCAGAG CTAAAAGATC AAATAGAAAG GTTGCTCATG TACATAAGGA AAAGGCTCCT
ATGGAAGGAG AAGACTATGC AAAGGATGTT CGTACCAGGG TTGTTTTAAT TTGGATGATA
GCAAACCTAG TATTCATCAT GACAATGCTT CAAGTTTATT CTCCTGGCGA GACTACAAAG
AATATTTACC TTGCCTTCAT CTTATGGACT GTCGCTCTAT TGTCACTTTT CAGAGCAATT
GGCTCCTTGG GATACCTTTT ACAAAACCTT GCAAGATTTG TCGTTGAGAG CAGAAGCAAA
TGGTTACACA GGAGAGAAGG TTATGCTGTA CCCTCTCATA ATCCTTTGAA TTGA
 
Protein sequence
MNPFEDEDNP FRENQPALPN AAFTSDRTSP KRYHQLQDEE DADDDFMAFP SSTHQRQSPN 
IFTRSTARVN SKLKGNNLGD TPNLSFANNS TPRAQFTTRE SPKRQKKREV LIVDHEDDEN
YGSPTRSQFG GSSIASERFL EPPQPIFSRE TFAEANNAED ENSTITDEKD NYDYNSYQKA
HELEERSLYS ESTAYSGSSY LSQPTHAESD YFGASIDGNI MSNINNGYVP KREKTLTKRK
VRLVGGKTGN LVLENPVPEE LRKVLTRTES PFGEFTNMTY TACTSDPDNF ISDGFSLRAA
KFERETEIVI CITMYNEDEH AFARTMHGVM KNVAHLCSRH KSKLWGKEAW KKIQVIIVAD
GRNKVNESVL QLLTATGCYQ DNLARPFVNN KKVNAHLFEY TTQISIDENL KFKGDEKNLA
PVQVLFCLKE QNQKKINSHR WLFNAFCPIL DPNVVVLLDV GTKPDNHAIY NLWKAFDRDS
NVAGAAGEIK AMKGKGWINL TNPLVASQNF EYKMSNILDK PLESLFGYIS VLPGALSAYR
YKALTNHEDG TGPLAAYFKG EDLLNDHHSD KQNSKTNFFE ANMYLAEDRI LCWELVAKRK
ENWVLKFVKS ATGETDVPEN LPEFISQRRR WINGAFFAAL YALRHSNRIW ATDHSFARKF
WFQIEFAYQF VTLVFSFFSL SNFYLTFYFL TGSLVSNKNF GHNGGFWIFT LFNYLCICIL
TSLFIVSIGN RPQASKNIFK TLIILLTICA LYALIVGFYF VFNTITEFGM GDSSTYVLVS
IVVSLLATYG LYFLMSFLYM DPWHMFTCSI QYFLMIPSYT CTLQIFAFCN THDVSWGTKG
DNNPKIDKSN QYIIEKNDKG EFEAVVVDVN IDEVYLETLY NIRAKRSNRK VAHVHKEKAP
MEGEDYAKDV RTRVVLIWMI ANLVFIMTML QVYSPGETTK NIYLAFILWT VALLSLFRAI
GSLGYLLQNL ARFVVESRSK WLHRREGYAV PSHNPLN