Gene PICST_82260 details


Gene Information

Locus tag: PICST_82260
Symbol: DCW1
ID: 4836647
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 291974
End bp: 293410
Gene Length: 1437 bp
Protein Length: 456 aa
Translation table: 12
GC content: 47%
IMG OID: 640387962
Product: Defective Cell Wall
Protein accession: XP_001382820
Protein GI: 126132590
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4833] Predicted glycosyl hydrolase
TIGRFAM ID:
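
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive 1-based coordinates (an assumption, though it agrees with the listed gene length). The constant and function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 291974
END_BP = 293410
GENE_LENGTH_BP = 1437
PROTEIN_LENGTH_AA = 456


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


gene_span = span_length(START_BP, END_BP)
cds_bases = coding_length(PROTEIN_LENGTH_AA)
extra = GENE_LENGTH_BP - cds_bases  # bases in the span not needed for coding
print(gene_span, cds_bases, extra)
```

Under that assumption the span equals the listed gene length, and the bases left over after accounting for the protein and a stop codon are compatible with the record flagging the gene as spliced.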


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.208592
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGATTCC ATCTTCTCAC AGCATCAGTG TCGCTCTTCA TGGCTTGTGC TTCGGCCATG 
TGGCTTGACA CCAATAACGA CACCACAATC GTGGAGGCCA CCAATCTTAT TGTAGACGGT
GTACTTGATT ACTATGACGG TAAAAATTAT GGAGGTGTCG TGGGGATGTT TGTGTGGCCC
TACTATTGGT GGGAAGCAGG CGGAGTATGG GGCTCGCTCA TCGACTACAC CTACTTCACA
CAGAACGATA CGCTCGTGCC ATTAATCAAG GAGGCCTTGT TGTACCAGAC AGGTGACGAT
AACAACTATG TGCCTTTGAA TCAGTCTACA ACTGAAGGTA ACGATGATCA AGGCTTTTGG
GGCATTGCTG TCATGGGAGC TGCCGAGAGA AATTTCTCCA ACCCTGATGA CGACTCCAAG
GCATGGTTGA CTTTGGCCCA AGCTGTATTC AACACCATGA CAGCCAGATG GGATAGTGCG
GAATGTAATG GTGGTTTGAG ATGGCAGATC TTCCAGTGGA ACTCAGGTTA TGACTACAAG
AACTCTGTTT CCAACGGATG CCTCTTTCAC ATCGGTGCAA GATTGGCCAG GTTCACCGCC
AATGACACCT ACGTTGACTG GGCTGAAAAG GTCTGGGACT GGATGTATGA TATAGGCTTG
CTTGATGTTG TGAACTCTAA CGACAAAGAC TACTGGTTTG TGTACGATGG TCTTACTATT
GATAACAACT GCTCGAACAT TACCAAATAC CAATGGACCT ATAACCAAGG ATTGATGTTG
TCTGGGTGTG CCTATTTGTA CAATTATACC GAGGACCAGA AATGGCTCGA CCGTACATTG
AATTTGTTGA ATGCATCGCA AGTGTTTTTC ATGAAAATTG TCAACGAAAC TGACAACATC
CAGGGTTCGG TCATGTACGA GGCAGCTTGC CAGCCTTCTA ACAACTGTAA CAATGATCAG
AGATCGTTCA AGGCGTACTT CTCGAGATTC TTAGGGATGA CTGCCGTCAT GGTACCTCAG
ACTTACGACG GTATCAGACA GTGGTTAGTA GATTCAGCCA ACGCTGCAGC TAAAAACTCA
TGCACTGGAG GTACTGACGG CCACACGTGT GGCTTGAACT GGTTTAACTC TACGGGCTGG
GACGGTTACT GGGGTCTTGG TGAACAGATC TCGGCTCTCG AAGTCATCCA GAACTTGAGG
GTACGTGATT TTCCTCCACC ATTGACCGCT AACACTGGTG GTAGTTCCAA GGGCAACCCA
GCTGCTGGTT ACTCCACCCT TCACACAATC ACTTCTCCTT TGGAGTTGGA AACCAAGGAT
CTTGCGGGTG CTGGTATCAT TACAGCCGTT GTGGGAGTTT CGTTGGTTGC CGCAGGTGTG
TGGCTTGTAA TTTAGACAGA AAACAAATAA TATGTATAAT CAATATGCTT TGAAGGT
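
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation. The following is a minimal sketch of that cleanup together with a GC-fraction calculation; the function names are hypothetical, and only the first line of the sequence is used as an excerpt.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used here as a short excerpt.
first_line = "ATGAGATTCC ATCTTCTCAC AGCATCAGTG TCGCTCTTCA TGGCTTGTGC TTCGGCCATG"
excerpt = clean_sequence(first_line)
print(len(excerpt), gc_fraction(excerpt))
```

Applying the same two functions to the full sequence yields a value that can be compared with the GC content field; how the database rounds that figure is not stated in the record.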
 
Protein sequence
MRFHLLTASV SLFMACASAM WLDTNNDTTI VEATNLIVDG VLDYYDGKNY GGVVGMFVWP 
YYWWEAGGVW GSLIDYTYFT QNDTLVPLIK EALLYQTGDD NNYVPLNQST TEGNDDQGFW
GIAVMGAAER NFSNPDDDSK AWLTLAQAVF NTMTARWDSA ECNGGLRWQI FQWNSGYDYK
NSVSNGCLFH IGARLARFTA NDTYVDWAEK VWDWMYDIGL LDVVNSNDKD YWFVYDGLTI
DNNCSNITKY QWTYNQGLML SGCAYLYNYT EDQKWLDRTL NLLNASQVFF MKIGSVMYEA
ACQPSNNCNN DQRSFKAYFS RFLGMTAVMV PQTYDGIRQW LVDSANAAAK NSCTGGTDGH
TCGLNWFNST GWDGYWGLGE QISALEVIQN LRVRDFPPPL TANTGGSSKG NPAAGYSTLH
TITSPLELET KDLAGAGIIT AVVGVSLVAA GVWLVI