Gene PICST_89614 details

Gene Information

Locus tag: PICST_89614
Symbol: XYL1
ID: 4839234
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 898459
End bp: 899484
Gene Length: 1026 bp
Protein Length: 318 aa
Translation table: 12
GC content: 45%
IMG OID: 640390549
Product: NAD(P)H-dependent D-xylose reductase (XR)
Protein accession: XP_001385181
Protein GI: 126137315
COG category: [R] General function prediction only
COG ID: [COG0656] Aldo/keto reductases, related to diketogulonate reductase
TIGRFAM ID:
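
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (the record does not state its convention) and that the protein length excludes the stop codon. The function names are hypothetical, chosen only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon (assumed)."""
    return (protein_aa + 1) * 3


gene_len = span_length(898459, 899484)  # Start bp / End bp from the record
cds_len = coding_length(318)            # Protein Length from the record

print(gene_len)            # 1026
print(cds_len)             # 957
print(gene_len - cds_len)  # 69
```

Under these assumptions the span agrees with the listed gene length of 1026 bp. The 69 bp not accounted for by the coding region is consistent with the listed gene sequence carrying flanking bases on either side of the ATG...TAA reading frame.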


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TACAACTATA CTACAATGCC TTCTATTAAG TTGAACTCTG GTTACGACAT GCCAGCCGTC 
GGTTTCGGCT GTTGGAAAGT CGACGTCGAC ACCTGTTCTG AACAGATCTA CCGTGCTATC
AAGACCGGTT ACAGATTGTT CGACGGTGCC GAAGATTACG CCAACGAAAA GTTAGTTGGT
GCCGGTGTCA AGAAGGCCAT TGACGAAGGT ATCGTCAAGC GTGAAGACTT GTTCCTTACC
TCCAAGTTGT GGAACAACTA CCACCACCCA GACAACGTCG AAAAGGCCTT GAACAGAACC
CTTTCTGACT TGCAAGTTGA CTACGTTGAC TTGTTCTTGA TCCACTTCCC AGTCACCTTC
AAGTTCGTTC CATTAGAAGA AAAGTACCCA CCAGGATTCT ACTGTGGTAA GGGTGACAAC
TTCGACTACG AAGATGTTCC AATTTTAGAG ACCTGGAAGG CTCTTGAAAA GTTGGTCAAG
GCCGGTAAGA TCAGATCTAT CGGTGTTTCT AACTTCCCAG GTGCTTTGCT CTTGGACTTG
TTGAGAGGTG CTACCATCAA GCCATCTGTC TTGCAAGTTG AACACCACCC ATACTTGCAA
CAACCAAGAT TGATCGAATT CGCTCAATCC CGTGGTATTG CTGTCACCGC TTACTCTTCG
TTCGGTCCTC AATCTTTCGT TGAATTGAAC CAAGGTAGAG CTTTGAACAC TTCTCCATTG
TTCGAGAACG AAACTATCAA GGCTATCGCT GCTAAGCACG GTAAGTCTCC AGCTCAAGTC
TTGTTGAGAT GGTCTTCCCA AAGAGGCATT GCCATCATTC CAAAGTCCAA CACTGTCCCA
AGATTGTTGG AAAACAAGGA CGTCAACAGC TTCGACTTGG ACGAACAAGA TTTCGCTGAC
ATTGCCAAGT TGGACATCAA CTTGAGATTC AACGACCCAT GGGACTGGGA CAAGATTCCT
ATCTTCGTCT AAGAAGGTTG CTTTATAGAG AGGAAATAAA ACCTAATATA CATTGATTGT
ACATTT
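
A GC content figure like the one listed under Gene Information is the share of G and C bases in a sequence. The sketch below shows one way to strip the 10-base block layout used above and count G and C. It is illustrative only: the helper names are hypothetical, only the first line of the gene sequence is used as input, and the record does not say which region its own 45% value was computed over.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_count(seq: str) -> int:
    """Number of G and C bases in the sequence."""
    return sum(1 for base in seq if base in "GC")


def gc_percent(seq: str) -> float:
    """GC content as a percentage of sequence length (0.0 if empty)."""
    return 100.0 * gc_count(seq) / len(seq) if seq else 0.0


# First line of the gene sequence, copied from the record.
first_line = "TACAACTATA CTACAATGCC TTCTATTAAG TTGAACTCTG GTTACGACAT GCCAGCCGTC"
seq = clean_sequence(first_line)

print(len(seq), gc_count(seq))  # 60 25
print(round(gc_percent(seq), 1))
```

Passing the full 1026-base sequence through `clean_sequence` and `gc_percent` would give the value for the whole listed span.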
 
Protein sequence
MPSIKLNSGY DMPAVGFGCW KVDVDTCSEQ IYRAIKTGYR LFDGAEDYAN EKLVGAGVKK 
AIDEGIVKRE DLFLTSKLWN NYHHPDNVEK ALNRTLSDLQ VDYVDLFLIH FPVTFKFVPL
EEKYPPGFYC GKGDNFDYED VPILETWKAL EKLVKAGKIR SIGVSNFPGA LLLDLLRGAT
IKPSVLQVEH HPYLQQPRLI EFAQSRGIAV TAYSSFGPQS FVELNQGRAL NTSPLFENET
IKAIAAKHGK SPAQVLLRWS SQRGIAIIPK SNTVPRLLEN KDVNSFDLDE QDFADIAKLD
INLRFNDPWD WDKIPIFV
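
The record lists translation table 12, which in NCBI's numbering is the alternative yeast nuclear code: it differs from the standard code in reading CTG as serine rather than leucine. The sketch below builds both tables from the standard amino-acid string and translates the first line of the gene sequence from its first ATG. The helper names are hypothetical, and choosing the first ATG as the start is an assumption that happens to reproduce the start of the listed protein.

```python
BASES = "TCAG"
# NCBI translation table 1 (standard code), amino acids in TCAG codon order.
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(aas: str) -> dict:
    """Map each of the 64 codons to its amino acid letter."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, aas))


TABLE_1 = build_table(STANDARD_AAS)
# Table 12 (alternative yeast nuclear code): CTG encodes Ser instead of Leu.
TABLE_12 = dict(TABLE_1, CTG="S")


def translate(seq: str, table: dict, start: int = 0) -> str:
    """Translate codon by codon from `start`, stopping at the first stop codon."""
    protein = []
    for i in range(start, len(seq) - 2, 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record, blocks joined.
seq = "TACAACTATACTACAATGCCTTCTATTAAGTTGAACTCTGGTTACGACATGCCAGCCGTC"
start = seq.find("ATG")  # assumed start codon: first ATG in the listed sequence

print(start)                             # 15
print(translate(seq, TABLE_12, start))   # MPSIKLNSGYDMPAV
print(TABLE_1["CTG"], TABLE_12["CTG"])   # L S
```

The translated prefix matches the first 15 residues of the protein sequence above. For this short fragment the two tables give the same result; they diverge only where an in-frame CTG codon occurs.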