Gene PICST_36326 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_36326 
SymbolSOR4 
ID4839583 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp910736 
End bp911848 
Gene Length1113 bp 
Protein Length370 aa 
Translation table12 
GC content45% 
IMG OID640390898 
Productsorbitol dehydrogenase 
Protein accessionXP_001385184 
Protein GI150865814 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.218842 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTATTC CAGAAACAAT GAAGGCAATT GTGTTCCATG GGCCAGGGGA CATCCGCACA 
GAGACCAGGC CAACTCCAGT TATCGAAGAG CCATCGGATG TGATCTTAAA GGTGAAGTTC
TCTGGATTAT GCGGTACTGA TTTGCACAGC TACAGAGGCC ATATCAAGGG TCCAATTGGA
ACTATTATTG GCCATGAGTT TGTAGGAGAA GTTGTAGCTC GTGGAGCAGA CATATCCGAT
TCGGTCTTTA CTATTGGCGA AGATGTTCTT AGCACATTCA CGATTCAATG TGGACAATGC
TGGTATTGTA AGCATGGATA CTCCGGTCAA TGCGATGTAA CCAATACATT TGGTAAGGTT
GGATTAAATG GTGGACAATC TGAGTATGTA AGGGTTCCGT TTGCCAAATC TACATTGGTA
AAAAAGCCAC AGAACAACGA CGGCATTGAT GATTCTGTGT ATGTACTCAT GGCTGATATC
TTTATCACTG GGTATTACGG TGTTAAAAAG ATCCGCGATT TCTTGAGCAC AAAGCCAGCA
GTGGGTATTG AAGCGCAGGA GTTTAAGGAC GTGACCATCT TGCAATTGGG GGTAGGTCCC
GTTGGATTGT GTGCTTTGAG AGTGTTGAAG CACTTCGGAT TTACGAAAGT GGTGTGCGTG
GACAGTGTTC CTTCGAGATT GGAAGAGGCA AAGAGGCTAG GAGCATACAA GGCGATCAAT
TTCGAGACTG ACAAGACTCT GCTTGAAGAA TTTATCAGAA ACGAGACTGG TAATGTGGGA
TTTGATGCAG TATTGGAAGT AGTGGGAGCA TCTTCAGCTG TGAAGACAGC CTACGACTCT
GTTCGTCGTA ACGGGTTCAT TTCCTCGTTG GGAATGGGTC ACGAGCCTCT TCCGTTCAAC
GGCCTCGATT GCTACTTAAA GAACATCAAT ATCTCCTTTG GAAGATGTCA CTGCTGGTCT
TTGTTCCCTG AAGCCTTAGA AGTGTTCGAA TCTATGAAGG CAGACTTTGC TAGTTTTATA
GACTACACAA CTGGACTCGA TGGGGCTAAA GAGGCATTTG AGTTGTTTGA CAAGCATAAG
GTGAACAAGG TCGTGTTTGA CTTGACGAAG TAG
 
Protein sequence
MSIPETMKAI VFHGPGDIRT ETRPTPVIEE PSDVILKVKF SGLCGTDLHS YRGHIKGPIG 
TIIGHEFVGE VVARGADISD SVFTIGEDVL STFTIQCGQC WYCKHGYSGQ CDVTNTFGKV
GLNGGQSEYV RVPFAKSTLV KKPQNNDGID DSVYVLMADI FITGYYGVKK IRDFLSTKPA
VGIEAQEFKD VTILQLGVGP VGLCALRVLK HFGFTKVVCV DSVPSRLEEA KRLGAYKAIN
FETDKTSLEE FIRNETGNVG FDAVLEVVGA SSAVKTAYDS VRRNGFISSL GMGHEPLPFN
GLDCYLKNIN ISFGRCHCWS LFPEALEVFE SMKADFASFI DYTTGLDGAK EAFELFDKHK
VNKVVFDLTK