Gene PICST_89701 details


Gene Information

Locus tag: PICST_89701
Symbol: SOR1
ID: 4839191
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 30570
End bp: 31958
Gene Length: 1389 bp
Protein Length: 385 aa
Translation table: 12
GC content: 43%
IMG OID: 640390506
Product: polyol dehydrogenase
Protein accession: XP_001385027
Protein GI: 150865700
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
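
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single stop codon at the end of an unspliced coding region; neither assumption is stated in the record, and the function names are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length_bp(protein_aa: int) -> int:
    """Coding length in bp, assuming one stop codon and no introns."""
    return 3 * (protein_aa + 1)


gene_bp = span_length(30570, 31958)   # Start bp and End bp from the record
cds_bp = coding_length_bp(385)        # Protein Length from the record

print(gene_bp)            # 1389, matching the Gene Length field
print(cds_bp)             # 1158
print(gene_bp - cds_bp)   # 231
```

Under these assumptions the 385 aa protein accounts for 1158 bp of the 1389 bp span, so the remaining 231 bp would be sequence outside the coding region.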


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ACTAAACGTA CAAAGAAATT TTCCAAAGAT TCATATATTT AGACTACCGA ATCGCCGATA 
CTAGCCTACC ATCCATAGAT AAAATCCTAT TTTGGTTCCT AAATAAGACG AATAAACTAC
AATTACAAAC TATCTTTACT TTCCATCTAA CTCACCTATT TAAACACAAT GAAGGCCATA
GTTTACCATG ACAGGGAAGA TGTCCGCTAT CACCTGGACT TCCCTGAGCC GCAGATTGTC
AGGCCTGACG ATGTCAAGAT CAAGGTCCAT TACTGTGGGA TCTGTGGTTC TGACTTGAAG
GAGTATCTCG ATGGACCGAT TTTTTTCTCC AAGAAGGGTA CCAACAACGA AGTTTCCAAC
TTGCCATATC CGCAATGCAT GGGTCACGAG ATCAGTGGTG AAGTTTACGA GGTCGGATCT
GAAGTTGACA ACCTTCAGAT TGGAGACAAA GTGGTGGTGG AAGTCACAGG TACCTGTTTT
GACAGATACC GGTTCCCCGA ATCGCCCAAC TTCAACAAAC CCAAGTGTGG AAGTTGTCTC
GAAGGTCACT ATAATGCCTG TGCATATCTT GGGTTGACTG GCTTAGGTTT TACCAACGGA
GGGTGTTCTG AATATTTGGT TACAGCTGCT AGCAAAGCCA TCAAGTTTCA AGAAGATATC
ATTCCGATGG ATGTGGCTGC TGTAATCCAG CCAATTGCTG TCAGTTGGCA CGCGGTTCGC
GTGTCTCACT TCCAAGAAGG TCAAAGTGCT TTGATCTTGG GAGGTGGCCC CATTGGGTTG
ACTACCATTT TTGCATTGAA GGGAAACAAA GCTGGCAAGA TAGTCGTTTC TGAGCCTGCT
TTGGCCAGAC GACAATTGGC AGAAAAGTTG GGCGTCACAG TCTTTGATCC TACCGGCAAA
TCTGTTGATG AATGTGTAGA GGAGTTGAGG AAGTTGTCAC CCAACGGCTA TGGCTTCAAT
CATTCATATG ATTGTTCCGG TGTGCCAGCT ACTTTCCAAA CCAGTCTCAG GGCATTGAAT
ATTAGAGGAA CGGCTACTAA TGTTGCCGTG TGGGCTCACA AATCCGTTCC ATACTTTCCT
ATGGAAGCCA CCTGGGCCGA AAAGATCATC ACCGGATCAA TTTGCTTTGT CAAGGACGAT
TTTATAGATG TCGTCAATGC TCTTCACGAA GGCACTATTC CAGTTGACGA AGTCAAGTTA
TTGATCACTT CCAAGATTCA TCTTGAGGAT GGAGTAGAGA AGGGCTTTTT AGAATTGATT
CACCACAAGG AAAAGCATAT AAAGATCTTG TTTTCTCCTA AGGAAGAATA TAGAGTAAAG
AAGTAATAGA CAAGTGTACT ATATAAGCAT CACTATTTAC TAGAATACAT AGAGCAACGC
ATGATTACT
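
The GC content field can be recomputed from a sequence in the space-separated layout used above. This is a minimal sketch; `gc_content` is a hypothetical helper, and whether the database computes its figure over exactly this span is an assumption.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First two 10-base groups of the gene sequence, as a small worked example.
print(gc_content("ACTAAACGTA CAAAGAAATT"))  # 25.0
```

Passing the full gene sequence text to `gc_content` gives the value to compare against the GC content field.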
 
Protein sequence
MKAIVYHDRE DVRYHSDFPE PQIVRPDDVK IKVHYCGICG SDLKEYLDGP IFFSKKGTNN 
EVSNLPYPQC MGHEISGEVY EVGSEVDNLQ IGDKVVVEVT GTCFDRYRFP ESPNFNKPKC
GSCLEGHYNA CAYLGLTGLG FTNGGCSEYL VTAASKAIKF QEDIIPMDVA AVIQPIAVSW
HAVRVSHFQE GQSALILGGG PIGLTTIFAL KGNKAGKIVV SEPALARRQL AEKLGVTVFD
PTGKSVDECV EELRKLSPNG YGFNHSYDCS GVPATFQTSL RALNIRGTAT NVAVWAHKSV
PYFPMEATWA EKIITGSICF VKDDFIDVVN ALHEGTIPVD EVKLLITSKI HLEDGVEKGF
LELIHHKEKH IKILFSPKEE YRVKK
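
The protein sequence follows from the gene sequence under translation table 12 (the alternative yeast nuclear code), which differs from the standard code in its amino acid assignments only in that CTG encodes Ser instead of Leu. The sketch below builds that table from the compact NCBI-style string and translates the start of the coding region, which appears to begin at the ATG at position 169 of the gene sequence listed above. The names `CODON_TABLE_12` and `translate_table12` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order under translation table 12.
# Same as the standard code except CTG -> S (index 19).
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate_table12(dna: str) -> str:
    """Translate in frame 1 until a stop codon or the end of the input."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE_12[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 codons of the coding region, copied from the gene sequence.
cds_start = "ATGAAGGCCATAGTTTACCATGACAGGGAAGATGTCCGCTATCACCTGGACTTC"
print(translate_table12(cds_start))  # MKAIVYHDREDVRYHSDF
```

The sixteenth codon is CTG, which yields the S in `VRYHSDF`; under the standard code the same codon would give L.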