Gene PICST_80053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_80053 
SymbolYIM4 
ID4851311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp1470488 
End bp1471406 
Gene Length919 bp 
Protein Length269 aa 
Translation table 
GC content43% 
IMG OID640393019 
ProductNADP(+)-dependent dehydrogenase acts on serine, L-allo-threonine, and other 3-hydroxy acids 
Protein accessionXP_001387516 
Protein GI126274328 
COG category[R] General function prediction only 
COG ID[COG4221] Short-chain alcohol dehydrogenase of unknown specificity 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CTTTAGTTGT TCATTAAGGG ACGCAATTAG TACAGAATTT CTTTGCACCG TCATGTCGTT 
TGGAAAAAAA GCTGCTGAAA GACTTGCCAA CAAAATCATT CTTATCACCG GGGCTTCGTC
TGGTATTGGT GAAGCTACAG CTAGAGAGTT TGCATCTGCT GCCAATGGGA ATATCAGATT
GATTTTGACA GCCAGAAGAA AAGAAAAGTT GGCTCAATTG TCAGACTCAT TGACCAAGGA
ATTTCCAACT ATCAAAATCC ATTCTGCCAA ATTGGATGTG ACCGAACATG ATGGCATCAA
GCCTTTCATT TCTGGTTTAC CCAAGGATTT CGCCGACATC GATGTGTTGA TCAACAATGC
TGGAAAAGCT CTTGGAAAAG CATCTGTTGG TGAAATCAGT GACAGTGATA TCCAAGGCAT
GATGCAAACG AATGTCTTGG GACTCATCAA CATGACTCAG GCTGTGATTC CCATTTTTAA
GGCTAAAAAT TCTGGAGATA TCGTCAACAT CGGTTCGATT GCTGGAAGAG ACCCTTACCC
TGGTGGATCG ATCTACTGTG CCTCCAAGGC TGCTGTTAAG TTCTTCTCGC ATTCTTTGAG
AAAGGAACTC ATTAACACCA GAATCAGAGT TTTGGAAGTT GATCCAGGTG CTGTGTTGAC
CGAGTTCTCT TTGGTTCGTT TCCACGGTGA TCAGGGAGCT GCTGATGCTG TTTATGAAGG
TACCCAACCT TTGGATGCCT CTGATATCGC AGAAGTTATC GTGTTTGGTA TCACCAGAAA
GCAGAACACC GTCATAGCCG AAACCTTGGT ATTCCCAAGT CACCAGGCTT CTGCCTCTCA
TGTTTACAAG GCTCCTAAGT AGACAATTTC ATATTAGTAT TTTTGATATA TTTATATCTA
AAGAATGGAC CGTATCTAG
 
Protein sequence
MSFGKKAAER LANKIILITG ASSGIGEATA REFASAANGN IRLILTARRK EKLAQLSDSL 
TKEFPTIKIH SAKLDVTEHD GIKPFISGLP KDFADIDVLI NNAGKALGKA SVGEISDSDI
QGMMQTNVLG LINMTQAVIP IFKAKNSGDI VNIGSIAGRD PYPGGSIYCA SKAAVKFFSH
SLRKELINTR IRVLEVDPGA VLTEFSLVRF HGDQGAADAV YEGTQPLDAS DIAEVIVFGI
TRKQNTVIAE TLVFPSHQAS ASHVYKAPK