Gene PICST_29666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_29666 
SymbolRDH1 
ID4837234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp816664 
End bp817680 
Gene Length1017 bp 
Protein Length338 aa 
Translation table12 
GC content44% 
IMG OID640388549 
Productshort-chain alcohol dehydrogenase 
Protein accessionXP_001382930 
Protein GI126132810 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.62197 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACCG CACCACCAAA TATACCATTT CCAGGTCTTC TCGGATGGAA GGCAATGATC 
AATGGGTATT TTCCCTCAAA TCCTTCCTAC ACCGAAGAGC AATACCCTAA ACTCGACGGG
AAGGTTGTAA TTGTCACCGG TGGAAATACC GGTGTTGGTT ATCAAACTGC TAAATCTTTA
GCTGGGTCTA CCAATGCTAA GGTCTACATA TTTGCCAGAA GTGAGGAAAA GGCATTGGCT
GCAATCAGGA GAATGGAACT TGAAGTTGCT CAGGAGTACA ACAAAAGTAG TATTAATGTT
CACTTCATCA AACTTGACTT GGGTGATTTG ACCACGATCA AAGCTTCTGC TGACGAATTC
CTCTCCAAGG AAAATAGATT GGACATTGTA ATTCACAATG CTGGTGTCAT GGGCACTTCA
GTAGGTTCTA AGACAGTTCA GGGGGTTGAA TTACAGTTGG GTACAAACTG CTTTGGACCA
CACTTGTTGC AGAAGTATTT CGATCCACTT GTCATCGAAA CTTCGAAGAC CAACAAACCA
TACGAGTCTC GTATAGTGTG GGTGGCGTCT TCGGCACACT TCCAGTCTCC AGAAAGAGGA
ATTCACTACG CTGACCCAAA CTTTGTGGAC ACTCCACACT TACCAAGAGT ACTCTACTGC
CAAAGTAAAG CGGTCAACAT CATGCAAGCC ATTGCATGGC CCAAGAATCA TCCTGGTGCG
GATAGAGTGT TGTCCGTTTC CTTGTGCCCA GGCTTCCTCA ACACTGATAT TCAACGACAT
GCTAGTGGTA TTTGGAAATG GTTCATACCC TGGGTCCTTC ATGATCCCAG ATATGGTGCA
TACACCGAAT TATATGCTGC ATTGAACCCT GAGTTGAAAA ACCAAGGTGA ATACTATCAA
TCCTTCGGTA GATTAGGTGA TATCAGACCA GATATCAAGC TCGAGGAAAA TGTAGATAAA
GCTTGGGCAT ACTGTGAAGA GCAGGTGAAG GCATACTACA AAACAGTTAT AAAATAG
 
Protein sequence
MSTAPPNIPF PGLLGWKAMI NGYFPSNPSY TEEQYPKLDG KVVIVTGGNT GVGYQTAKSL 
AGSTNAKVYI FARSEEKALA AIRRMELEVA QEYNKSSINV HFIKLDLGDL TTIKASADEF
LSKENRLDIV IHNAGVMGTS VGSKTVQGVE LQLGTNCFGP HLLQKYFDPL VIETSKTNKP
YESRIVWVAS SAHFQSPERG IHYADPNFVD TPHLPRVLYC QSKAVNIMQA IAWPKNHPGA
DRVLSVSLCP GFLNTDIQRH ASGIWKWFIP WVLHDPRYGA YTELYAALNP ELKNQGEYYQ
SFGRLGDIRP DIKLEENVDK AWAYCEEQVK AYYKTVIK