Gene PICST_81469 details


Gene Information

Locus tag: PICST_81469
Symbol: RDH2
ID: 4837133
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 819796
End bp: 820800
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 12
GC content: 44%
IMG OID: 640388448
Product: short-chain alcohol dehydrogenase retinol dehydrogenase Protochlorophyllide reductase
Protein accession: XP_001382931
Protein GI: 126132812
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
              [R] General function prediction only
COG ID: [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
TIGRFAM ID:
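
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive, hence the +1.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the last codon is a stop codon and encodes no residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the record: Start bp 819796, End bp 820800.
print(cds_lengths(819796, 820800))  # (1005, 334)
```

Under these assumptions the result agrees with the listed Gene Length (1005 bp) and Protein Length (334 aa).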


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.761909
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.570556
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTCACG AAAAGCCCAA AGTTCCTTGT ATGACCTGGG GCGCATTCAT GCAAATCACC 
GATAACTTCT GGCCAGGTCC ACCTACTCTT ACAGAAAAAG ACTATCCCTC TTTGACTGGG
AAGGTTGTAA TTGTCACCGG TGGAAATACC GGTGTTGGTT ATCAAACTGC TAAATCTTTA
GCTGGGTCTA CCAATGCTAA GGTCTACATA TTTGCCAGAA GTGAGGAAAA GGCATTGGCT
GCAATCAGGA GAATGGAACT TGAGGTCGCC CAGGAGTACA ATAAAAACAA AATTGATGTT
CACTTCATCA AGCTTGACTT GGGTGATTTG ACCACAATCA AAGCCTCTGC CGACGAATTC
CTCTCCAAGG AAGATAGATT GGATATTATC ATCCATAATG CTGGTGTCAT GACCCCACCA
AAGGGTTCCA AGACAGCACA AGGTTTTGAA TTACAACTAG GCACGAATGC CATAGGACCA
CATTTGTTTC AGAAATTCTT GGACCCATTA TTCATTAAGA CGTCTAAGTC CAACAAGCCT
GGAGAATCCA GAGTTGTATG GGTTGCATCT TCCGGACACT TCTTTTCTCC CGAAGGAGGA
ATCTTCTATC CAGATCCCAA TTTCAGAAAC ACCAACTTCC CATCCATGCG GATTTACGGA
CAAAGCAAGG CTTGTAATGT CATGCAATCA GTTGAATGGC CCAAACACCA TCCAGAAGCA
ACTAACGTTA TCAGTCTCAA TTTATGCCCC GGCGCCTTGA AGACAGATTT ACAAAGACAC
ACAGGCACCG CGGGCCGCAT CATGTCCGGC TTGTTACATG ATGCTAGAAA AGGTGCTTAC
ACTGAACTCT TTGCAGCCTT ATCTCCCTCC ATCACAGTCA AGGACCAAGG CATTCATGTT
ATTTCCTTTG GAAAGATTGG CTTCAACAGA AAGGATCTTA AAGATCCAGC TAATACTTCT
AAGGCTTGGG ACTTCTTGGA CAAACAAGTT GAAAAGTATT TGTAA
 
Protein sequence
MSHEKPKVPC MTWGAFMQIT DNFWPGPPTL TEKDYPSLTG KVVIVTGGNT GVGYQTAKSL 
AGSTNAKVYI FARSEEKALA AIRRMELEVA QEYNKNKIDV HFIKLDLGDL TTIKASADEF
LSKEDRLDII IHNAGVMTPP KGSKTAQGFE LQLGTNAIGP HLFQKFLDPL FIKTSKSNKP
GESRVVWVAS SGHFFSPEGG IFYPDPNFRN TNFPSMRIYG QSKACNVMQS VEWPKHHPEA
TNVISLNLCP GALKTDLQRH TGTAGRIMSG LLHDARKGAY TELFAALSPS ITVKDQGIHV
ISFGKIGFNR KDLKDPANTS KAWDFLDKQV EKYL
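
The record lists translation table 12 (the alternative yeast nuclear code), which differs from the standard code in that CTG is read as serine rather than leucine. The following is a minimal sketch of how the gene sequence could be translated under that table and compared with the protein sequence. The helper names `codon_table_12` and `translate` are hypothetical; the sketch assumes the input is an in-frame, forward-strand CDS and ignores whitespace in the 10-base display blocks.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table_12() -> dict[str, str]:
    """Build the codon table for translation table 12."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AAS))
    table["CTG"] = "S"  # the one change from the standard code: CTG -> Ser
    return table


def translate(dna: str, table: dict[str, str]) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGTCTCACG AAAAGCCCAA AGTTCCTTGT", codon_table_12()))  # MSHEKPKVPC
```

The printed residues match the first block of the protein sequence above. Passing the full gene sequence to `translate` would allow the same comparison for the entire protein.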