Gene PICST_79198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_79198 
SymbolYBO9 
ID4839816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp1154099 
End bp1155272 
Gene Length1174 bp 
Protein Length346 aa 
Translation table12 
GC content44% 
IMG OID640391131 
Productbeta-hydroxysteroid dehydrogenase type 3 
Protein accessionXP_001385917 
Protein GI126138788 
COG category[R] General function prediction only 
COG ID[COG0300] Short-chain dehydrogenases of various substrate specificities 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.103283 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AATTGAACAA TGTCTGTAGT AGATTTTATT CAAGCCATTA CCGAAAACAA ATTCGGCGAG 
TATGTTCTCC TTGGAGCATT GCTCGTTGGA GTTTTCAAGC TTACTGTGTT CATTCTCAGT
GTGACTTCGC TTTTGGTTGA TTTGTTCGTC TTGCCAGCCA CAAACTTGAA GACTTACGGT
GCCAAAAAAG GCAAGTGGGC TGTTATAACT GGTGCCTCTG ATGGAATTGG AAAGGAGTAT
GCCTTCCAAT TGGCCTCCAA AGGATTCAAT GTAGTTTTGG TATCGAGAAC CCAAGCCAAG
TTGGAAACTC TTGCTTCTGA GATCGAAGCC AAGTACAAGG TGGAAACCAA AGTAGTAGCA
TTTGATGCTT CTACGGACGC TGAAGACAAC TACAAGTCTC TAGGTGATGC TATTTCCGGT
TTGCCTGTAA CTGTTTTGAT CAACAATGTT GGCCAATCGC ATTCGATTCC CGTTCCATTC
TTGGAAACTG AAAACAAGGA ATTGCAAGAT ATTATCACAA TCAACGTCAC AGCCACTTTG
AAGATCACCC AAACTGTAGC TCCAGTGATT GCCGAAACTG TTTCCAAGGA AAAGAAGAAG
GTCAGAGGTT TGATATTGAC TATGGGCTCT TTTGGTGGTT TGTTACCCAC TCCATATTTG
GCTACTTACT CTGGTTCCAA GTCGTTTTTG CAAGCTTGGT CTGCTGCTTT GGCTGGAGAG
TTGCAATCTC AAGGTGTGGA TGTGGAATTG GTTATTTCGT ACTTGGTCAC TTCTGCCATG
TCGAAGATCA GAAGAGCCTC TTTGTCGATT CCTAGCCCTA AGAACTTTGT CAGAGCCACT
TTAAACGGCA TTGGACGTCG CAACGGTGCG CAGGAACGTT ATGCAACTAG CACTCCTTAC
TGGGCCCATG CCTTGATGCA TTTCGGCATT GACCAGACTG TAGGTGTCTA CTCCAAGCTT
GCCAACAGTC TTAACTTGAA CATGCACAAG AGCATCCGTG CCAGAGCCTT GAAGAAGGCT
GCCCGTTTGG CTGCGGAAAA GAAAGATTAG ATGGAATTAT CCACTAGTAT AATTTCTATG
TAGATCCTAC TATCTAGTTG ACTGCAGTGA TTATATACTG AAAGAGCAAA CGTACAATTG
TGCAAATATA TGGATACCAT TGAGTATACT TTGG
 
Protein sequence
MSVVDFIQAI TENKFGEYVL LGALLVGVFK LTVFILSVTS LLVDLFVLPA TNLKTYGAKK 
GKWAVITGAS DGIGKEYAFQ LASKGFNVVL VSRTQAKLET LASEIEAKYK VETKVVAFDA
STDAEDNYKS LGDAISGLPV TVLINNVGQS HSIPVPFLET ENKELQDIIT INVTATLKIT
QTVAPVIAET VSKEKKKVRG LILTMGSFGG LLPTPYLATY SGSKSFLQAW SAALAGELQS
QGVDVELVIS YLVTSAMSKI RRASLSIPSP KNFVRATLNG IGRRNGAQER YATSTPYWAH
ALMHFGIDQT VGVYSKLANS LNLNMHKSIR ARALKKAARL AAEKKD