Gene PICST_38255 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_38255 
SymbolSOU1 
ID4850856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp241361 
End bp242209 
Gene Length849 bp 
Protein Length282 aa 
Translation table 
GC content48% 
IMG OID640392564 
Productperoxisomal 2,4- dienoyl-CoA reductase, and sorbitol utilization protein 
Protein accessionXP_001387287 
Protein GI126273732 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAACA ACCCGAGCAT CACCTCTCAT ATCAACGCTG CCGTGGGTCC TCTCCCTACT 
AAGGCTCCCA AACTTGCGTC TAACGTGTTG GATTTATTTT CTCTCAAGGG TAAGGTAGCT
TCCATTACTG GCTCTTCAGC CGGGATAGGT CTTGCTGTAG CCGAGGCATA CGCTCAGGCT
GGTGCTGATG TTGCAATCTG GTACAACTCC CAGCCTGCTA AGGAAAAGGC CGACAAGATA
GCCAAGACGT ATGGTGTACG TTGCAGAGCT TATAAGTGTA ATGTTTCAGA TCAGCAGGAT
GTTGAAACCA CTGTGGCTCA GATCGAGGCT GACTTTGGCA CCATAGATAT CTTTGTTGCC
AATGCCGGTG TTCCCTGGAC TGAAGGCGAA AGTGTAGAAA TTGACAACTT TGACTCCTGG
AAAAAGGTCA TAGACTTAGA CTTGTCTGGG GCTTACTACT GTGCACATGC GGCTGGTAAG
ATCTTTAAGA AAAACGGCAA GGGCTCCATG ATTTTCACCG CTTCTATGTC TGGTCACATT
GTGAATATTC CTCAATTCCA GGCTCCTTAC AACGCTGCCA AGGCTGCGGT GTTGCACTTG
AGCAAATCGT TGGCTATAGA ATGGGCTCCT TTTGCCAGAG TCAATACGAT TTCGCCAGGA
TACATTGTCA CCGAGATCTC GGACTTTGTC TCAGACGACA TCAAGTCCAA GTGGTGGCAG
TTTATTCCTC TTGGTAGAGA GGGAGTCACA CAAGAGTTGG TTGGTGCCTA CTTGTACTTT
GCTTCTGATG CCTCTACATA TACTACGGGA TCAGATCTTA TCGTCGATGG AGGCTACTGT
GCGCCATAG
 
Protein sequence
MTNNPSITSH INAAVGPLPT KAPKLASNVL DLFSLKGKVA SITGSSAGIG LAVAEAYAQA 
GADVAIWYNS QPAKEKADKI AKTYGVRCRA YKCNVSDQQD VETTVAQIEA DFGTIDIFVA
NAGVPWTEGE SVEIDNFDSW KKVIDLDLSG AYYCAHAAGK IFKKNGKGSM IFTASMSGHI
VNIPQFQAPY NAAKAAVLHL SKSLAIEWAP FARVNTISPG YIVTEISDFV SDDIKSKWWQ
FIPLGREGVT QELVGAYLYF ASDASTYTTG SDLIVDGGYC AP