Gene PICST_86346 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_86346 
SymbolSOU2 
ID4850854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp236339 
End bp237348 
Gene Length1010 bp 
Protein Length285 aa 
Translation table 
GC content47% 
IMG OID640392562 
Productperoxisomal 2,4- dienoyl-CoA reductase and sorbitol utilization protein 
Protein accessionXP_001387285 
Protein GI126273724 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CACAAATCCA CCTTATATAA ACATAATGAC TGTCGAAACC GCCACCGCTC CACAATCCAT 
GTGCAACACC GACATTGGCT CGTTGCCAGC TGCCGACCCA GTATTGCCAA CTAACGTTCT
TGACTTCTTC AAATTGGACG GCAAGACTGC TGCCATCACC GGAGGTGCCA GAGGTATTGG
TTACGCTATT TCCGAAGCTT ACTTGCAAGC TGGTATTTCC AAGTTGGCTA TCATTGACTA
CGCCCCAAAC GAAGCTGCCC TCGATGAATT GAGATCCAGA TTCCTCAAGA GCACGATTGT
CTACCACAAC TGTGACGTCA GAAAGGCCGA TCAGGTCAAG TCTGTCATTG ACAAGATCGA
AGAAGAATTC AAGGTTATCG ACATCTTCGT TGCCAACGCT GGTATCGCCT GGACTTCCGG
TCCTATGATT GACCAAGAAA CCGATGATGA CTGGCACAAC GTCATGAACG TCGACTTGAA
CGGTGTCTAC TACTGTGCCA AGAACATCGG TAAGATTTTC CGTAAGCAAG GTAAGGGTTC
GCTTGTCATG ACTGCCTCGA TGTCTGCCCA CATTGTCAAT GTTCCACAAT TGCAAGCTGC
TTACAACGCT GCTAAGGCTG GTGTTTTGCA CTTGGGTAAG TCTTTGGCTG TTGAATGGGC
TCCATTTGCC AGAGTCAACA CCGTTTCTCC AGGATACATT TCCACCGAGT TGTCTGACTT
TGTTCCAACC GAAATGAAGA ACAAGTGGTA CGCCTTGACT CCACAGGGCA GACAAGGTGC
TCCACGTGAA TTGTGTGGTG CCTACTTGTA CTTGGCTTCG GACGCTTCCA CTTACACCAC
TGGTTCTGAC ATCAGAGTCG ACGGTGGTTA CTGTTCTGTC TAGTTAGACA GACACTAGGT
AGAGTCATTG GACTCTGCCG TTTTGCATAT TAAAAGATTT GATTTCTTCA GAGGAGCTTA
ATAAACTATA TATGATATAC ATGCTCTGTA TAATAGATAA CGATAATGTT
 
Protein sequence
MTVETATAPQ SMCNTDIGSL PAADPVLPTN VLDFFKLDGK TAAITGGARG IGYAISEAYL 
QAGISKLAII DYAPNEAALD ELRSRFLKST IVYHNCDVRK ADQVKSVIDK IEEEFKVIDI
FVANAGIAWT SGPMIDQETD DDWHNVMNVD LNGVYYCAKN IGKIFRKQGK GSLVMTASMS
AHIVNVPQLQ AAYNAAKAGV LHLGKSLAVE WAPFARVNTV SPGYISTELS DFVPTEMKNK
WYALTPQGRQ GAPRELCGAY LYLASDASTY TTGSDIRVDG GYCSV