Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_86346 |
Symbol | SOU2 |
ID | 4850854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 236339 |
End bp | 237348 |
Gene Length | 1010 bp |
Protein Length | 285 aa |
Translation table | |
GC content | 47% |
IMG OID | 640392562 |
Product | peroxisomal 2,4- dienoyl-CoA reductase and sorbitol utilization protein |
Protein accession | XP_001387285 |
Protein GI | 126273724 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CACAAATCCA CCTTATATAA ACATAATGAC TGTCGAAACC GCCACCGCTC CACAATCCAT GTGCAACACC GACATTGGCT CGTTGCCAGC TGCCGACCCA GTATTGCCAA CTAACGTTCT TGACTTCTTC AAATTGGACG GCAAGACTGC TGCCATCACC GGAGGTGCCA GAGGTATTGG TTACGCTATT TCCGAAGCTT ACTTGCAAGC TGGTATTTCC AAGTTGGCTA TCATTGACTA CGCCCCAAAC GAAGCTGCCC TCGATGAATT GAGATCCAGA TTCCTCAAGA GCACGATTGT CTACCACAAC TGTGACGTCA GAAAGGCCGA TCAGGTCAAG TCTGTCATTG ACAAGATCGA AGAAGAATTC AAGGTTATCG ACATCTTCGT TGCCAACGCT GGTATCGCCT GGACTTCCGG TCCTATGATT GACCAAGAAA CCGATGATGA CTGGCACAAC GTCATGAACG TCGACTTGAA CGGTGTCTAC TACTGTGCCA AGAACATCGG TAAGATTTTC CGTAAGCAAG GTAAGGGTTC GCTTGTCATG ACTGCCTCGA TGTCTGCCCA CATTGTCAAT GTTCCACAAT TGCAAGCTGC TTACAACGCT GCTAAGGCTG GTGTTTTGCA CTTGGGTAAG TCTTTGGCTG TTGAATGGGC TCCATTTGCC AGAGTCAACA CCGTTTCTCC AGGATACATT TCCACCGAGT TGTCTGACTT TGTTCCAACC GAAATGAAGA ACAAGTGGTA CGCCTTGACT CCACAGGGCA GACAAGGTGC TCCACGTGAA TTGTGTGGTG CCTACTTGTA CTTGGCTTCG GACGCTTCCA CTTACACCAC TGGTTCTGAC ATCAGAGTCG ACGGTGGTTA CTGTTCTGTC TAGTTAGACA GACACTAGGT AGAGTCATTG GACTCTGCCG TTTTGCATAT TAAAAGATTT GATTTCTTCA GAGGAGCTTA ATAAACTATA TATGATATAC ATGCTCTGTA TAATAGATAA CGATAATGTT
|
Protein sequence | MTVETATAPQ SMCNTDIGSL PAADPVLPTN VLDFFKLDGK TAAITGGARG IGYAISEAYL QAGISKLAII DYAPNEAALD ELRSRFLKST IVYHNCDVRK ADQVKSVIDK IEEEFKVIDI FVANAGIAWT SGPMIDQETD DDWHNVMNVD LNGVYYCAKN IGKIFRKQGK GSLVMTASMS AHIVNVPQLQ AAYNAAKAGV LHLGKSLAVE WAPFARVNTV SPGYISTELS DFVPTEMKNK WYALTPQGRQ GAPRELCGAY LYLASDASTY TTGSDIRVDG GYCSV
|
| |