Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_38255 |
Symbol | SOU1 |
ID | 4850856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 241361 |
End bp | 242209 |
Gene Length | 849 bp |
Protein Length | 282 aa |
Translation table | |
GC content | 48% |
IMG OID | 640392564 |
Product | peroxisomal 2,4- dienoyl-CoA reductase, and sorbitol utilization protein |
Protein accession | XP_001387287 |
Protein GI | 126273732 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAACA ACCCGAGCAT CACCTCTCAT ATCAACGCTG CCGTGGGTCC TCTCCCTACT AAGGCTCCCA AACTTGCGTC TAACGTGTTG GATTTATTTT CTCTCAAGGG TAAGGTAGCT TCCATTACTG GCTCTTCAGC CGGGATAGGT CTTGCTGTAG CCGAGGCATA CGCTCAGGCT GGTGCTGATG TTGCAATCTG GTACAACTCC CAGCCTGCTA AGGAAAAGGC CGACAAGATA GCCAAGACGT ATGGTGTACG TTGCAGAGCT TATAAGTGTA ATGTTTCAGA TCAGCAGGAT GTTGAAACCA CTGTGGCTCA GATCGAGGCT GACTTTGGCA CCATAGATAT CTTTGTTGCC AATGCCGGTG TTCCCTGGAC TGAAGGCGAA AGTGTAGAAA TTGACAACTT TGACTCCTGG AAAAAGGTCA TAGACTTAGA CTTGTCTGGG GCTTACTACT GTGCACATGC GGCTGGTAAG ATCTTTAAGA AAAACGGCAA GGGCTCCATG ATTTTCACCG CTTCTATGTC TGGTCACATT GTGAATATTC CTCAATTCCA GGCTCCTTAC AACGCTGCCA AGGCTGCGGT GTTGCACTTG AGCAAATCGT TGGCTATAGA ATGGGCTCCT TTTGCCAGAG TCAATACGAT TTCGCCAGGA TACATTGTCA CCGAGATCTC GGACTTTGTC TCAGACGACA TCAAGTCCAA GTGGTGGCAG TTTATTCCTC TTGGTAGAGA GGGAGTCACA CAAGAGTTGG TTGGTGCCTA CTTGTACTTT GCTTCTGATG CCTCTACATA TACTACGGGA TCAGATCTTA TCGTCGATGG AGGCTACTGT GCGCCATAG
|
Protein sequence | MTNNPSITSH INAAVGPLPT KAPKLASNVL DLFSLKGKVA SITGSSAGIG LAVAEAYAQA GADVAIWYNS QPAKEKADKI AKTYGVRCRA YKCNVSDQQD VETTVAQIEA DFGTIDIFVA NAGVPWTEGE SVEIDNFDSW KKVIDLDLSG AYYCAHAAGK IFKKNGKGSM IFTASMSGHI VNIPQFQAPY NAAKAAVLHL SKSLAIEWAP FARVNTISPG YIVTEISDFV SDDIKSKWWQ FIPLGREGVT QELVGAYLYF ASDASTYTTG SDLIVDGGYC AP
|
| |