Gene Information |
Locus tag | PICST_65964 |
Symbol | SPS22 |
ID | 4839820 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 1165565 |
End bp | 1166413 |
Gene Length | 849 bp |
Protein Length | 257 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640391135 |
Product | short-chain dehydrogenase/reductase (SDR) family protein |
Protein accession | XP_001385577 |
Protein GI | 126138108 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.647739 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATTTATTCAT CAATCGACCT CATCAATCGA TCTCATCAAT TAACCGATAC TCAATTCCCC TCAACACAAT TCAAAATGGC TGGTAACTTC AAATCAGAAA AGGACATCTC TGGCAAGGTC GCTGTCATAA CTGGTGGAAC GACCAACTTG GGTGGTGAAA CCGCTAAGGA ATTGGCTTCG CTTGGTGTCA ACTTGTTCTT GCACTACCGT TCGTCTTCTG AAGTAGCCAA GAAGTTCCAG GAAGAACTCA AGGCCAAGTA CCCAAAGCTT CAAATTGAAA TATACCAGGC CCCATTAAAG GCCGCTGCTG ATTTGACTAA ATTGTTTGAA GCTGCCAAAA AGGCATTCCC ACAAGGTATT GACATTGCCA TTAACAACAT TGGTAAGGTT GTCAAGAAGC CTCTTGTTGA AGTCACAGAA GAAGAATTCG ACGAATTGGA CCTTGTCAAC CACAAGGTAG CCTTTTTCTT CTTGAAGGAA GCTGCCCTTA ACTTGAACAA CAATGGTAGA ATTGTCACTA TTGTCACATC CTTATTGGCT GCCTACACTC CATTCTACTC ACCTTACCAA GGTACCAAGG CCCCAGTGGA ATTCTACACC AAGGCATTGT CTAAGGAATT GGGTGAGAAG GGCATAACCG TCAACAACGT TGCTCCTGGT CCAATGGACA CGTCTTTCTT GCACACATCT GAAACGCCTG AAGCTGTCCA ATATTTGGCC TCTGTAGGTC TTAACGGTAG ATTAACTGAA CTTCCTGACA TCGTTCCAAT TGTCAGATTC TTGGTTAGCG AGGGTGCCTG GATCACAGGT CAAACCATTT TCGCTTCCGG CGGTTTCACT GCTCGTTAA
Protein sequence | MAGNFKSEKD ISGKVAVITG GTTNLGGETA KELASLGVNL FLHYRSSSEV AKKFQEELKA KYPKLQIEIY QAPLKAAADL TKLFEAAKKA FPQGIDIAIN NIGKVVKKPL VEVTEEEFDE LDLVNHKVAF FFLKEAALNL NNNGRIVTIV TSLLAAYTPF YSPYQGTKAP VEFYTKALSK ELGEKGITVN NVAPGPMDTS FLHTSETPEA VQYLASVGLN GRLTELPDIV PIVRFLVSEG AWITGQTIFA SGGFTAR
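The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch and not part of the original record: the helper names `span_length` and `residue_count` are hypothetical, and the code assumes that the start and end coordinates are 1-based and inclusive, which is consistent with the reported gene length of 849 bp.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def residue_count(grouped_sequence: str) -> int:
    """Count residues in a sequence printed as space-separated blocks."""
    # split() with no arguments drops spaces, tabs and newlines alike.
    return len("".join(grouped_sequence.split()))


# Start bp and End bp copied from the record.
gene_length = span_length(1165565, 1166413)
print(gene_length)  # 849

# First two blocks of the protein sequence, for illustration only.
print(residue_count("MAGNFKSEKD ISGKVAVITG"))  # 20
```

Passing the full gene or protein sequence string to `residue_count` gives a value that can be compared with the Gene Length and Protein Length fields.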