Gene Information |
Locus tag | PICST_36326 |
Symbol | SOR4 |
ID | 4839583 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 910736 |
End bp | 911848 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640390898 |
Product | sorbitol dehydrogenase |
Protein accession | XP_001385184 |
Protein GI | 150865814 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.218842 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTATTC CAGAAACAAT GAAGGCAATT GTGTTCCATG GGCCAGGGGA CATCCGCACA GAGACCAGGC CAACTCCAGT TATCGAAGAG CCATCGGATG TGATCTTAAA GGTGAAGTTC TCTGGATTAT GCGGTACTGA TTTGCACAGC TACAGAGGCC ATATCAAGGG TCCAATTGGA ACTATTATTG GCCATGAGTT TGTAGGAGAA GTTGTAGCTC GTGGAGCAGA CATATCCGAT TCGGTCTTTA CTATTGGCGA AGATGTTCTT AGCACATTCA CGATTCAATG TGGACAATGC TGGTATTGTA AGCATGGATA CTCCGGTCAA TGCGATGTAA CCAATACATT TGGTAAGGTT GGATTAAATG GTGGACAATC TGAGTATGTA AGGGTTCCGT TTGCCAAATC TACATTGGTA AAAAAGCCAC AGAACAACGA CGGCATTGAT GATTCTGTGT ATGTACTCAT GGCTGATATC TTTATCACTG GGTATTACGG TGTTAAAAAG ATCCGCGATT TCTTGAGCAC AAAGCCAGCA GTGGGTATTG AAGCGCAGGA GTTTAAGGAC GTGACCATCT TGCAATTGGG GGTAGGTCCC GTTGGATTGT GTGCTTTGAG AGTGTTGAAG CACTTCGGAT TTACGAAAGT GGTGTGCGTG GACAGTGTTC CTTCGAGATT GGAAGAGGCA AAGAGGCTAG GAGCATACAA GGCGATCAAT TTCGAGACTG ACAAGACTCT GCTTGAAGAA TTTATCAGAA ACGAGACTGG TAATGTGGGA TTTGATGCAG TATTGGAAGT AGTGGGAGCA TCTTCAGCTG TGAAGACAGC CTACGACTCT GTTCGTCGTA ACGGGTTCAT TTCCTCGTTG GGAATGGGTC ACGAGCCTCT TCCGTTCAAC GGCCTCGATT GCTACTTAAA GAACATCAAT ATCTCCTTTG GAAGATGTCA CTGCTGGTCT TTGTTCCCTG AAGCCTTAGA AGTGTTCGAA TCTATGAAGG CAGACTTTGC TAGTTTTATA GACTACACAA CTGGACTCGA TGGGGCTAAA GAGGCATTTG AGTTGTTTGA CAAGCATAAG GTGAACAAGG TCGTGTTTGA CTTGACGAAG TAG
|
Protein sequence | MSIPETMKAI VFHGPGDIRT ETRPTPVIEE PSDVILKVKF SGLCGTDLHS YRGHIKGPIG TIIGHEFVGE VVARGADISD SVFTIGEDVL STFTIQCGQC WYCKHGYSGQ CDVTNTFGKV GLNGGQSEYV RVPFAKSTLV KKPQNNDGID DSVYVLMADI FITGYYGVKK IRDFLSTKPA VGIEAQEFKD VTILQLGVGP VGLCALRVLK HFGFTKVVCV DSVPSRLEEA KRLGAYKAIN FETDKTSLEE FIRNETGNVG FDAVLEVVGA SSAVKTAYDS VRRNGFISSL GMGHEPLPFN GLDCYLKNIN ISFGRCHCWS LFPEALEVFE SMKADFASFI DYTTGLDGAK EAFELFDKHK VNKVVFDLTK
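The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below is one way to check the listed lengths and the translation against that table. It is a minimal illustration, not the database's own pipeline: the helper names `codon_table` and `translate` are hypothetical, and only the single CTG reassignment of table 12 is modelled on top of the standard code.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(table_id=12):
    """Return a codon -> amino acid dict (hypothetical helper).

    Only tables 1 and 12 are handled here; table 12 differs from the
    standard code by reading CTG as serine.
    """
    table = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), STANDARD)}
    if table_id == 12:
        table["CTG"] = "S"
    return table


def translate(dna, table_id=12):
    """Translate a coding sequence up to the first stop codon.

    Whitespace (such as the 10-base grouping in the record) is ignored.
    Ambiguous bases are not handled and would raise KeyError.
    """
    dna = "".join(dna.split()).upper()
    table = codon_table(table_id)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Coordinates from the record: both ends are inclusive.
start_bp, end_bp = 910736, 911848
gene_length = end_bp - start_bp + 1      # 1113 bp
protein_length = gene_length // 3 - 1    # 370 aa, stop codon excluded

# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGAGTATTC CAGAAACAAT GAAGGCAATT"))
# Stretch containing a CTG codon, under table 12 and under the standard code.
print(translate("GACAAGACTCTGCTTGAAGAA", table_id=12))
print(translate("GACAAGACTCTGCTTGAAGAA", table_id=1))
```

The CTG-containing stretch translates to DKTSLEE under table 12, matching the listed protein sequence, whereas the standard code would give DKTLLEE.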