Gene Information |
Locus tag | PICST_40172 |
Symbol | SOR3 |
ID | 4852048 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 3501783 |
End bp | 3502928 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | |
GC content | 41% |
IMG OID | 640393756 |
Product | Sorbitol dehydrogenase (L-iditol 2-dehydrogenase) |
Protein accession | XP_001387001 |
Protein GI | 126276430 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
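The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Coordinates copied from the Gene Information table (assumed 1-based, inclusive).
start_bp = 3501783
end_bp = 3502928

# Inclusive span: both the start and the end base belong to the gene.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and is not translated.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1146 bp) and Protein Length (381 aa) fields.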
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.859827 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCCAT CAAAAGAATC GTTGAAATTC TCCAAGGAGA ACACTTGTCT AAAGCTTACT TCGGATCGTC AATTGGTAAT AGACAGCGAG CCAATTCCCA TCTGTGGACG AAATGAAGTA TTGGTTCATA TCAAATGCAC TGGTATCTGT GGATCAGATA TCCATGTTTG GAAGGCAGGT GGAATTGGAA ATTTGCAACT TAAATCAGAT TTGATTCTTG GACATGAGTG TTCTGGCGAA ATCATTCATA TTGGAAGTGA AGTAACCGAA GACTTTGAGA TAGGTAACAA GGTGGCTATC GAACCTCAAC TTCCTTGTGG TATATGCTTC CTTTGCACCA ACGGTAACAT GAACTTATGT TTGAATGTTG ACTTCATGGG CATGCCTGGT ATGCCCGGAA GGCTGCCTTC CATTCATGGA AGCATACAGA GATATAAGAC ATTGGATCCA AGGTTTGTTT ACAAACTTCC TGATAATGTG ACTTACGAGG AAGGTGCTTT GGTAGAAGTT CTTTCTGTTG GTTATCATGG TATCCAGAAG GCAGGTGGTT TGGAATTGGG AAAGCCTTGT GCTATTGCCG GTTGTGGTCC AATTGGTTTG GCTACTTTGA TTCTTGCTGA AGCAGCTGGA GCTTACCCAA TAGTTGTAAC TGATGTCTCT CAAGAGAAAT TGAACTTTGC AAAGTCATTG GTTCCTTCCG TATACACTTA CAAGGTTCAG ACAAACTTGA GTCCAAAAGA AAGTGCAGAA AATGTTAGAA AACTCTTCGG TAAAACTGAG TACGAAATGC CTAGTGTAGT TTTGGAATGC ACTGGTGTTG CTTCTTCTAT CAATACTTGT GCTTACATTG TAAGAAGAAA GGGATGCTTA ACTATTTTGG GAGTTTCAGG TAGAAATGAG ATTGATGGAT TCCCATTCAT GCAGCTTTCA TTCGGGGAAG TCGATGTGAG GTTTATCAAC AGATACCATG ACTCATGGCC ACCTGTTATT AATTTGATCG CAAGTGGAAA GATTGATGTA AAGAAATTTG TCACTCATAC CTTCCCTCTT GAAAAAGCTC ACGTCGCTCT TGAAACTGTC AGCAACCCTG CTATCAGCAC AATCAAGGTT ATGGTTAAAG ATGATGAAGA TTCTCTAACC TTGTAG
|
Protein sequence | MSPSKESLKF SKENTCLKLT SDRQLVIDSE PIPICGRNEV LVHIKCTGIC GSDIHVWKAG GIGNLQLKSD LILGHECSGE IIHIGSEVTE DFEIGNKVAI EPQLPCGICF LCTNGNMNLC LNVDFMGMPG MPGRLPSIHG SIQRYKTLDP RFVYKLPDNV TYEEGALVEV LSVGYHGIQK AGGLELGKPC AIAGCGPIGL ATLILAEAAG AYPIVVTDVS QEKLNFAKSL VPSVYTYKVQ TNLSPKESAE NVRKLFGKTE YEMPSVVLEC TGVASSINTC AYIVRRKGCL TILGVSGRNE IDGFPFMQLS FGEVDVRFIN RYHDSWPPVI NLIASGKIDV KKFVTHTFPL EKAHVALETV SNPAISTIKV MVKDDEDSLT L
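The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to join such blocks into a contiguous string and compute a GC percentage from it. The helper names `normalize` and `gc_percent` are hypothetical, and the demo input is only the first two blocks of the gene sequence, not the full record.

```python
def normalize(seq_field: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(seq_field.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# Demo input: the first two blocks of the gene sequence shown above.
demo = normalize("ATGTCCCCAT CAAAAGAATC")
print(len(demo), gc_percent(demo))
```

Applying the same two helpers to the full Gene sequence field would give a value to compare with the GC content field of the record.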
|
| |