Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_89701 |
Symbol | SOR1 |
ID | 4839191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 30570 |
End bp | 31958 |
Gene Length | 1389 bp |
Protein Length | 385 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640390506 |
Product | polyol dehydrogenase |
Protein accession | XP_001385027 |
Protein GI | 150865700 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ACTAAACGTA CAAAGAAATT TTCCAAAGAT TCATATATTT AGACTACCGA ATCGCCGATA CTAGCCTACC ATCCATAGAT AAAATCCTAT TTTGGTTCCT AAATAAGACG AATAAACTAC AATTACAAAC TATCTTTACT TTCCATCTAA CTCACCTATT TAAACACAAT GAAGGCCATA GTTTACCATG ACAGGGAAGA TGTCCGCTAT CACCTGGACT TCCCTGAGCC GCAGATTGTC AGGCCTGACG ATGTCAAGAT CAAGGTCCAT TACTGTGGGA TCTGTGGTTC TGACTTGAAG GAGTATCTCG ATGGACCGAT TTTTTTCTCC AAGAAGGGTA CCAACAACGA AGTTTCCAAC TTGCCATATC CGCAATGCAT GGGTCACGAG ATCAGTGGTG AAGTTTACGA GGTCGGATCT GAAGTTGACA ACCTTCAGAT TGGAGACAAA GTGGTGGTGG AAGTCACAGG TACCTGTTTT GACAGATACC GGTTCCCCGA ATCGCCCAAC TTCAACAAAC CCAAGTGTGG AAGTTGTCTC GAAGGTCACT ATAATGCCTG TGCATATCTT GGGTTGACTG GCTTAGGTTT TACCAACGGA GGGTGTTCTG AATATTTGGT TACAGCTGCT AGCAAAGCCA TCAAGTTTCA AGAAGATATC ATTCCGATGG ATGTGGCTGC TGTAATCCAG CCAATTGCTG TCAGTTGGCA CGCGGTTCGC GTGTCTCACT TCCAAGAAGG TCAAAGTGCT TTGATCTTGG GAGGTGGCCC CATTGGGTTG ACTACCATTT TTGCATTGAA GGGAAACAAA GCTGGCAAGA TAGTCGTTTC TGAGCCTGCT TTGGCCAGAC GACAATTGGC AGAAAAGTTG GGCGTCACAG TCTTTGATCC TACCGGCAAA TCTGTTGATG AATGTGTAGA GGAGTTGAGG AAGTTGTCAC CCAACGGCTA TGGCTTCAAT CATTCATATG ATTGTTCCGG TGTGCCAGCT ACTTTCCAAA CCAGTCTCAG GGCATTGAAT ATTAGAGGAA CGGCTACTAA TGTTGCCGTG TGGGCTCACA AATCCGTTCC ATACTTTCCT ATGGAAGCCA CCTGGGCCGA AAAGATCATC ACCGGATCAA TTTGCTTTGT CAAGGACGAT TTTATAGATG TCGTCAATGC TCTTCACGAA GGCACTATTC CAGTTGACGA AGTCAAGTTA TTGATCACTT CCAAGATTCA TCTTGAGGAT GGAGTAGAGA AGGGCTTTTT AGAATTGATT CACCACAAGG AAAAGCATAT AAAGATCTTG TTTTCTCCTA AGGAAGAATA TAGAGTAAAG AAGTAATAGA CAAGTGTACT ATATAAGCAT CACTATTTAC TAGAATACAT AGAGCAACGC ATGATTACT
|
Protein sequence | MKAIVYHDRE DVRYHSDFPE PQIVRPDDVK IKVHYCGICG SDLKEYLDGP IFFSKKGTNN EVSNLPYPQC MGHEISGEVY EVGSEVDNLQ IGDKVVVEVT GTCFDRYRFP ESPNFNKPKC GSCLEGHYNA CAYLGLTGLG FTNGGCSEYL VTAASKAIKF QEDIIPMDVA AVIQPIAVSW HAVRVSHFQE GQSALILGGG PIGLTTIFAL KGNKAGKIVV SEPALARRQL AEKLGVTVFD PTGKSVDECV EELRKLSPNG YGFNHSYDCS GVPATFQTSL RALNIRGTAT NVAVWAHKSV PYFPMEATWA EKIITGSICF VKDDFIDVVN ALHEGTIPVD EVKLLITSKI HLEDGVEKGF LELIHHKEKH IKILFSPKEE YRVKK
|
| |