Gene Information |
Locus tag | PICST_58212 |
Symbol | YMT1 |
ID | 4838365 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 57443 |
End bp | 58531 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640389680 |
Product | D-threo-aldose 1-dehydrogenase |
Protein accession | XP_001384309 |
Protein GI | 150865192 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
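Note: the Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent. The sketch below reproduces the arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 57443
END_BP = 58531


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1089
print(protein_length(length_bp))  # 362
```

Because the gene is on the minus strand, the listed sequence is the reverse complement of the replicon interval; the length calculation is unaffected by strand.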
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAAT TTTTAGAGCC ACAGACAAAT GAGTCGCAAG CTATCGGAGA CTCCATTGTT GTCCAACCCT ATTCCATAGC TAACTTGCCA CCTTTGGTTG TAGGAGGTGC TGTTTTCAAC ACACAGTATT CGCTGAACCC CAGCAGTCTC CCTATCCAGG AGATTCTTGA AGATGCTTTT GCCAAAGGGT TGAATGCTAT CGATACTTCG CCCTACTATG GTCCATCAGA GATCCTCTTG GGAGAAGCTC TCAAGAAGAT TTCTTTCCCA AGACAAGAGT ACTACATATG TACCAAAGCT GGAAGAGTTA AGCTTGACGA TTTTGACTAC TCGCGTGATA GTGTCCGTAA ATCTGTAGAG AGATCGTTGG AGAGATTGAA CACATCGTAT CTTGATCTCG TGTATATGCA CGATATCGAG TTTGTAAAAG AAGACGAAAT CTTTGATGCG TTGAAAGAGT TGAAATTGTT GAAAACCGAA GGTCTCATCA AAAACTTCGG TATTTCAGGG TATCCTGTTC GTTTCTTGCA TAAGATTGCG TCCCGGAGTG TAGGGATTCC AGAGATTGGA CCTTTGGATG CTGTTTTATC ATATTCTAAC GGCTGTATTC AAAACACAAG ATTATTTGAA TTCTACGACC AGTTCTTTGA CGACTGTAAG CTCAAGAAGT TGTCTAACGG ATCTATTCTC AGTATGTCTT TGTTGAGATC GGACATAACA CATTCGTTCC ATCCAGCATC CAAGGAGCTC AAGGACAAGG TTTACGATAT TGCTCACCTC TTGAAGAAGG AGTACAATGG CTTGGAATTG GCAGATTTGG CTACACGTTT TGCTTTGAGA AAATGGTTGT TTGAAACTGT ACACCAGGCT GATTCAAGCA ATCTTCATTG GAATCCTTCT ACCTCGATTG TGTTGGGAGT TTCTAATGTT GAAGAGTTAG ATGTTGCCAT CAGATGCTAC TGGCAGGTGA AGAACAACAT TGACAATATC AACACCAAGG ATGATATTTT GTTTGAGAAG GTCAAGAACT TATTGGGCCC AGAGCACTTT AACGAGGTTT GGCCAAGTGG TATTGATGGA AGGCAATAG
|
Protein sequence | MPEFLEPQTN ESQAIGDSIV VQPYSIANLP PLVVGGAVFN TQYSSNPSSL PIQEILEDAF AKGLNAIDTS PYYGPSEILL GEALKKISFP RQEYYICTKA GRVKLDDFDY SRDSVRKSVE RSLERLNTSY LDLVYMHDIE FVKEDEIFDA LKELKLLKTE GLIKNFGISG YPVRFLHKIA SRSVGIPEIG PLDAVLSYSN GCIQNTRLFE FYDQFFDDCK LKKLSNGSIL SMSLLRSDIT HSFHPASKEL KDKVYDIAHL LKKEYNGLEL ADLATRFALR KWLFETVHQA DSSNLHWNPS TSIVLGVSNV EELDVAIRCY WQVKNNIDNI NTKDDILFEK VKNLLGPEHF NEVWPSGIDG RQ
|
| |