Gene PICST_58212 details


Gene Information

Locus tag: PICST_58212
Symbol: YMT1
ID: 4838365
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 57443
End bp: 58531
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 12
GC content: 42%
IMG OID: 640389680
Product: D-threo-aldose 1-dehydrogenase
Protein accession: XP_001384309
Protein GI: 150865192
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
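
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 57443
end_bp = 58531

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and is not translated.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1089 0 362
```

Under these assumptions the computed values agree with the Gene Length (1089 bp) and Protein Length (362 aa) fields.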


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGAAT TTTTAGAGCC ACAGACAAAT GAGTCGCAAG CTATCGGAGA CTCCATTGTT 
GTCCAACCCT ATTCCATAGC TAACTTGCCA CCTTTGGTTG TAGGAGGTGC TGTTTTCAAC
ACACAGTATT CGCTGAACCC CAGCAGTCTC CCTATCCAGG AGATTCTTGA AGATGCTTTT
GCCAAAGGGT TGAATGCTAT CGATACTTCG CCCTACTATG GTCCATCAGA GATCCTCTTG
GGAGAAGCTC TCAAGAAGAT TTCTTTCCCA AGACAAGAGT ACTACATATG TACCAAAGCT
GGAAGAGTTA AGCTTGACGA TTTTGACTAC TCGCGTGATA GTGTCCGTAA ATCTGTAGAG
AGATCGTTGG AGAGATTGAA CACATCGTAT CTTGATCTCG TGTATATGCA CGATATCGAG
TTTGTAAAAG AAGACGAAAT CTTTGATGCG TTGAAAGAGT TGAAATTGTT GAAAACCGAA
GGTCTCATCA AAAACTTCGG TATTTCAGGG TATCCTGTTC GTTTCTTGCA TAAGATTGCG
TCCCGGAGTG TAGGGATTCC AGAGATTGGA CCTTTGGATG CTGTTTTATC ATATTCTAAC
GGCTGTATTC AAAACACAAG ATTATTTGAA TTCTACGACC AGTTCTTTGA CGACTGTAAG
CTCAAGAAGT TGTCTAACGG ATCTATTCTC AGTATGTCTT TGTTGAGATC GGACATAACA
CATTCGTTCC ATCCAGCATC CAAGGAGCTC AAGGACAAGG TTTACGATAT TGCTCACCTC
TTGAAGAAGG AGTACAATGG CTTGGAATTG GCAGATTTGG CTACACGTTT TGCTTTGAGA
AAATGGTTGT TTGAAACTGT ACACCAGGCT GATTCAAGCA ATCTTCATTG GAATCCTTCT
ACCTCGATTG TGTTGGGAGT TTCTAATGTT GAAGAGTTAG ATGTTGCCAT CAGATGCTAC
TGGCAGGTGA AGAACAACAT TGACAATATC AACACCAAGG ATGATATTTT GTTTGAGAAG
GTCAAGAACT TATTGGGCCC AGAGCACTTT AACGAGGTTT GGCCAAGTGG TATTGATGGA
AGGCAATAG
 
Protein sequence
MPEFLEPQTN ESQAIGDSIV VQPYSIANLP PLVVGGAVFN TQYSSNPSSL PIQEILEDAF 
AKGLNAIDTS PYYGPSEILL GEALKKISFP RQEYYICTKA GRVKLDDFDY SRDSVRKSVE
RSLERLNTSY LDLVYMHDIE FVKEDEIFDA LKELKLLKTE GLIKNFGISG YPVRFLHKIA
SRSVGIPEIG PLDAVLSYSN GCIQNTRLFE FYDQFFDDCK LKKLSNGSIL SMSLLRSDIT
HSFHPASKEL KDKVYDIAHL LKKEYNGLEL ADLATRFALR KWLFETVHQA DSSNLHWNPS
TSIVLGVSNV EELDVAIRCY WQVKNNIDNI NTKDDILFEK VKNLLGPEHF NEVWPSGIDG
RQ
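
The record lists translation table 12, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates how that affects this gene by translating two short in-frame segments copied from the gene sequence above. It is a minimal illustration, not the database's own pipeline: the helper names `codon_table` and `translate` are hypothetical, and the table is built from the standard genetic code with only the CTG reassignment applied.

```python
# Standard genetic code, codons enumerated in TCAG order (first, second, third base).
BASES = "TCAG"
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]


def codon_table(ctg_as_serine=True):
    """Return a codon -> amino acid map (hypothetical helper).

    With ctg_as_serine=True, CTG is reassigned to serine, which is the
    difference between translation table 12 and the standard code.
    """
    table = dict(zip(CODONS, STANDARD_AAS))
    if ctg_as_serine:
        table["CTG"] = "S"
    return table


def translate(dna, table):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Bases 121-150 of the gene sequence (in frame; contains one CTG codon).
segment = "ACACAGTATT CGCTGAACCC CAGCAGTCTC"
print(translate(segment, codon_table(True)))   # TQYSSNPSSL
print(translate(segment, codon_table(False)))  # TQYSLNPSSL

# Final nine bases of the gene sequence, ending in the TAG stop codon.
print(translate("AGGCAATAG", codon_table(True)))  # RQ
```

With the CTG reassignment, the first segment yields TQYSSNPSSL, which matches residues 41-50 of the protein sequence above; the standard code would give a leucine at that position instead.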