Gene PICST_59150 details


Gene Information

Locus tag: PICST_59150
Symbol: SAD1
ID: 4838558
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 1459782
End bp: 1460843
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 12
GC content: 48%
IMG OID: 640389873
Product: secondary alcohol dehydrogenase (SADH1)
Protein accession: XP_001384580
Protein GI: 126136112
COG category: [R] General function prediction only
COG ID: [COG1064] Zn-dependent alcohol dehydrogenases
TIGRFAM ID:
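
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes inclusive 1-based coordinates and a gene length that counts the stop codon. Neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming its last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1459782, 1460843)
print(length_bp)                  # 1062
print(protein_length(length_bp))  # 353
```

Under these assumptions the computed values agree with the reported Gene Length (1062 bp) and Protein Length (353 aa).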


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.318338
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.350827
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCATCC CAAAAACACA AGTTGCTTAC GGTTATGTCC CCGGCAAGAA GACGATTCAA 
TGTTTCCCCA ACCACCCAGT GCAGACCCCA GGAGATAACC AGGTTTTGTT GAAGATCGAG
GCTGCTGGTA TGTGCCATAG CGACCACAAC ATCCTCCTTT CTGGCCCTCT TGCAGGCGGT
AAAGGTGAAC CTAAGATGGT AATGGGTCAT GAGATTGCTG GCCAGATTGT CCAAGTCGGA
AAGAACCTCG AAAAATCCGA TATCTACGAA ATCGGTGGCC GCTTCGCTGT GACGATCGCC
AAAGCATGTG GAGAGTGTGA GATGTGCCGG GGAGGTGTAG ATAACGCTTG TGGAAATTCT
GTAATGGCCT ACGGATTGAA TTGCGACGGA GGGTTCCAGC AATACTTGTT GATCGACAAC
TTGAGAACGT TATTGCCTAT TCCAGAAGGC ATGAGCTACG AGGACGCTGC TGTTTCTACT
GATGCCGTCT TGACTCCCTT CCATGCAATT CAGAAAGTCA GAGACTTGCT CCATCCCACC
ACGAAAGTGT TGGTCCAGGG CTTGGGTGGT CTTGGTTTGA ATGCTGTCCA GATCTTGAAG
AGCTACAACT GCAATATCGT CGCCTGCGAC ATCAAGGAAG AAAGTAGAGA ATTGGCCAAG
GGCCTTGGAG CAGCGGAAAC CTACGCCAAC ATCGGGGACT CCAGTCATTC AATAGAGAGC
TTTGACCTCT GTTTTGACTT TGTCGGTATT GACATCACCT TTAAGAACAG TCAGAGCTAC
GTAAAAAACC ATGGAAAGAT CGTAATGGTG GGCTTGGGAA GGTACAAGTT GAGCACTTTG
AACTTCGAGC TCGCAAGAAG AGATGTCGAG ATTATCTTCA ATTTCGGAGG CACTTCTTTG
GAGCAAATTG AGTGTATGAA GTGGATCTCC TTGGGCAGAA TCAAGCCTGT AGCCCAGGTT
GTGGACATGG AACAGTTGCC TAACTACATG GAGAAGTTGG CCAACAACGC TATCAAGGGA
AGAATGGTTT TCAGACCCAA TTTCAGAAAA TCCAATTTGT AG
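
A GC percentage like the one reported under Gene Information can be recomputed from the gene sequence. The following is a minimal sketch: the helper name `gc_content` is hypothetical, and the record does not say how the database derives or rounds its own figure.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence."""
    # Keep only nucleotide letters so that spaces and line breaks are ignored.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence, with the space left in place.
print(gc_content("ATGTCCATCC CAAAAACACA"))  # 40.0
```

Passing the full gene sequence as a single string gives the whole-gene value for comparison with the reported 48%.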
 
Protein sequence
MSIPKTQVAY GYVPGKKTIQ CFPNHPVQTP GDNQVLLKIE AAGMCHSDHN ILLSGPLAGG 
KGEPKMVMGH EIAGQIVQVG KNLEKSDIYE IGGRFAVTIA KACGECEMCR GGVDNACGNS
VMAYGLNCDG GFQQYLLIDN LRTLLPIPEG MSYEDAAVST DAVLTPFHAI QKVRDLLHPT
TKVLVQGLGG LGLNAVQILK SYNCNIVACD IKEESRELAK GLGAAETYAN IGDSSHSIES
FDLCFDFVGI DITFKNSQSY VKNHGKIVMV GLGRYKLSTL NFELARRDVE IIFNFGGTSL
EQIECMKWIS LGRIKPVAQV VDMEQLPNYM EKLANNAIKG RMVFRPNFRK SNL
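
The record lists translation table 12, the NCBI alternative yeast nuclear code, in which CTG encodes serine rather than leucine. The sketch below shows how the gene sequence maps to the protein sequence under that table. It uses a hand-built codon dictionary instead of a bioinformatics library, and the name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# NCBI translation table 12, amino acids listed for codons in the order
# TTT, TTC, TTA, TTG, TCT, ..., GGG. The only difference from the standard
# code is CTG, which encodes serine (S) instead of leucine (L).
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate(seq: str) -> str:
    """Translate a DNA string with table 12, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # remove the spaces between blocks
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTCCATCC CAAAAACACA AGTTGCTTAC"))  # MSIPKTQVAY
print(translate("CTG"))                               # S
```

The ten residues produced from the first 30 bases match the start of the protein sequence listed above.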