Gene PICST_40172 details

Gene Information

Locus tag: PICST_40172
Symbol: SOR3
ID: 4852048
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 3501783
End bp: 3502928
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table:
GC content: 41%
IMG OID: 640393756
Product: Sorbitol dehydrogenase (L-iditol 2-dehydrogenase)
Protein accession: XP_001387001
Protein GI: 126276430
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
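
The length fields in this record can be cross-checked from the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not state the convention, but 3502928 - 3501783 + 1 = 1146 agrees with the listed gene length) and that the last codon of the unspliced CDS is a stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends with a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the terminal stop codon encodes no residue


length = gene_length(3501783, 3502928)
print(length, protein_length(length))  # prints: 1146 381
```

Both values match the Gene Length and Protein Length fields of the record.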


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.859827
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCCAT CAAAAGAATC GTTGAAATTC TCCAAGGAGA ACACTTGTCT AAAGCTTACT 
TCGGATCGTC AATTGGTAAT AGACAGCGAG CCAATTCCCA TCTGTGGACG AAATGAAGTA
TTGGTTCATA TCAAATGCAC TGGTATCTGT GGATCAGATA TCCATGTTTG GAAGGCAGGT
GGAATTGGAA ATTTGCAACT TAAATCAGAT TTGATTCTTG GACATGAGTG TTCTGGCGAA
ATCATTCATA TTGGAAGTGA AGTAACCGAA GACTTTGAGA TAGGTAACAA GGTGGCTATC
GAACCTCAAC TTCCTTGTGG TATATGCTTC CTTTGCACCA ACGGTAACAT GAACTTATGT
TTGAATGTTG ACTTCATGGG CATGCCTGGT ATGCCCGGAA GGCTGCCTTC CATTCATGGA
AGCATACAGA GATATAAGAC ATTGGATCCA AGGTTTGTTT ACAAACTTCC TGATAATGTG
ACTTACGAGG AAGGTGCTTT GGTAGAAGTT CTTTCTGTTG GTTATCATGG TATCCAGAAG
GCAGGTGGTT TGGAATTGGG AAAGCCTTGT GCTATTGCCG GTTGTGGTCC AATTGGTTTG
GCTACTTTGA TTCTTGCTGA AGCAGCTGGA GCTTACCCAA TAGTTGTAAC TGATGTCTCT
CAAGAGAAAT TGAACTTTGC AAAGTCATTG GTTCCTTCCG TATACACTTA CAAGGTTCAG
ACAAACTTGA GTCCAAAAGA AAGTGCAGAA AATGTTAGAA AACTCTTCGG TAAAACTGAG
TACGAAATGC CTAGTGTAGT TTTGGAATGC ACTGGTGTTG CTTCTTCTAT CAATACTTGT
GCTTACATTG TAAGAAGAAA GGGATGCTTA ACTATTTTGG GAGTTTCAGG TAGAAATGAG
ATTGATGGAT TCCCATTCAT GCAGCTTTCA TTCGGGGAAG TCGATGTGAG GTTTATCAAC
AGATACCATG ACTCATGGCC ACCTGTTATT AATTTGATCG CAAGTGGAAA GATTGATGTA
AAGAAATTTG TCACTCATAC CTTCCCTCTT GAAAAAGCTC ACGTCGCTCT TGAAACTGTC
AGCAACCCTG CTATCAGCAC AATCAAGGTT ATGGTTAAAG ATGATGAAGA TTCTCTAACC
TTGTAG
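
The gene sequence is printed in space-separated blocks of ten bases, so the grouping has to be removed before the sequence is measured. The following is a minimal sketch of how a GC fraction such as the GC content field could be recomputed. The helper names are hypothetical, and only the first row of the listing is used here to keep the example short; pasting in the full listing would give the figure for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Strip the spaces and line breaks used to group bases for display."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence listing, copied from the record.
first_row = "ATGTCCCCAT CAAAAGAATC GTTGAAATTC TCCAAGGAGA ACACTTGTCT AAAGCTTACT"
seq = clean_sequence(first_row)
print(len(seq))                      # 60 bases in one row
print(f"{gc_fraction(seq):.1%}")     # GC fraction of this row only
```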
 
Protein sequence
MSPSKESLKF SKENTCLKLT SDRQLVIDSE PIPICGRNEV LVHIKCTGIC GSDIHVWKAG 
GIGNLQLKSD LILGHECSGE IIHIGSEVTE DFEIGNKVAI EPQLPCGICF LCTNGNMNLC
LNVDFMGMPG MPGRLPSIHG SIQRYKTLDP RFVYKLPDNV TYEEGALVEV LSVGYHGIQK
AGGLELGKPC AIAGCGPIGL ATLILAEAAG AYPIVVTDVS QEKLNFAKSL VPSVYTYKVQ
TNLSPKESAE NVRKLFGKTE YEMPSVVLEC TGVASSINTC AYIVRRKGCL TILGVSGRNE
IDGFPFMQLS FGEVDVRFIN RYHDSWPPVI NLIASGKIDV KKFVTHTFPL EKAHVALETV
SNPAISTIKV MVKDDEDSLT L
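
The two listings can be compared by translating the gene sequence codon by codon. The sketch below is an illustration under one stated assumption: the Translation table field of this record is empty, so the standard genetic code is used, and that choice is not confirmed by the record. The names `CODON_TABLE` and `translate` are hypothetical. Only the first and last rows of the gene sequence are translated here.

```python
from itertools import product

# Standard genetic code (assumed; the record leaves "Translation table" empty).
# Amino acids are listed in the conventional TCAG codon order: TTT, TTC, TTA, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(raw: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace used for display grouping is ignored. A codon containing a
    character other than A, C, G, or T raises KeyError.
    """
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_row = "ATGTCCCCAT CAAAAGAATC GTTGAAATTC TCCAAGGAGA ACACTTGTCT AAAGCTTACT"
print(translate(first_row))  # MSPSKESLKFSKENTCLKLT
print(translate("TTGTAG"))   # L (final residue, followed by the stop codon)
```

The first output agrees with the first twenty residues of the protein sequence listing, and the second with its final residue.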