Gene PICST_31015 details


Gene Information

Locus tag: PICST_31015
Symbol: GCY2
ID: 4837860
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 1522612
End bp: 1523600
Gene Length: 989 bp
Protein Length: 309 aa
Translation table: 12
GC content: 41%
IMG OID: 640389175
Product: aldo/keto reductase
Protein accession: XP_001383910
Protein GI: 150864905
COG category: [R] General function prediction only
COG ID: [COG0656] Aldo/keto reductases, related to diketogulonate reductase
TIGRFAM ID:
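
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the coding sequence includes one stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1522612
END_BP = 1523600
GENE_LENGTH_BP = 989
PROTEIN_LENGTH_AA = 309


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Coding nucleotides, assuming one stop codon on top of the protein."""
    return (protein_aa + 1) * 3


def implied_noncoding(gene_bp: int, protein_aa: int) -> int:
    """Bases in the gene span not accounted for by the coding sequence."""
    return gene_bp - coding_length(protein_aa)


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))                         # 989
    print(coding_length(PROTEIN_LENGTH_AA))                      # 930
    print(implied_noncoding(GENE_LENGTH_BP, PROTEIN_LENGTH_AA))  # 59
```

Under these assumptions the span matches the reported gene length, and the 59 bp left over is consistent with the gene being flagged as spliced.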


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.323179
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.394678
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTACTT CTTATAAGTA TGTGACTTCA TTTTCAACGA ATTGATCAGT TCCTTATATA 
CTAACAATCT TCATAGATCA TCAAAAGTTT ACAAGCTCAA TAGTGGTGAA ACCATCCCAG
CATTGGGACT TGGTACCTGG CAAGCCCTGG CCGATGACGT CTACAAAGCT GTTTTGTTTG
CTCTCAAAAC AGGTTACAGA CACATCGATT CCGCTCTTGC TTACGGCAAT GAAGAACCTG
TTGGCCGTGC TATTAGAGAT TCAGGAATCC CAAGAAAAGA AATCTTTGTA ACAACTAAGT
TGGCCCCCAT TGATGCTCTT GATCCTGCTG GTGCCTTGGA TCAATCCTTG AAAAACCTTG
GTTTGGACTA TGTTGACTTG TACTTGATGC ATTGGCCAGT TTGTTTGAAC AAGGCAAACA
AGCTGCACCC AGGTATTCCA ACATTACCTA ACGGAAAGCG TGATATAGTG TTTGACCGCG
ATTTCACCCA AACATACGCC GACATGCAGC ATTTGGTTGA ATCAGGTAAA GCCAAGTCAA
TTGGAGTTTC TAACTTCTCT ATTAAAAACT TGAAAAAGTT GTTTAGTTCG CCTGACTATA
AGATCGTTCC TACGGTTAAC CAGGTTGAGA TACACCCATA CTTGCCACAA ACTGAGCTTC
TTGAGTTCTG TAAGAAACAT GATATTTTGT TGGAAGCTTT CAGTCCATTG GGCTCCTCCA
ATTCCCCATT ATTGAAGGAC GAAACTATTG TCAAAATTGC AGAAAAGAAT CAGGTTTCGG
TAGCTACAAT TTTGATCTCT TGGGCTATCT GGAGAGGAAC TGTTGTGTTG CCAAAGTCTG
TCAGCGATTC AAGAATTGAA AGCAACTTCA ATGTTGTCGA TTTGTCGGAC GAAGATGGTG
AAGAATTGAA CAACTTGCAC AAAGTCAAGG GTATCAAGAG ATTCGTCAGT CCTAACTGGG
ATCCAATTGA CGTCTACGGT GAAGATTAG
 
Protein sequence
MATSYKSSKV YKLNSGETIP ALGLGTWQAS ADDVYKAVLF ALKTGYRHID SALAYGNEEP 
VGRAIRDSGI PRKEIFVTTK LAPIDALDPA GALDQSLKNL GLDYVDLYLM HWPVCLNKAN
KSHPGIPTLP NGKRDIVFDR DFTQTYADMQ HLVESGKAKS IGVSNFSIKN LKKLFSSPDY
KIVPTVNQVE IHPYLPQTEL LEFCKKHDIL LEAFSPLGSS NSPLLKDETI VKIAEKNQVS
VATILISWAI WRGTVVLPKS VSDSRIESNF NVVDLSDEDG EELNNLHKVK GIKRFVSPNW
DPIDVYGED
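
The record lists translation table 12 (the alternative yeast nuclear code), in which CTG is read as serine rather than leucine. The sketch below illustrates the effect on two short in-frame fragments copied from the gene sequence above. The helper names `build_table12` and `translate` are hypothetical, and the table is built by hand from the standard code rather than taken from a library. It does not handle introns, so it should not be applied to the full gene sequence as listed.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (table 1), amino acids in NCBI codon order
# (TTT, TTC, TTA, TTG, TCT, ...).
STANDARD_AAS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)


def build_standard_table() -> dict:
    """Map each codon to its amino acid under the standard code."""
    codons = ["".join(c) for c in product(BASES, repeat=3)]
    return dict(zip(codons, STANDARD_AAS))


def build_table12() -> dict:
    """Standard code with the single sense-codon change of table 12."""
    table = build_standard_table()
    table["CTG"] = "S"  # CTG encodes Ser instead of Leu
    return table


def translate(cds: str, table: dict) -> str:
    """Translate an in-frame, intron-free fragment; spaces are ignored."""
    cds = cds.replace(" ", "").upper()
    usable = len(cds) - len(cds) % 3  # drop any trailing partial codon
    return "".join(table[cds[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    fragment = "AAGCTGCACCCAGGT"  # contains a CTG codon
    print(translate(fragment, build_standard_table()))  # KLHPG
    print(translate(fragment, build_table12()))         # KSHPG
    tail = "GATCCAATTG ACGTCTACGG TGAAGATTAG"           # end of the gene
    print(translate(tail, build_table12()))             # DPIDVYGED*
```

The table 12 result for the first fragment matches the KSHPG stretch of the listed protein sequence, and the final fragment reproduces its C-terminal residues followed by the stop codon.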