Gene PICST_42452 details

Gene Information

Locus tag: PICST_42452
Symbol: GRP2.1
ID: 4836960
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 345996
End bp: 347006
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 12
GC content: 44%
IMG OID: 640388275
Product: dihydroflavonol-4-reductases
Protein accession: XP_001382297
Protein GI: 150863730
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
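
The coordinate and length fields above are mutually consistent, which a short check can illustrate. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends with a single stop codon; the function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
START_BP = 345996
END_BP = 347006


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1011 336
```

Both results agree with the Gene Length (1011 bp) and Protein Length (336 aa) fields.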


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTACAA CTACCGTTTT TCTCTCGGGT GCCACTGGTT ATATCGCACA GCATATAATT 
GTCCAGCTTC TTTCTAAGGG GTATAATGTG GTTGGTTCTG TCAGATCGCA AGAGAAGGGT
GAAAAGTTGA AGTCTACATA TGGTGAACAA TTTCAGTATG TTGTTGTACC TAGCTTAGAC
CAAAAGGGTG CTTTCGATGA AGCCTTGAAG CAACATCCTG AAGCCACCAT ATTCTTACAC
ACTGCTTCTC CTGTAACTTT CTCTAGTGAA GACAACGAGA AAGATATCTT GATTCCTGCT
ATCGAAGGAA CCAGAAATGC CTTACAAGCT ATTTATGACC ATGCTCCTCA GATCAAGAGG
GTTGTTTTGA CCAGTTCTAC AGTCTCGTTA GCTGACATTG ATGATTTCCA AATTCCTTCG
CTCAAGTTGA ACGAAGAGTC GTGGGCCAGT GTTACTTATG AAGATGGTAA GACCAAAGAT
GCCATGACCG CCTACTGGGC TTCCAAGAAG TATGCTGAAA AGGCAGCCTG GGCTTTTGTT
GAATCCAATA AACCCAACTT CGCCCTCTCC GCCGTCCTTC CTTCATATGT GTTTGGACCT
CAAGCACACG ATGCCGAAGC TAAGGGTCAA ATGAACTTGA CTGCTGAAGT TTTTGCTAGT
GTTTACCGTT TGTCCAAGAA CGATGAGGTT CCTGAAGTAG CTGGTCCTTT TGTTGATGTC
AGAGATGTGG CCAAGGCTCA CATTGTTGCT TTCGAGAAGG ATGAAGCCAA GGGTCAAAGA
ATCATTACCA GCAGTGCCAG ATTCAATGCG CAGCTGATCT TGAACATCAT CAGAGATAAG
TTTCCCGATC TCAGAGAGAA ATTGCCAGTT GGAGTTCCTG CCAATGGCGA TGTCTCTGAG
TTTGTCCGCT GGGATGACCA GAAGTCTAAG AATTTGTTGG GTTTCGAATT CTCTGATCTT
GAGAAGGTAG TTGTCGATAC TATCGAGCAA GTGATTAGAG CCAACAAATA A
 
Protein sequence
MSTTTVFLSG ATGYIAQHII VQLLSKGYNV VGSVRSQEKG EKLKSTYGEQ FQYVVVPSLD 
QKGAFDEALK QHPEATIFLH TASPVTFSSE DNEKDILIPA IEGTRNALQA IYDHAPQIKR
VVLTSSTVSL ADIDDFQIPS LKLNEESWAS VTYEDGKTKD AMTAYWASKK YAEKAAWAFV
ESNKPNFALS AVLPSYVFGP QAHDAEAKGQ MNLTAEVFAS VYRLSKNDEV PEVAGPFVDV
RDVAKAHIVA FEKDEAKGQR IITSSARFNA QSILNIIRDK FPDLREKLPV GVPANGDVSE
FVRWDDQKSK NLLGFEFSDL EKVVVDTIEQ VIRANK
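
The record lists translation table 12 (the alternative yeast nuclear code), in which CTG is read as serine rather than leucine. This matters here: the gene sequence contains `CAG CTG ATC`, which corresponds to `QSI` in the protein sequence. The following is a minimal sketch of such a translation, assuming the NCBI codon ordering (bases in the order T, C, A, G); the names `CODON_TABLE_12` and `translate_table12` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# This string differs from the standard code only at CTG, which reads as Ser.
TABLE_12_AA = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AA)
}


def translate_table12(dna: str) -> str:
    """Translate a CDS with table 12, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE_12[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as formatted in the record.
print(translate_table12("ATGTCTACAA CTACCGTTTT TCTCTCGGGT"))  # MSTTTVFLSG
# The CTG codon is read as Ser under table 12.
print(translate_table12("CAGCTGATC"))  # QSI
```

Pasting the full gene sequence into `translate_table12` gives a way to compare the result against the protein sequence listed above.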