Gene PICST_75910 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_75910 
SymbolERG1 
ID4837155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2450159 
End bp2451765 
Gene Length1607 bp 
Protein Length499 aa 
Translation table12 
GC content48% 
IMG OID640388470 
Productsqualene epoxidase(monooxygenase), erosterol biosynthesis 
Protein accessionXP_001383223 
Protein GI150864419 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.886863 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTTATGACCG CGCCATTGGA TGTCAAGTAC GATGTTATTG TCATCGGTGC CGGAGTTGTA 
GGACCGGCCA TCGCCACAGC CTTAGCTAGA CAGGGCAGAA AGGTATTGAT TGTAGAGAGA
GACTGGGCCA AGCCCGACAG AATTGTCGGG GAGTTGATGC AGCCGGCCGG GGTCAAGGCT
CTCAGAGAGT TGGGGATGAT TCTGGCCATC AACAACATCG AGGCTTTCGA CTGTAGAGGT
TACTACATCA AATACTTCAA CAAGAACATC CAGATCCCAT ACCCATTAAA GGAAGATACA
GCTAGAACGA ACCCTGTGAA ACCGGTGGCT GATTGTGTCA GAGATGGCAA CGACAAGTTA
CAGGAAGACT CGACTTTGAG TCCTTCCGAA TGGGACGAAG ACGAACGTGT CAGAGGAGTT
GCTTTCCACC ATGGAGACTT CTTACAGAAC TTGAGACAGA TTGTGAAAGA CGAGCCCAAC
GTAGAATGGG TCGAGGGAAA TGTCGTCAAG ATCTTGAGAG ACAATTTCGA CTCCACTATT
GTGACTGGAG TTAGGGTTAA GGAACTGGTT GGCTCAAAGG ACTACCATGC CAAGTTGACC
ATCTGCTGTG ACGGTATCTA TTCCAAGTTC AGAAAGGAGT TGTCCAGCAA CAATGTACCA
ACCATAGGCT CGTACTTCAT AGGGTTATAC ATGACCGACT GTAAATTGCC AGCCAAGCAC
CACGGCCACG TTATCTTGGG GGACCAGGCC CCCGTGATTG TGTACCAGAT CTCTCCTACC
GAGACAAGAA TCTTGTGTGC CTATAGATCG TTGAAACCAC CTTCGCGTGC CAACAACGAG
TTGTACAACT ATTTGAAGAA CGAGGTCTTG CCCGTGTTGC CCGAAGAAAC CAAGCCGGCT
TTCGAAATCG CCTTGGAAGG AGGCAAGTAC AGAGTCATGC CTAACCAATA CTTGCCAGCC
AGAAGACAGG GCAGCAGCCA GCACAAGGGA TTGGTAATGT TGGGAGACTC CCTAAATATG
AGACATCCTC TCACCGGTGG TGGTATGACG GTTGGTTTGA ACGATGCGGT GTTGTTGGCT
AAGTTGTTGC ATCCTGCGTA CATTGCTGAT TTTGAGGAGT ATGACGAAGT GTCCCGGATA
TTGAAGCAGT TCCACAGAAA GAGAAAGAAT CTCGACGCCG TCATCAATAC CTTGTCGGTT
GCCTTGTACT CGTTGTTTGC AGCCGACAAA AATTCGCTCA AGATCTTGCA AAGAGGCTGT
TTCCAGTACT TCTTGTTAGG AGGGTCCTGT GTAACCGGCC CCATTGGGTT GTTGTCAGGA
ATGTTGCCCT TCCCAATGTT GTTGTTTAAC CACTTCTTCA GCGTCGCTTT TTATGCTATC
TACTGTAACT TCAAAGACAG AGGCTTGGCT GGATTCCCTA TTGCTCTCTT GGAAGCCTTT
GCTGTATTCT TTACTGCTGT AATCGTATTT GCTCCCTACT TGTGGCTTGA GTTAGTAGCC
TAGACTTCCA GAGAACATCT GTCTGTGGTG AAGTGAAACG AAAACACATA GAGAAAAGAG
CTATACTCAT ATAGACATAC ATGTATATTT AATTAACGTT GAGCTCC
 
Protein sequence
MTAPLDVKYD VIVIGAGVVG PAIATALARQ GRKVLIVERD WAKPDRIVGE LMQPAGVKAL 
RELGMISAIN NIEAFDCRGY YIKYFNKNIQ IPYPLKEDTA RTNPVKPVAD CVRDGNDKLQ
EDSTLSPSEW DEDERVRGVA FHHGDFLQNL RQIVKDEPNV EWVEGNVVKI LRDNFDSTIV
TGVRVKESVG SKDYHAKLTI CCDGIYSKFR KELSSNNVPT IGSYFIGLYM TDCKLPAKHH
GHVILGDQAP VIVYQISPTE TRILCAYRSL KPPSRANNEL YNYLKNEVLP VLPEETKPAF
EIALEGGKYR VMPNQYLPAR RQGSSQHKGL VMLGDSLNMR HPLTGGGMTV GLNDAVLLAK
LLHPAYIADF EEYDEVSRIL KQFHRKRKNL DAVINTLSVA LYSLFAADKN SLKILQRGCF
QYFLLGGSCV TGPIGLLSGM LPFPMLLFNH FFSVAFYAIY CNFKDRGLAG FPIALLEAFA
VFFTAVIVFA PYLWLELVA