Gene Information |
Locus tag | PICST_75910 |
Symbol | ERG1 |
ID | 4837155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 2450159 |
End bp | 2451765 |
Gene Length | 1607 bp |
Protein Length | 499 aa |
Translation table | 12 |
GC content | 48% |
IMG OID | 640388470 |
Product | squalene epoxidase (monooxygenase), ergosterol biosynthesis |
Protein accession | XP_001383223 |
Protein GI | 150864419 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | |
|
|
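The coordinate and length fields in the table above are related by simple arithmetic. The following is a minimal Python sketch of that check, assuming Start bp and End bp are 1-based inclusive positions (the record does not state its coordinate convention). The helper names `span_length` and `coding_length` are hypothetical and exist only for this illustration.

```python
# Values copied from the Gene Information table.
START_BP = 2450159
END_BP = 2451765
PROTEIN_LENGTH_AA = 499


def span_length(start: int, end: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode a protein plus one stop codon."""
    return 3 * (protein_aa + 1)


gene_bp = span_length(START_BP, END_BP)
cds_bp = coding_length(PROTEIN_LENGTH_AA)

# Print the span, the coding length, and the remainder outside the reading frame.
print(gene_bp, cds_bp, gene_bp - cds_bp)
```

Under that assumption the span agrees with the listed Gene Length of 1607 bp. A 499 aa protein plus a stop codon needs 1500 bp, so the remaining 107 bp would lie outside the open reading frame.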
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.886863 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTTATGACCG CGCCATTGGA TGTCAAGTAC GATGTTATTG TCATCGGTGC CGGAGTTGTA GGACCGGCCA TCGCCACAGC CTTAGCTAGA CAGGGCAGAA AGGTATTGAT TGTAGAGAGA GACTGGGCCA AGCCCGACAG AATTGTCGGG GAGTTGATGC AGCCGGCCGG GGTCAAGGCT CTCAGAGAGT TGGGGATGAT TCTGGCCATC AACAACATCG AGGCTTTCGA CTGTAGAGGT TACTACATCA AATACTTCAA CAAGAACATC CAGATCCCAT ACCCATTAAA GGAAGATACA GCTAGAACGA ACCCTGTGAA ACCGGTGGCT GATTGTGTCA GAGATGGCAA CGACAAGTTA CAGGAAGACT CGACTTTGAG TCCTTCCGAA TGGGACGAAG ACGAACGTGT CAGAGGAGTT GCTTTCCACC ATGGAGACTT CTTACAGAAC TTGAGACAGA TTGTGAAAGA CGAGCCCAAC GTAGAATGGG TCGAGGGAAA TGTCGTCAAG ATCTTGAGAG ACAATTTCGA CTCCACTATT GTGACTGGAG TTAGGGTTAA GGAACTGGTT GGCTCAAAGG ACTACCATGC CAAGTTGACC ATCTGCTGTG ACGGTATCTA TTCCAAGTTC AGAAAGGAGT TGTCCAGCAA CAATGTACCA ACCATAGGCT CGTACTTCAT AGGGTTATAC ATGACCGACT GTAAATTGCC AGCCAAGCAC CACGGCCACG TTATCTTGGG GGACCAGGCC CCCGTGATTG TGTACCAGAT CTCTCCTACC GAGACAAGAA TCTTGTGTGC CTATAGATCG TTGAAACCAC CTTCGCGTGC CAACAACGAG TTGTACAACT ATTTGAAGAA CGAGGTCTTG CCCGTGTTGC CCGAAGAAAC CAAGCCGGCT TTCGAAATCG CCTTGGAAGG AGGCAAGTAC AGAGTCATGC CTAACCAATA CTTGCCAGCC AGAAGACAGG GCAGCAGCCA GCACAAGGGA TTGGTAATGT TGGGAGACTC CCTAAATATG AGACATCCTC TCACCGGTGG TGGTATGACG GTTGGTTTGA ACGATGCGGT GTTGTTGGCT AAGTTGTTGC ATCCTGCGTA CATTGCTGAT TTTGAGGAGT ATGACGAAGT GTCCCGGATA TTGAAGCAGT TCCACAGAAA GAGAAAGAAT CTCGACGCCG TCATCAATAC CTTGTCGGTT GCCTTGTACT CGTTGTTTGC AGCCGACAAA AATTCGCTCA AGATCTTGCA AAGAGGCTGT TTCCAGTACT TCTTGTTAGG AGGGTCCTGT GTAACCGGCC CCATTGGGTT GTTGTCAGGA ATGTTGCCCT TCCCAATGTT GTTGTTTAAC CACTTCTTCA GCGTCGCTTT TTATGCTATC TACTGTAACT TCAAAGACAG AGGCTTGGCT GGATTCCCTA TTGCTCTCTT GGAAGCCTTT GCTGTATTCT TTACTGCTGT AATCGTATTT GCTCCCTACT TGTGGCTTGA GTTAGTAGCC TAGACTTCCA GAGAACATCT GTCTGTGGTG AAGTGAAACG AAAACACATA GAGAAAAGAG CTATACTCAT ATAGACATAC ATGTATATTT AATTAACGTT GAGCTCC
|
Protein sequence | MTAPLDVKYD VIVIGAGVVG PAIATALARQ GRKVLIVERD WAKPDRIVGE LMQPAGVKAL RELGMISAIN NIEAFDCRGY YIKYFNKNIQ IPYPLKEDTA RTNPVKPVAD CVRDGNDKLQ EDSTLSPSEW DEDERVRGVA FHHGDFLQNL RQIVKDEPNV EWVEGNVVKI LRDNFDSTIV TGVRVKESVG SKDYHAKLTI CCDGIYSKFR KELSSNNVPT IGSYFIGLYM TDCKLPAKHH GHVILGDQAP VIVYQISPTE TRILCAYRSL KPPSRANNEL YNYLKNEVLP VLPEETKPAF EIALEGGKYR VMPNQYLPAR RQGSSQHKGL VMLGDSLNMR HPLTGGGMTV GLNDAVLLAK LLHPAYIADF EEYDEVSRIL KQFHRKRKNL DAVINTLSVA LYSLFAADKN SLKILQRGCF QYFLLGGSCV TGPIGLLSGM LPFPMLLFNH FFSVAFYAIY CNFKDRGLAG FPIALLEAFA VFFTAVIVFA PYLWLELVA
|
| |
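The record lists translation table 12, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on a short fragment copied from the gene sequence above (the codons for residues 64 to 70 of the protein, `GMISAIN`). It is a simplified illustration, not the tool that produced this record: the codon tables are built by hand from the standard TCAG ordering, and the names `STANDARD`, `TABLE_12`, and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = ["".join(triplet) for triplet in product(BASES, repeat=3)]
STANDARD = dict(zip(CODONS, AMINO_ACIDS))

# Table 12 (alternative yeast nuclear code): CTG encodes Ser instead of Leu.
TABLE_12 = dict(STANDARD)
TABLE_12["CTG"] = "S"


def translate(dna: str, table: dict) -> str:
    """Translate an in-frame DNA string codon by codon; trailing bases are ignored."""
    dna = dna.upper().replace(" ", "")
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# In-frame fragment copied from the gene sequence; it contains one CTG codon.
FRAGMENT = "GGGATGATTCTGGCCATCAAC"

print(translate(FRAGMENT, TABLE_12))  # agrees with the listed protein sequence
print(translate(FRAGMENT, STANDARD))  # differs at the CTG codon
```

With table 12 the fragment translates to `GMISAIN`, as in the protein sequence above; with the standard code the CTG codon would give leucine instead (`GMILAIN`).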