Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_61325 |
Symbol | ERG27 |
ID | 4840378 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 839135 |
End bp | 840175 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640391693 |
Product | 3-keto-steroid reductase |
Protein accession | XP_001385525 |
Protein GI | 150866056 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.406154 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00397596 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGTCACTTA TTAAAGAGGG TCAAGTTGCC GTGATAACCG GGACGTCATC GAACCTTGGT TTGAACATCG CGTACCGATT GGTAGAGCAA ACAGATCCCA AGACCAATTT GACATTAGTA GTCACCTCAA GAACACTTCC CAAAGTTAAA GACATCATCA CAACTATCAA AGACTATGTG GTAGAACACT TTCCAGAAAG AGTTTCGAAA ATTGAATTTG ACTATTTGTT GGTGGACTTC GTAGACATGG TTTCTATATT ATCTGCCTAC TATGAGTTAA ACAAGAGGTA TCGTCACATT GACTACTTGT TTGTAAATGC TGCACAGGGA GTCTACGGTG GTATTGACTG GATTGGTGCT ACCAAGGAGA TTCTCACCGA TCCGATCGAA GGTGTGACCC ATCCAACGTA CAAAATCCAA AAGGTTGGTG TCAAGTCTAG TGATGGTATG GGATTGGTTT TCCAAGCTAA CGTGTTTGGA CCTTACTATT TTATCCATAG AATCAAACAC TTGTTGCAAA ATGGCGGACG TATAGTATGG ATATCTTCCA TTATGTCTAA GCCAAAATAT TTATCGTTCA ACGATCTCCA ATTGATCAAG TCTCCTGAAT CGTACGAGGG CTCCAAGAGA TTGGTGGATT TATTGCACTT TGGTACATAC AAAACATTGG CGAAAGACTA TAACATTCAA CAACTGCTTG TCCATCCAGG TATATTCACC AGTTTTTCGT TTTTCCAATA CTTGAATTTC TTCACCTATT ATTCAATGTT GGTGTTGTTC TATATTGCAA GATGGTCTGG ATCTCCTTAC CATAACATAT CAGGTTATAT AGCCGCCAAT GCTCCTGTCA AGTGTGCCGT TGGAAAGGAA AAGCAAGATG TTAAAGTAGA ATCTTGCAGT ACAAGATACG GTCAAGAATA CATTCGCTAC CAGGAAATAG ACTCAACTGG TTCCGAGGAT GTGGTTGCTT ACCTCGACAA ACTTGTACAA GAATGGGACG TTAAGTTAAA GGACCAGATA ACGAGTACTA GACTTCCTTG A
|
Protein sequence | MSLIKEGQVA VITGTSSNLG LNIAYRLVEQ TDPKTNLTLV VTSRTLPKVK DIITTIKDYV VEHFPERVSK IEFDYLLVDF VDMVSILSAY YELNKRYRHI DYLFVNAAQG VYGGIDWIGA TKEILTDPIE GVTHPTYKIQ KVGVKSSDGM GLVFQANVFG PYYFIHRIKH LLQNGGRIVW ISSIMSKPKY LSFNDLQLIK SPESYEGSKR LVDLLHFGTY KTLAKDYNIQ QSLVHPGIFT SFSFFQYLNF FTYYSMLVLF YIARWSGSPY HNISGYIAAN APVKCAVGKE KQDVKVESCS TRYGQEYIRY QEIDSTGSED VVAYLDKLVQ EWDVKLKDQI TSTRLP
|
| |