Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_54590 |
Symbol | |
ID | 4836866 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 1252464 |
End bp | 1255169 |
Gene Length | 2706 bp |
Protein Length | 901 aa |
Translation table | 12 |
GC content | 46% |
IMG OID | 640388181 |
Product | predicted protein |
Protein accession | XP_001383009 |
Protein GI | 126132968 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.900491 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCTAT TATCCTTTAA AGACAAGGTC GTGATCATCA CCGGTGCCGG TGGTGGTTTG GGTAAGCAAT ACTCGTTAGA ATTCGCAAAG AGAGGCGCCA AGGTTGTTGT CAACGACTTG GGTGGTTCTC TTAGCGGTCA GGGAGGAAAC TCTAAGGCTG CTGATGTCGT TGTTGACGAA ATCAGAAAGG CTGGAGGTAT TGCTGTGGCC GACTACAACA ATGTGCTTGA TGGTGACAAG ATTGTTGAAA CCGCTGTCAA GAACTTTGGT ACCGTCCACG TTGTCATTAA CAACGCCGGT ATCTTGAGAG ACGCCTCGTT CAAGAAGATG CAAGAGAAGG ACTTCCAACT TGTCCTTGAT GTCCATCTCA ATGGTGCCTT CAAGGTGACC CAGGCTGCTT GGCCATACTT CAGAAAGCAG AACTACGGTA GAATCGTCAA CACTGCTTCC CCAGCTGGTT TGTACGGTAA CTTTGGCCAA ACCAACTACT CTGCTGCCAA GGCTGGTTTA CTTGGTTTCG CTGAGACATT GGCTAAGGAA GGTGGCAAGT ACAACATTCT TGCCAACACC ATTGCTCCAT TGGCCCGTTC AAGAATGACC GAATCTATCA TGCCTCCATC TATCTTGGAA AAGATGAGTC CAGAAAAGGT CGCTCCTATT GTGCTCTACT TGGCTTCCCA GAACAACGAG ATTACTGGAC AAACTTTTGA AATTGCTGCT GGCTTTTACT CTCAAATTAG ATGGGAAAGA TCTGGTGGTG TTCTCTTCAA GCCAGACCAA TCTTTTACTG CCGAATCTGT GGCTAAGAGA TTTGACGAAA TCTTGAACTT TGATGATTCC AAGAAGCCAG AGCTCTTAAA GAATCAACAC CCTAACTTGT TGAACGATTA CCTCAGTTTG ACGGAAGCTG CTAGAGCATT GCCTCCTAAT GATAACAGCG GAGCTCCTAC CGTTTCATTG AAGGGTAGAG TCGTATTGAT CACAGGTGCC GGTGCCGGTT TGGGCAAAGA GTATGCCAAG TGGTACGCCA GGTACGGTGC CAAGGTCGTG GTTAATGACT TTAAAGACGC TTCCAAGACT GTGGAAGAGA TTAAGGCTGC TGGTGGTGAA GCCCATGTAG ACCACCATGA TGTTGCTACC CAGGCCGAGG CTATCATCAA GAATGTCATT GACAAATACG GAACCATTGA CATCTTGGTT AACAACGCTG GTATATTGAG AGACAGATCT TTCGCCAAGA TGAGCTACGA TGAATGGATT CAAGTTCAGA AGGTCCACTT GTTAGGTACT TTCAACTTGA CTAGATTGGC CTGGCCATAC TTCTCTGAGA AGAAGTTCGG TAGAGTCATC AACATCACCT CTACCTCTGG TATCTATGGT AACTTCGGAC AAGCAAACTA CTCTTCGGCC AAGGCTGCTA TCTTGGGTTT GACAAAAACC ATTGCCATCG AAGGTGCCAA GTCCAACATT AAGGTTAACG TTGTTGCTCC TCATGCTGAA ACTGCTATGA CCTTGACTAT TTTCAGAGAA GAAGACAAGA ACTTGTACCA CCCAGATGCT GTCGCTCCTT TGTTGGTCTA TTTGGCTAGT GAAGATGTTC CAGTTACCGG TGAAATCTTT GAAATCGGTG GAGGCTGGAT CGGTAACACC AGATGGCAGA GATCTAAGGG TGTTGTAGCT AAAACTGATG CAGACGTGAC TGCTGAATTC GTCAATGCTC ACATTGAAGA AATCATCGAC TTCGAATCGG GCTCCACCAA CCCAAAGAGC ACCACCGAGT CTTCTATGGA GATCTTGTCT GCTATTGGTG GCGACGACGA TGAAGATGAA GATGAGGAAG AAGAAGAAGA TGAGGACGAT GACGATGACA TTGTGGACCC TGTCTGGCAC TACAATGACA GAGACGTCAT TCTCTATAAC ATCGCGCTTG GTGCAACTGC TAAAGAGTTG AAGTACGTCT ACGAAAACGA CGCCGACTTC CAGGTCATTC CAACCTTTGG TCACTTGGCT ACGTTCAATT CTGGCCAATC TCAGCTTACG TTCGCAAGAT TGTTGAAGAA TTTCAACCCT ATGCTTTTGT TGCATGGTGA ACACTACATC AAGCTTCATA AGTTCCCTGT CCCAGTTGAA GCTTCCATCA AGACTACCTT CCAACCTATC AACATTACCC AGAAGGGCAC CAACACCGTC GTTGTACATG GTTCTAAGTC TACTGATGCC ACCACCGGTG AAGTTGTGTT TGAAAACGAG GCCACATTCT TTATCAGAAA GTGTGAAGGT AAGAATAAGA CCTATGCTGA AAGACGTAAA TTTGCAACTC TTCCTTTCAC TGCTCCAACC TCTGCTCCTG ACTTTGTCAC TGAAATTAAG ATCTCAGAAG ACAAGGCATC TTTGTACAGA TTGACAGGTG ACAGAAACCC ATTGCACATT GACCCTAACT TTGCCAAGGG TGCCAAGTTC GACAGACCTA TTTTGCACGG TATGGCTACC TACGGTTTGT CTGCAAAGGT TTTGTTGGAC AAGTTTGGTC CTTTTGATGA AATCAAGGCA AGATTTACTG GAATTGTTTT CCCTGGTGAA ACTTTGAAGG TTTTGGCATG GAAGCAAGGT GATGTGGTTA TTTTCCAATC TCACGTTGTC GAAAGAGGTA CCATCGCTAT CAACAACGCT GCTATCAAGT TAATCAACAA TACTGCCAAC CTTTAG
|
Protein sequence | MSLLSFKDKV VIITGAGGGL GKQYSLEFAK RGAKVVVNDL GGSLSGQGGN SKAADVVVDE IRKAGGIAVA DYNNVLDGDK IVETAVKNFG TVHVVINNAG ILRDASFKKM QEKDFQLVLD VHLNGAFKVT QAAWPYFRKQ NYGRIVNTAS PAGLYGNFGQ TNYSAAKAGL LGFAETLAKE GGKYNILANT IAPLARSRMT ESIMPPSILE KMSPEKVAPI VLYLASQNNE ITGQTFEIAA GFYSQIRWER SGGVLFKPDQ SFTAESVAKR FDEILNFDDS KKPELLKNQH PNLLNDYLSL TEAARALPPN DNSGAPTVSL KGRVVLITGA GAGLGKEYAK WYARYGAKVV VNDFKDASKT VEEIKAAGGE AHVDHHDVAT QAEAIIKNVI DKYGTIDILV NNAGILRDRS FAKMSYDEWI QVQKVHLLGT FNLTRLAWPY FSEKKFGRVI NITSTSGIYG NFGQANYSSA KAAILGLTKT IAIEGAKSNI KVNVVAPHAE TAMTLTIFRE EDKNLYHPDA VAPLLVYLAS EDVPVTGEIF EIGGGWIGNT RWQRSKGVVA KTDADVTAEF VNAHIEEIID FESGSTNPKS TTESSMEILS AIGGDDDEDE DEEEEEDEDD DDDIVDPVWH YNDRDVILYN IALGATAKEL KYVYENDADF QVIPTFGHLA TFNSGQSQLT FARLLKNFNP MLLLHGEHYI KLHKFPVPVE ASIKTTFQPI NITQKGTNTV VVHGSKSTDA TTGEVVFENE ATFFIRKCEG KNKTYAERRK FATLPFTAPT SAPDFVTEIK ISEDKASLYR LTGDRNPLHI DPNFAKGAKF DRPILHGMAT YGLSAKVLLD KFGPFDEIKA RFTGIVFPGE TLKVLAWKQG DVVIFQSHVV ERGTIAINNA AIKLINNTAN L
|
| |