Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_51972 |
Symbol | GBD1 |
ID | 4851102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 939116 |
End bp | 940201 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | |
GC content | 43% |
IMG OID | 640392810 |
Product | flavonol synthase |
Protein accession | XP_001387413 |
Protein GI | 126274088 |
COG category | [R] General function prediction only |
COG ID | [COG3491] Isopenicillin N synthase and related dioxygenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.842126 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTCC TGTCCAACAT AAGACCTTGG CAATCGCCAG AAGAAACTCA GGAAGATCTA GACTGGGCAA ATCTTACCAT TATTGACTTG GCCCACTTTG ATGAACCTGG CCAGAAGCAA ATTCTTGCCA ATCAGCTAAA GGAGGCTCTG AACTCAGATG GTTTCTGGGC TGTGATTAAC GGAGGATTTG ACCAAAATGA TATCAATGAA GCATTTGCCT ATGGAAGAAG TTTCTTCGAA GATTACACTG AAGAAGAGAA GAAAGCGTTG GAAGTAGATT TCACCACTGG AAACTATTTT GGCTACAAAG TCCGTGGAAA CAAGCCTGTT TTTGGCACCC AAGTCAGAGA CAACACCGAA ACCCTCAACA TTGCCAAATT CACCAAAGAT GACACATTTG CCGAATACCA CAAGAACAAC TTCATCCAAA ATAACCATGA TAAGTTAGCC CAACTCTCTC GGAAGGTGTT TGAAGTTGCC CGTAAGTTAT TTATATTGTT TGCCATTATT CTCGAACTAG ATGAGAATTA TTTTGTTGAT CGTCATCTTT ATGACGATCC CAGTGACGAT CTGCTTCGGT TCATGAAATA TCACCCAAGA ACAAAAGAAG AAGACGCTCA GGTAGAGAAC ATATGGGCAA GAGCACATAC TGACTTTGGG AGTTTGACCC TATTGTTCAA CCAGGTGGTA GCTGGCCTAC AGATCAAGTT GGCTGACGGA GAATGGAAGT ATGTCAAACC GGTCACTGGT GGACTTATCT GCAACATCGG AGATACTTTA AATTTCTGGT CTGGAGGATA TTTCAAGACC ACTATTCACA GAGTAGTGAG ACCTCCTGAA GATCAGGTTA ATGCACCTAG AATTGGAGCC TTCTATTTCG TTCGTCCAGG AGACAAAGCC CAAATACAAA TTGCCCCATC TCCGTTATTG AAGCGTTTAG GGTTATACAG AGAAACCGAA CCTATTGGTG GTACGGAATA CGTGAGAAAG AGAGTCAAGG ATTACCATGA CGTGAAAGGT TATAATAAGC AGGCCGACAA GGTATTCAAG TTGGGAGAGT TTGAGGTTAT TGACGGTTTT AATTAG
|
Protein sequence | MTVLSNIRPW QSPEETQEDL DWANLTIIDL AHFDEPGQKQ ILANQLKEAL NSDGFWAVIN GGFDQNDINE AFAYGRSFFE DYTEEEKKAL EVDFTTGNYF GYKVRGNKPV FGTQVRDNTE TLNIAKFTKD DTFAEYHKNN FIQNNHDKLA QLSRKVFEVA RKLFILFAII LELDENYFVD RHLYDDPSDD LLRFMKYHPR TKEEDAQVEN IWARAHTDFG SLTLLFNQVV AGLQIKLADG EWKYVKPVTG GLICNIGDTL NFWSGGYFKT TIHRVVRPPE DQVNAPRIGA FYFVRPGDKA QIQIAPSPLL KRLGLYRETE PIGGTEYVRK RVKDYHDVKG YNKQADKVFK LGEFEVIDGF N
|
| |