Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_82625 |
Symbol | GAL1 |
ID | 4838280 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 420273 |
End bp | 422036 |
Gene Length | 1764 bp |
Protein Length | 518 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640389595 |
Product | galactokinase |
Protein accession | XP_001383712 |
Protein GI | 150864747 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0153] Galactokinase |
TIGRFAM ID | [TIGR00131] galactokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.560809 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAAT CTGTAGTACC TACATTCCAC GACCTATCTT TCTACTCTCG CCCAGATGAG AACAAAGTCA GGTATGAAAA GTTGACTGCT GAGTTCGTCA AGAACTTCGA GGTCAAGCCT GACTTCTTCG CCAGATCGCC TGGAAGAGTT AACTTGATGG GAGACCACAT TGACTACAAT TTCTTTTCGG TCTTGCCTAT GGCTATTGAT GTCGATGTAG TTGCTGCTGT GAAAACTAAC GACTCTGGAG AAATGGTTAT TACCAATACA AACAGTGCCA ATTTCAAGAA GGAAACCATA AAGCTTCCTG AAGATGGAAG TGTTGTTTCC ATAGAAAAGG AACATTTCTC GTGGGCAAGT TACTTCAGTT GTAGTTTGAT TGTTGCCCAC AAGTACATCA TGGAAAAGTA TCCCGAGAAG GTTGCTGGAG GATCCAAGCC ATTGAAGGGC TTGTACATTA CTTTCGACGG CACTGTCCCT ACTGGTGGTG GCTTGTCTTC ATCAGCTGCT TTCTGTGTTG CATCCACTTT GGCCGTGTTG AAAGCTAATG GTGTGGATAG TATCAGTAAG GCAGACTTGA CTAGAATCAC TGTTGTTAGT GAGCACTACG TTGGAGTCAA CACTGGTGGT ATGGACCAAT GTGCTTCTGT ATATGGTGAA GCTTCCAAGG CATTGTTAAT TCATTTCAGA CCTAAGTTGA TTGGTATTCC ATTTGGATTC CCAAAGATCG CTGAAGACGA CGAACTCACA TTTTTGATTT CCAACAGTTT GCAAGTTTCC AACAAGCACG AAACTGCTCC TATCCACTAC AATTTGAGAG TTGTGGAAAT GGCCATTGCT TCTGACATCT TGGCAAAGAA GTTGAACCTC AACCCTCCCA GAGATTCTAA CGTCTCAACT GCTACTTTAA GAGGTGTCTT GGACTCTTAT GTTGCTGAAG TGTTGAAGCA AGAAAAATGG GACGCTGACG ACTACCAGAC TGGTATCAAG CACTTAAACA CTCTTTTGGA GATTGTTGAA AGAGAATCTA TATTTAATGC TGAACAGAAG GTTGGATTCT CTGTAGAAGA AGCAGCAAAG GAGTTAGGAA TTACTGTTGA AGAGTTCAAA GAGAAGTACT TATCCAAGAT TCCAGTTAGA TTCGAAACCT TGAAATTGTA CCAAAGAGCA AAACATGTCT ACGCAGAGAG TTTGAAGGTA TTGGAGTGTT TGCTGTTGTT GGGAGAGTTC AGCAAGTCGT CGAAGAACCC ACAACAATTT CTCGAAGCTT TTGGTAACAT TCTAAACGAG TCGCAAAAGT CTTTGGACTT GTTGAACAAT TCCTCTAACG AGAAGTTGAA CAAGATCTGT GAAATTGCCC TTAAGAATGG TTCCTACGGT TCCAGAGTCA CAGGCGCTGG ATGGGGTGGA TGCATTGTTC ACTTGTCAAC GAGCAAAAAA TTACCACAAT TACAAAAGGC TCTTGTGGAC GAATACTTTA AAGTTGAGTT CCCCGCAATC ACCCAAGCGG AGTTAGATGA GGCCATTATC AACAGTCAAC CTGCTACAGG AAGTTGTGTT GTCTTGTTAG AGTAGATAAA ATATGACATT ACATCTGCAA CTTGCAGCTA GAATACGAAA TACGGGGACA AAAATAGCAC AGCAGCGAGG GAATCCACAA TCTCCAACAA TAAACTTAAA ACTAGCTTAA GCTGACAAGA TTGCTCTGTT ACATAGAAAA AAATATGTGT AGACTTTTAC GATAATCTGA TGATTACGAA ATCT
|
Protein sequence | MSESVVPTFH DLSFYSRPDE NKVRYEKLTA EFVKNFEVKP DFFARSPGRV NLMGDHIDYN FFSVLPMAID VDVVAAVKTN DSGEMVITNT NSANFKKETI KLPEDGSVVS IEKEHFSWAS YFSCSLIVAH KYIMEKYPEK PLKGLYITFD GTVPTGGGLS SSAAFCVAST LAVLKANGVD SISKADLTRI TVVSEHYVGV NTGGMDQCAS VYGEASKALL IHFRPKLIGI PFGFPKIAED DELTFLISNS LQVSNKHETA PIHYNLRVVE MAIASDILAK KLNLNPPRDS NVSTATLRGV LDSYVAEVLK QEKWDADDYQ TGIKHLNTLL EIVERESIFN AEQKVGFSVE EAAKELGITV EEFKEKYLSK IPVRFETLKL YQRAKHVYAE SLKVLECLSL LGEFSKSSKN PQQFLEAFGN ILNESQKSLD LLNNSSNEKL NKICEIALKN GSYGSRVTGA GWGGCIVHLS TSKKLPQLQK ALVDEYFKVE FPAITQAELD EAIINSQPAT GSCVVLLE
|
| |