Gene PICST_82625 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_82625 
SymbolGAL1 
ID4838280 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp420273 
End bp422036 
Gene Length1764 bp 
Protein Length518 aa 
Translation table12 
GC content42% 
IMG OID640389595 
Productgalactokinase 
Protein accessionXP_001383712 
Protein GI150864747 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.560809 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAT CTGTAGTACC TACATTCCAC GACCTATCTT TCTACTCTCG CCCAGATGAG 
AACAAAGTCA GGTATGAAAA GTTGACTGCT GAGTTCGTCA AGAACTTCGA GGTCAAGCCT
GACTTCTTCG CCAGATCGCC TGGAAGAGTT AACTTGATGG GAGACCACAT TGACTACAAT
TTCTTTTCGG TCTTGCCTAT GGCTATTGAT GTCGATGTAG TTGCTGCTGT GAAAACTAAC
GACTCTGGAG AAATGGTTAT TACCAATACA AACAGTGCCA ATTTCAAGAA GGAAACCATA
AAGCTTCCTG AAGATGGAAG TGTTGTTTCC ATAGAAAAGG AACATTTCTC GTGGGCAAGT
TACTTCAGTT GTAGTTTGAT TGTTGCCCAC AAGTACATCA TGGAAAAGTA TCCCGAGAAG
GTTGCTGGAG GATCCAAGCC ATTGAAGGGC TTGTACATTA CTTTCGACGG CACTGTCCCT
ACTGGTGGTG GCTTGTCTTC ATCAGCTGCT TTCTGTGTTG CATCCACTTT GGCCGTGTTG
AAAGCTAATG GTGTGGATAG TATCAGTAAG GCAGACTTGA CTAGAATCAC TGTTGTTAGT
GAGCACTACG TTGGAGTCAA CACTGGTGGT ATGGACCAAT GTGCTTCTGT ATATGGTGAA
GCTTCCAAGG CATTGTTAAT TCATTTCAGA CCTAAGTTGA TTGGTATTCC ATTTGGATTC
CCAAAGATCG CTGAAGACGA CGAACTCACA TTTTTGATTT CCAACAGTTT GCAAGTTTCC
AACAAGCACG AAACTGCTCC TATCCACTAC AATTTGAGAG TTGTGGAAAT GGCCATTGCT
TCTGACATCT TGGCAAAGAA GTTGAACCTC AACCCTCCCA GAGATTCTAA CGTCTCAACT
GCTACTTTAA GAGGTGTCTT GGACTCTTAT GTTGCTGAAG TGTTGAAGCA AGAAAAATGG
GACGCTGACG ACTACCAGAC TGGTATCAAG CACTTAAACA CTCTTTTGGA GATTGTTGAA
AGAGAATCTA TATTTAATGC TGAACAGAAG GTTGGATTCT CTGTAGAAGA AGCAGCAAAG
GAGTTAGGAA TTACTGTTGA AGAGTTCAAA GAGAAGTACT TATCCAAGAT TCCAGTTAGA
TTCGAAACCT TGAAATTGTA CCAAAGAGCA AAACATGTCT ACGCAGAGAG TTTGAAGGTA
TTGGAGTGTT TGCTGTTGTT GGGAGAGTTC AGCAAGTCGT CGAAGAACCC ACAACAATTT
CTCGAAGCTT TTGGTAACAT TCTAAACGAG TCGCAAAAGT CTTTGGACTT GTTGAACAAT
TCCTCTAACG AGAAGTTGAA CAAGATCTGT GAAATTGCCC TTAAGAATGG TTCCTACGGT
TCCAGAGTCA CAGGCGCTGG ATGGGGTGGA TGCATTGTTC ACTTGTCAAC GAGCAAAAAA
TTACCACAAT TACAAAAGGC TCTTGTGGAC GAATACTTTA AAGTTGAGTT CCCCGCAATC
ACCCAAGCGG AGTTAGATGA GGCCATTATC AACAGTCAAC CTGCTACAGG AAGTTGTGTT
GTCTTGTTAG AGTAGATAAA ATATGACATT ACATCTGCAA CTTGCAGCTA GAATACGAAA
TACGGGGACA AAAATAGCAC AGCAGCGAGG GAATCCACAA TCTCCAACAA TAAACTTAAA
ACTAGCTTAA GCTGACAAGA TTGCTCTGTT ACATAGAAAA AAATATGTGT AGACTTTTAC
GATAATCTGA TGATTACGAA ATCT
 
Protein sequence
MSESVVPTFH DLSFYSRPDE NKVRYEKLTA EFVKNFEVKP DFFARSPGRV NLMGDHIDYN 
FFSVLPMAID VDVVAAVKTN DSGEMVITNT NSANFKKETI KLPEDGSVVS IEKEHFSWAS
YFSCSLIVAH KYIMEKYPEK PLKGLYITFD GTVPTGGGLS SSAAFCVAST LAVLKANGVD
SISKADLTRI TVVSEHYVGV NTGGMDQCAS VYGEASKALL IHFRPKLIGI PFGFPKIAED
DELTFLISNS LQVSNKHETA PIHYNLRVVE MAIASDILAK KLNLNPPRDS NVSTATLRGV
LDSYVAEVLK QEKWDADDYQ TGIKHLNTLL EIVERESIFN AEQKVGFSVE EAAKELGITV
EEFKEKYLSK IPVRFETLKL YQRAKHVYAE SLKVLECLSL LGEFSKSSKN PQQFLEAFGN
ILNESQKSLD LLNNSSNEKL NKICEIALKN GSYGSRVTGA GWGGCIVHLS TSKKLPQLQK
ALVDEYFKVE FPAITQAELD EAIINSQPAT GSCVVLLE