Gene PICST_72279 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_72279 
SymbolGPH1 
ID4839086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp1552249 
End bp1555099 
Gene Length2851 bp 
Protein Length896 aa 
Translation table12 
GC content45% 
IMG OID640390401 
ProductReleases glucose-1-phosphate from glycogen 
Protein accessionXP_001384991 
Protein GI150865677 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0058] Glucan phosphorylase 
TIGRFAM ID[TIGR02093] glycogen/starch/alpha-glucan phosphorylases 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATCTAGTCTT CTTCTAGTCT CAATTGATAT ACCCCGCAAC TAACATTCAT CATGGGTGAC 
TATTTGACTC CCTCGGAATT GCCTCAGCCC AAGTTGAAAA GAACCAACAC TGGGTTCACG
CCCAACCAGA TTATCGCCCT CGATGCCTCC ATCCCGGAAC TGGCCAAACA CATCTGGTAC
AAGTACTCGC AGTTGAAGGT GTTGGACTCC AAGAAGCAGA CCGCTACCAC CGAAGACGAT
GTCTTGACCA ACAAGGATGC CTTTGAGGAG CTGTTTGTCA AACATGTGGA AACGACTTTG
GCCAGAAACA TGTACAACTG TGACAACTTG GCTGCCTACC AGGCCACTTC TAACACTATC
AGAGACGCGT TATTGATTGA CTGGGCCAAC ACTCAGCAGA AGCAGACCAT CCAGGATGGA
AAGAGAGTGT ACTACTTATC TTTGGAATTC TTGATGGGAA GAGCCATGGA CAACGCTTTG
ATCAACTTGA AGCTGAGAGA TAACACCAAG AAGTCATTGA CGGAATTGGG TTTCAATTTG
GAAGACGTTT TGGAACAGGA GCCAGACGCT GCCTTAGGTA ACGGTGGTTT GGGTAGATTG
GCTGCCTGTT TCGTTGACTC TCTCTCATCG AAGAACTATT CCGGTTGGGG TTACGGTTTG
AACTACCAGT ACGGTATTTT CAAGCAGAAG ATTGTTGATG GCTACCAGAT CGAAACTCCA
GACTACTGGC TCAAGTACTC AAATCCATGG GAAATCATGA GAAGTGAAAT CCAGATTCCT
GTAGACTTCT ACGGCTATGT GTATGAAGAT CATGACCCAA ACACTGGCAA GGCTAAGAAG
ATTTGGGCTG GAGGTGAGAG AGTTCTTGCT GTGGCTGCCG ATTTCCCAAT TCCTGGTTTC
AACACAGACA ACACCAACAA TTTGAGATTG TGGAATGCCA AGCCCACCGA AGAGTTTGAT
TTCACCAAGT TCAACGCCGG TGACTACCAG CAGTCTGTAG GCGCTCAACA AAGAGCCGAA
TCCATCACCT CTGTCTTGTA CCCTAACGAC AACTTTGAGA GTGGTAAGGA ATTGAGATTG
AAGCAACAGT ACTTCTGGGT CGCTGCTTCA TTGCACGATA TTGTTCGTAG ATTTAAGAAG
AACCACAAGA ACAACTGGAA GAAGTTCCCA GACCAAATCG CCATCCAATT GAATGATACT
CATCCAACTC TTGCCATTGT CGAATTGCAA AGAATTTTGG TAGATTTGGA AGATTTGGAA
TGGAACGAAG CCTGGAACAT CGTCACCAAG GTGTTTGCTT ACACTAACCA CACGGTTATG
TCTGAGGCAT TGGAAAAGTG GCCTGTAGAC TTGCTTGGTA GATTGTTACC TCGTCATTTG
GAAATCATCT ACGATATCAA CTTCTTCTTC TTGAAGGAAG TTGAACGCAA GTTCCCAGAC
GACAGAGACT TATTGGGTAA GGTTTCTGTA ATTGAAGAAA ACGCCCCCAA GTCTGTAAAG
ATGGCCTTCT TGGCTATTGT TGGTTCTCAC AAGGTGAACG GTGTGGCTGA ATTGCACTCA
GAGTTGATCA AGACGACCAT CTTCAAGGAC TTTGTCAAGG TGTTCGGTGA AGACAAGTTC
ACCAACGTCA CCAATGGTAT TACTCCTAGA CGTTGGTTGA GACAAGCCAA CCCAGAGTTG
GCAGCCTTGA TTGCCGAGAA GTTGAACGAT CCAAACTACG AATACTTGAC CAACTTGGGC
CGTTTGAAGA AGTTGGAAGA GTTTGTCAAC GACGACGAAT TCTTGAAGAG GTGGGATATC
GTCAAATTCA ACAATAAGGT GAGATTAGCT GCGTTGGTGA AAGAAACCAC TGGTGTAGTA
TTGGACCCAA CTGTGTTGTT TGATGTTCAG GTCAAGAGAA TCCACGAGTA CAAGAGACAG
CAGTTGAACA TCTTTGCCGT TATCTACCGT TACTTGAAGA TCAAGGAATT GTTACTGCAA
GGCGTTTCTG TTGATGAAAT TAAGGAAAAG TACTACATCC CCAAGGCTTC CATCTTTGGT
GGTAAGGCTG CTCCCGGTTA CTACATGGCC AAGACGATTA TCCACTTGAT CAACAAGGTT
GGAGACATCG TCAACAATGA CCCAGAAATT GGTGACTTGT TGAAGGTTGT TTTCATTCCA
GATTATAACG TGTCCAAGGC AGAAATCATT ACTCCAGGTT CCGATTTGTC CAACCACATC
TCGACTGCTG GTACTGAAGC TTCCGGTACT TCCAACATGA AGTTTGCCTT GAACGGTGGT
TTGATTATTG GTACTGTTGA TGGTGCCAAT GTCGAAATTA CCCGTGAAAT CGGTGAAGAA
AACATCTTCT TGTTTGGTAA CTTGGCTGAG TCTGTTGAAG ATTTGAGACA CAAGCACATC
TATGAAGGTG TGCACATTCC TCAGACTTTG GCCCAGGTAT TCTCTGCAAT TGAATCGGGT
ATCTTCGGCA ATCCAGACGA GTACAAGGCA TTGATCGATG GTATCAAGTA CCACGGCGAC
TACTACTTGG TCAGCGACGA CTTTGAGTTG TTCTTGGCTG CCCACGTGAA GTTGGAAAAG
GTGTTTGGAC ACCATGGAGG AGACGCTAGC GACACCGACC ATTTGCACAA GTGGGTCAAG
AGTGCGGTGT TGAGTGTAGC CAACATGGGT TTCTTCAGTA GTGACAGATG TATCGATGAG
TACGCCGAAG ACATCTGGAA CATCGAGCCT TTGAACCAAT AGAGTGTCCG TAGCGTTTTT
TCAATTCTCT ATCTCTATAC ATTTTACAGC TTCGTCTTGT TTTGAGACGG TTATGATTGC
TTTTAGTTTT TAGTTATACA TGTGTTTGTT G
 
Protein sequence
MGDYLTPSEL PQPKLKRTNT GFTPNQIIAL DASIPESAKH IWYKYSQLKV LDSKKQTATT 
EDDVLTNKDA FEESFVKHVE TTLARNMYNC DNLAAYQATS NTIRDALLID WANTQQKQTI
QDGKRVYYLS LEFLMGRAMD NALINLKSRD NTKKSLTELG FNLEDVLEQE PDAALGNGGL
GRLAACFVDS LSSKNYSGWG YGLNYQYGIF KQKIVDGYQI ETPDYWLKYS NPWEIMRSEI
QIPVDFYGYV YEDHDPNTGK AKKIWAGGER VLAVAADFPI PGFNTDNTNN LRLWNAKPTE
EFDFTKFNAG DYQQSVGAQQ RAESITSVLY PNDNFESGKE LRLKQQYFWV AASLHDIVRR
FKKNHKNNWK KFPDQIAIQL NDTHPTLAIV ELQRILVDLE DLEWNEAWNI VTKVFAYTNH
TVMSEALEKW PVDLLGRLLP RHLEIIYDIN FFFLKEVERK FPDDRDLLGK VSVIEENAPK
SVKMAFLAIV GSHKVNGVAE LHSELIKTTI FKDFVKVFGE DKFTNVTNGI TPRRWLRQAN
PELAALIAEK LNDPNYEYLT NLGRLKKLEE FVNDDEFLKR WDIVKFNNKV RLAALVKETT
GVVLDPTVLF DVQVKRIHEY KRQQLNIFAV IYRYLKIKEL LSQGVSVDEI KEKYYIPKAS
IFGGKAAPGY YMAKTIIHLI NKVGDIVNND PEIGDLLKVV FIPDYNVSKA EIITPGSDLS
NHISTAGTEA SGTSNMKFAL NGGLIIGTVD GANVEITREI GEENIFLFGN LAESVEDLRH
KHIYEGVHIP QTLAQVFSAI ESGIFGNPDE YKALIDGIKY HGDYYLVSDD FELFLAAHVK
LEKVFGHHGG DASDTDHLHK WVKSAVLSVA NMGFFSSDRC IDEYAEDIWN IEPLNQ