Gene PICST_73701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_73701 
SymbolGLK1 
ID4840467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp302512 
End bp304065 
Gene Length1554 bp 
Protein Length471 aa 
Translation table12 
GC content40% 
IMG OID640391782 
ProductGlucokinase 
Protein accessionXP_001386072 
Protein GI126139099 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5026] Hexokinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.63067 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0118903 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AAAGATTGTA ACGCATTTAC CCTTTCTTTA AAAATGAATA TTGAATTAGA AGCTGCTGTT 
GAGGAAATTG TCAAGCAATT CGCTATTGAC AAGGATTTCC TTGTCGAAGC AACTAAACAT
TTTCACGAAT CCATGAGCGC TGGTTTAGCA ACTTCTACCC CAACTAGAGA TTACATGCCT
ATGATTCCAA CTTATGTAAC TGGAATTCCA ACAGGAAAAG AAAAGGGATT GTACTTGGCC
GCAGACTTGG GTGGTACCAA CTTTAGAGTA TGCTCCATCC ACTTGGGTGG TGACCACACA
TTTGAAATGA AGCAGTCCAA GTACAAGATT CCTGTGGATT TGATGCAAGG TGAAGATGCC
ACTGCCGATG GTTTGTTCAA CTATTTGGCA GAAAAGGTAA AAACTTTCTT GGACCAGCAT
CACAATGAAC ATGCTGAACA GTTGAAATTG GGCTTTACCT TTTCTTTCCC AGTGAACCAA
ACTGCCTTGA ATAGAGGAAC ATTGATTCGT TGGACCAAGG GTTTTGATTT GCCTGATTGT
GTAGACAAGG ATGTCGTCGA ATTATTGCAG AAGCATATGG AATTGTTAGG TGTCAAGGTT
CATGTTGCTG CTTTGGCAAA TGATACTGTT GGTACCTTAT TGTCTAGAGC ATACTCTAAT
GATATTTCCA AGACAAATTC CAATACCGTT GTTGGTGCCA TCTTTGGCAC AGGAACGAAC
GGAGCTTACT TTGAAACTTT GAAAAACATT CCAAAGTTGA AGAAGGAAGA TATACCTGAA
GGAGCCAAGG GTATGGTGAT TAACACTGAG TGGGGTTCGT TCGACAATAC ATTGAAGATT
TTGCCTTGTA CCAAGTACGA TAAACTTGTT GACGATGAGA CTGCAAACGT CGGCTATCAC
TTATTTGAAA AGAGAATCAG CGGTATGTTT TTGGGTGAAT TGTTGAGAGT TGCCTTAATG
GATTTATTCG ACCGTGGCTT GATCTTCCAG GAATTGTACA AGGCTAGAGG TGGTACCTTG
CCCCACAGAA TTTTCGAACC ATGGCTTATT TCTGCAGAGG TGTTATCTTA TTTACAAATT
GATGATTCCA CTGACTTGAA GATGTCAGAA TTGGTCTTGG AAAACCACTT GAGATTGCCA
ACCAACAAAG AAGAAAGGCT TGTTATTCAG AAATTGACTC AGTCAATTTC ACATAGGGCT
GCATATCTTT CAGCTATTCC ATTGGCATCA ATTGTTGCTC GTGTTCAGGA TCAATATAAG
GATGACGATA GAGATTTCGA ATTTGGTTGC GATGGTTCCG TTGTTGAGTT CTATCCTGGC
TTCAGATCAA AGATTTTGGA AGCAGTTGCT TTGATTGACC CATTGAAAGG TTCTTCGAAA
AAGATCCACC TCAGAATTGC CAAGGATGGA AGTGGAGTCG GAGCAGCATT GTGTGCAAGT
GTCTCCTAAT TCATCTAATA ATGAAACATT ACGACTTCTT GTATGAGCCA TATCTCTATC
TAAAAGCAAA TGCATATTTT CTAGTTTGTA ATTTTTATAA ATGTTTGACT TCGT
 
Protein sequence
MNIELEAAVE EIVKQFAIDK DFLVEATKHF HESMSAGLAT STPTRDYMPM IPTYVTGIPT 
GKEKGLYLAA DLGGTNFRVC SIHLGGDHTF EMKQSKYKIP VDLMQGEDAT ADGLFNYLAE
KVKTFLDQHH NEHAEQLKLG FTFSFPVNQT ALNRGTLIRW TKGFDLPDCV DKDVVELLQK
HMELLGVKVH VAALANDTVG TLLSRAYSND ISKTNSNTVV GAIFGTGTNG AYFETLKNIP
KLKKEDIPEG AKGMVINTEW GSFDNTLKIL PCTKYDKLVD DETANVGYHL FEKRISGMFL
GELLRVALMD LFDRGLIFQE LYKARGGTLP HRIFEPWLIS AEVLSYLQID DSTDLKMSEL
VLENHLRLPT NKEERLVIQK LTQSISHRAA YLSAIPLASI VARVQDQYKD DDRDFEFGCD
GSVVEFYPGF RSKILEAVAL IDPLKGSSKK IHLRIAKDGS GVGAALCASV S