Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_73701 |
Symbol | GLK1 |
ID | 4840467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | + |
Start bp | 302512 |
End bp | 304065 |
Gene Length | 1554 bp |
Protein Length | 471 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640391782 |
Product | Glucokinase |
Protein accession | XP_001386072 |
Protein GI | 126139099 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5026] Hexokinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.63067 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0118903 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AAAGATTGTA ACGCATTTAC CCTTTCTTTA AAAATGAATA TTGAATTAGA AGCTGCTGTT GAGGAAATTG TCAAGCAATT CGCTATTGAC AAGGATTTCC TTGTCGAAGC AACTAAACAT TTTCACGAAT CCATGAGCGC TGGTTTAGCA ACTTCTACCC CAACTAGAGA TTACATGCCT ATGATTCCAA CTTATGTAAC TGGAATTCCA ACAGGAAAAG AAAAGGGATT GTACTTGGCC GCAGACTTGG GTGGTACCAA CTTTAGAGTA TGCTCCATCC ACTTGGGTGG TGACCACACA TTTGAAATGA AGCAGTCCAA GTACAAGATT CCTGTGGATT TGATGCAAGG TGAAGATGCC ACTGCCGATG GTTTGTTCAA CTATTTGGCA GAAAAGGTAA AAACTTTCTT GGACCAGCAT CACAATGAAC ATGCTGAACA GTTGAAATTG GGCTTTACCT TTTCTTTCCC AGTGAACCAA ACTGCCTTGA ATAGAGGAAC ATTGATTCGT TGGACCAAGG GTTTTGATTT GCCTGATTGT GTAGACAAGG ATGTCGTCGA ATTATTGCAG AAGCATATGG AATTGTTAGG TGTCAAGGTT CATGTTGCTG CTTTGGCAAA TGATACTGTT GGTACCTTAT TGTCTAGAGC ATACTCTAAT GATATTTCCA AGACAAATTC CAATACCGTT GTTGGTGCCA TCTTTGGCAC AGGAACGAAC GGAGCTTACT TTGAAACTTT GAAAAACATT CCAAAGTTGA AGAAGGAAGA TATACCTGAA GGAGCCAAGG GTATGGTGAT TAACACTGAG TGGGGTTCGT TCGACAATAC ATTGAAGATT TTGCCTTGTA CCAAGTACGA TAAACTTGTT GACGATGAGA CTGCAAACGT CGGCTATCAC TTATTTGAAA AGAGAATCAG CGGTATGTTT TTGGGTGAAT TGTTGAGAGT TGCCTTAATG GATTTATTCG ACCGTGGCTT GATCTTCCAG GAATTGTACA AGGCTAGAGG TGGTACCTTG CCCCACAGAA TTTTCGAACC ATGGCTTATT TCTGCAGAGG TGTTATCTTA TTTACAAATT GATGATTCCA CTGACTTGAA GATGTCAGAA TTGGTCTTGG AAAACCACTT GAGATTGCCA ACCAACAAAG AAGAAAGGCT TGTTATTCAG AAATTGACTC AGTCAATTTC ACATAGGGCT GCATATCTTT CAGCTATTCC ATTGGCATCA ATTGTTGCTC GTGTTCAGGA TCAATATAAG GATGACGATA GAGATTTCGA ATTTGGTTGC GATGGTTCCG TTGTTGAGTT CTATCCTGGC TTCAGATCAA AGATTTTGGA AGCAGTTGCT TTGATTGACC CATTGAAAGG TTCTTCGAAA AAGATCCACC TCAGAATTGC CAAGGATGGA AGTGGAGTCG GAGCAGCATT GTGTGCAAGT GTCTCCTAAT TCATCTAATA ATGAAACATT ACGACTTCTT GTATGAGCCA TATCTCTATC TAAAAGCAAA TGCATATTTT CTAGTTTGTA ATTTTTATAA ATGTTTGACT TCGT
|
Protein sequence | MNIELEAAVE EIVKQFAIDK DFLVEATKHF HESMSAGLAT STPTRDYMPM IPTYVTGIPT GKEKGLYLAA DLGGTNFRVC SIHLGGDHTF EMKQSKYKIP VDLMQGEDAT ADGLFNYLAE KVKTFLDQHH NEHAEQLKLG FTFSFPVNQT ALNRGTLIRW TKGFDLPDCV DKDVVELLQK HMELLGVKVH VAALANDTVG TLLSRAYSND ISKTNSNTVV GAIFGTGTNG AYFETLKNIP KLKKEDIPEG AKGMVINTEW GSFDNTLKIL PCTKYDKLVD DETANVGYHL FEKRISGMFL GELLRVALMD LFDRGLIFQE LYKARGGTLP HRIFEPWLIS AEVLSYLQID DSTDLKMSEL VLENHLRLPT NKEERLVIQK LTQSISHRAA YLSAIPLASI VARVQDQYKD DDRDFEFGCD GSVVEFYPGF RSKILEAVAL IDPLKGSSKK IHLRIAKDGS GVGAALCASV S
|
| |