Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNH01400 |
Symbol | |
ID | 3259022 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | + |
Start bp | 774116 |
End bp | 777099 |
Gene Length | 2984 bp |
Protein Length | 557 aa |
Translation table | |
GC content | 49% |
IMG OID | 638258343 |
Product | hexokinase, putative |
Protein accession | XP_572328 |
Protein GI | 58270344 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5026] Hexokinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.537283 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTTGCTCTC CCCCGCCAAT ACACACTGCA GTGAGTTCCC CCTCCCGTGT GGAACCTGCC CCCCTTCCTT TCCCAGGCAC CGCACGTGGC CACGGCTCAC ACCCTCCTTT GTCCCAGCGC CATTCATTAC ACTTAACATC CACGTGAGTA TTCCGTTTCC GCCATCCCCT GCGAGTATTC CGTTCCCCAT CCCCCGCTGA CAAACACAAT GGCCGCCACC CTCCGCCAGC GCACAGCCAT GTCCGAGACC CCCGACCAGC TCGCAACCCG CGTCCAGAAC CTTAGCACGG CGCCTACAAC CACCCAGAGG ACCGGCTCTG GCTCTGCCGT TTTGGAGAAC GGTACAAACA TTGGCTCCGC CGCTGGCCGC AAGGCCTCGA TCCCTGCTCA GGTAGACCCC GAGAGAATCA TCAGCACCAG TGGCAGTGGC CGAACCAGCA GGCGAGGCAG TGGTTTGGTG ATGACTCCAG GAGGCGTTCA GACTGTATAC CACACCAGGA CAAATGTATG TCACAAGCAC CGAGACGCGC AATGGATCTG ACCCGAATGT CGCTGTAGGA GGATATCGAA TTCCCTCATG CTGGGAAGAG TGAGTTGATA ACCGAGGATG GGCGTTTTAG CCCAGTCGCC TAATCATTAT CTGCAGAGAC TATGGCCGAT CTCTTGAGGA AGTACGAAAG TCTTTTTACC CGTAAGCCGA TGTTCCACCG CCGAGACTAG ACTGCAAACC TGAGATCCTT GCAGTGACTC CCCAGAGGAT GAGGATGATC GTCCATGCCA TTGAAGAGAC TCTTGATAAC GGGTTGCAGA AGAATGGACA AGTTGTGGTG AGTCTTGTGA TTCTTCCAAC TAGACTTTCC AAGCTGATCC CCTGTAGCCT ATGAGTGAGT CACAGTCACA CATCGCTTGT GTCTATCCAT GCTGAATGTA GTTACTTTTC CAGTTCCTAC TTATGTGTTT GGCGTATGTT CAATTTTCTC GGTGTGTTCA CATCTTTACT GATATACCTC ACTGTGCATA GTGGCCTACC GGTAACGAAG TCGGGGATTT CCTCGCTCTC GACCTTGGTG GTACCAATCT TCGAGTCTGT CTCGTTACTC TTCTAGGCAG TGGAAAGTTT GAAGTCACTC AGACCAAGTA CCGATTGACC GAGGAACAGA AGCAAGGCGA GGGACAAGCT CTGTGCGTGC CTTGCCATTT CATCCACTGC ATTCCTAACA CGAAGAACAG TCTGGACTTT TGCGCAGAGT GTTTAAACAG CTTCATCCGC GATACCCTCG GCCGCACTGA AAAAGACGGT ATCCTCCCCC TTGGTTTCAC TGTAAGTTGA CTGTTACACC TTCCGTAAAC AATGAAAATT GACCGCCGGA ACCCAATCAG TTCTCCTACC CTTGCTCGTA CGTGCGCAGC TGTAGTAATA ATATTGTGAA GCTCATCTTT TGAAACAGTC AAGACCGAAT TGATCACGGT GGTAAGTAAC CATTTTTTAC GCCTGAATGA CTTCTTATCA TCACGGTACA GTTCTTATCC GTTGGACCAA GGGATTCGGT GCTCCCAACA TTGAAGGATA CGATGTCGCC GCCATGTTCA AGGACAGTCT CAAGCGTATG GTGCGTTGAT CCCTCTTGAA CTTTGTTATT CTTCTAATGC AGACGCCCTT TATAGGACGT CCCCGCGGAA CTCACTGCTC TCATCAATGA CACTACCGGT ACTCTTATCG CCTCCAACTA CGTTGACCCC CACACCAAGA TCGCTGTCAT CTTCGGAACC GGCTGTAACG CTGCCTACAT GGAGACCGCC GGCAGCATCC CCAAGATCGA CTACGTCGGA TTGCCCGAGG AACAGGGAAT GGCTATCAAC GTGAGTAATC CAGCGTCTTT TTTTCTGCAT ATGAGCTCTG GGCAGAGGAG CTGATTAAGA GACGGATGTT GACAGTGTGA ATGGGGAGCG TTCGACTCTT TCGACCACCA ACACCTTCGT GAGTTTCCTT CCCCAATATC CAGCTATTGC TAACAACATT CAAGCCCGAA CCAAGTACGA CATTATCATT GATGAATCTT CCAACAAGCC AGGAGAGCAG GTGAGCAACC TTTTCGTGAT TGATTATATT TTGACTGACC TATTCTGCAG TCCTTTGAGA AGATGATTGC TGGTCTTTAC CTTGGTGAAA TCTTCCGTCT CGTTCTCTGC GAGCTCATCG ATTCTGGTGA CCTTTTCTTG GGTCAGAACA CCTACAAGCT CGAAAAGGCC TATGCTTTCG ACACCGCTTT CTTGTCTCTC ATGGAAGCGT AAGTGCACTT CAAACTGCAA TTTGCCTTTT TTTTTGCTAA AGTCACGTCA TAAGCGATGT TACCGAAGAG CTTTTGACCA TCATCGGTGT CTTTGCTCAC TTCTTCGGCC TTGAAACTAC CCTTGAAGAG CGTCAGTTCT TTAAAAAGCT TGCTGTGTTG GTCGGCACCC GATCTGCTAG GCTTTCTGCG TGTGGTATCG CCGCCATTGT TAGCAAGAAG GGGTACCTCG AAGAAGGATG TGCCGTTGGC GCCGATGGAA GTTTGTACAA CGTATGTTTA TGCCTACACA GCCTTTCTAT ATATTGGATA AGGCTAATAA AAGTGCCAAG AAATACCCCA ACTTTGCGGA CCGAGTTCAT GAGGCGCTTA CGGACATTTT CGGTGAAAGC GGCAAGAAGA TTGTCACCCA CCATGCTGAG GATGGTTCAG GTGTCGGTAG CGCAATCATT GCCGGTGCGT CTCTCCAGTA TCGTCGTGAT TGAGGCAAAA AACTAATTTT GCATCTTCTT TTTTTTCCAG CAATGACCAA GGCCAGGAAG GACTCTGGAT TCTTTGTCGA ATACTAATTT CCTCTTACGG TATTTCTACG TGTCGGTCGG TTGCTGTGTA CCAGATAAAA ATTTCGAACG GAAACGTGGC CAAAACTTTT CATTAGAGGG TTTTAAACAA AGATGTATGA ATGAATATTT AATT
|
Protein sequence | MSETPDQLAT RVQNLSTAPT TTQRTGSGSA VLENGTNIGS AAGRKASIPA QVDPERIIST SGSGRTSRRG SGLVMTPGGV QTVYHTRTNE DIEFPHAGKK TMADLLRKYE SLFTLTPQRM RMIVHAIEET LDNGLQKNGQ VVPMIPTYVF GWPTGNEVGD FLALDLGGTN LRVCLVTLLG SGKFEVTQTK YRLTEEQKQG EGQALLDFCA ECLNSFIRDT LGRTEKDGIL PLGFTFSYPC SQDRIDHGVL IRWTKGFGAP NIEGYDVAAM FKDSLKRMDV PAELTALIND TTGTLIASNY VDPHTKIAVI FGTGCNAAYM ETAGSIPKID YVGLPEEQGM AINCEWGAFD SFDHQHLPRT KYDIIIDESS NKPGEQSFEK MIAGLYLGEI FRLVLCELID SGDLFLGQNT YKLEKAYAFD TAFLSLMEAD VTEELLTIIG VFAHFFGLET TLEERQFFKK LAVLVGTRSA RLSACGIAAI VSKKGYLEEG CAVGADGSLY NKYPNFADRV HEALTDIFGE SGKKIVTHHA EDGSGVGSAI IAAMTKARKD SGFFVEY
|
| |