Gene CNH01400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNH01400 
Symbol 
ID3259022 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006693 
Strand
Start bp774116 
End bp777099 
Gene Length2984 bp 
Protein Length557 aa 
Translation table 
GC content49% 
IMG OID638258343 
Producthexokinase, putative 
Protein accessionXP_572328 
Protein GI58270344 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5026] Hexokinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.537283 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTTTGCTCTC CCCCGCCAAT ACACACTGCA GTGAGTTCCC CCTCCCGTGT GGAACCTGCC 
CCCCTTCCTT TCCCAGGCAC CGCACGTGGC CACGGCTCAC ACCCTCCTTT GTCCCAGCGC
CATTCATTAC ACTTAACATC CACGTGAGTA TTCCGTTTCC GCCATCCCCT GCGAGTATTC
CGTTCCCCAT CCCCCGCTGA CAAACACAAT GGCCGCCACC CTCCGCCAGC GCACAGCCAT
GTCCGAGACC CCCGACCAGC TCGCAACCCG CGTCCAGAAC CTTAGCACGG CGCCTACAAC
CACCCAGAGG ACCGGCTCTG GCTCTGCCGT TTTGGAGAAC GGTACAAACA TTGGCTCCGC
CGCTGGCCGC AAGGCCTCGA TCCCTGCTCA GGTAGACCCC GAGAGAATCA TCAGCACCAG
TGGCAGTGGC CGAACCAGCA GGCGAGGCAG TGGTTTGGTG ATGACTCCAG GAGGCGTTCA
GACTGTATAC CACACCAGGA CAAATGTATG TCACAAGCAC CGAGACGCGC AATGGATCTG
ACCCGAATGT CGCTGTAGGA GGATATCGAA TTCCCTCATG CTGGGAAGAG TGAGTTGATA
ACCGAGGATG GGCGTTTTAG CCCAGTCGCC TAATCATTAT CTGCAGAGAC TATGGCCGAT
CTCTTGAGGA AGTACGAAAG TCTTTTTACC CGTAAGCCGA TGTTCCACCG CCGAGACTAG
ACTGCAAACC TGAGATCCTT GCAGTGACTC CCCAGAGGAT GAGGATGATC GTCCATGCCA
TTGAAGAGAC TCTTGATAAC GGGTTGCAGA AGAATGGACA AGTTGTGGTG AGTCTTGTGA
TTCTTCCAAC TAGACTTTCC AAGCTGATCC CCTGTAGCCT ATGAGTGAGT CACAGTCACA
CATCGCTTGT GTCTATCCAT GCTGAATGTA GTTACTTTTC CAGTTCCTAC TTATGTGTTT
GGCGTATGTT CAATTTTCTC GGTGTGTTCA CATCTTTACT GATATACCTC ACTGTGCATA
GTGGCCTACC GGTAACGAAG TCGGGGATTT CCTCGCTCTC GACCTTGGTG GTACCAATCT
TCGAGTCTGT CTCGTTACTC TTCTAGGCAG TGGAAAGTTT GAAGTCACTC AGACCAAGTA
CCGATTGACC GAGGAACAGA AGCAAGGCGA GGGACAAGCT CTGTGCGTGC CTTGCCATTT
CATCCACTGC ATTCCTAACA CGAAGAACAG TCTGGACTTT TGCGCAGAGT GTTTAAACAG
CTTCATCCGC GATACCCTCG GCCGCACTGA AAAAGACGGT ATCCTCCCCC TTGGTTTCAC
TGTAAGTTGA CTGTTACACC TTCCGTAAAC AATGAAAATT GACCGCCGGA ACCCAATCAG
TTCTCCTACC CTTGCTCGTA CGTGCGCAGC TGTAGTAATA ATATTGTGAA GCTCATCTTT
TGAAACAGTC AAGACCGAAT TGATCACGGT GGTAAGTAAC CATTTTTTAC GCCTGAATGA
CTTCTTATCA TCACGGTACA GTTCTTATCC GTTGGACCAA GGGATTCGGT GCTCCCAACA
TTGAAGGATA CGATGTCGCC GCCATGTTCA AGGACAGTCT CAAGCGTATG GTGCGTTGAT
CCCTCTTGAA CTTTGTTATT CTTCTAATGC AGACGCCCTT TATAGGACGT CCCCGCGGAA
CTCACTGCTC TCATCAATGA CACTACCGGT ACTCTTATCG CCTCCAACTA CGTTGACCCC
CACACCAAGA TCGCTGTCAT CTTCGGAACC GGCTGTAACG CTGCCTACAT GGAGACCGCC
GGCAGCATCC CCAAGATCGA CTACGTCGGA TTGCCCGAGG AACAGGGAAT GGCTATCAAC
GTGAGTAATC CAGCGTCTTT TTTTCTGCAT ATGAGCTCTG GGCAGAGGAG CTGATTAAGA
GACGGATGTT GACAGTGTGA ATGGGGAGCG TTCGACTCTT TCGACCACCA ACACCTTCGT
GAGTTTCCTT CCCCAATATC CAGCTATTGC TAACAACATT CAAGCCCGAA CCAAGTACGA
CATTATCATT GATGAATCTT CCAACAAGCC AGGAGAGCAG GTGAGCAACC TTTTCGTGAT
TGATTATATT TTGACTGACC TATTCTGCAG TCCTTTGAGA AGATGATTGC TGGTCTTTAC
CTTGGTGAAA TCTTCCGTCT CGTTCTCTGC GAGCTCATCG ATTCTGGTGA CCTTTTCTTG
GGTCAGAACA CCTACAAGCT CGAAAAGGCC TATGCTTTCG ACACCGCTTT CTTGTCTCTC
ATGGAAGCGT AAGTGCACTT CAAACTGCAA TTTGCCTTTT TTTTTGCTAA AGTCACGTCA
TAAGCGATGT TACCGAAGAG CTTTTGACCA TCATCGGTGT CTTTGCTCAC TTCTTCGGCC
TTGAAACTAC CCTTGAAGAG CGTCAGTTCT TTAAAAAGCT TGCTGTGTTG GTCGGCACCC
GATCTGCTAG GCTTTCTGCG TGTGGTATCG CCGCCATTGT TAGCAAGAAG GGGTACCTCG
AAGAAGGATG TGCCGTTGGC GCCGATGGAA GTTTGTACAA CGTATGTTTA TGCCTACACA
GCCTTTCTAT ATATTGGATA AGGCTAATAA AAGTGCCAAG AAATACCCCA ACTTTGCGGA
CCGAGTTCAT GAGGCGCTTA CGGACATTTT CGGTGAAAGC GGCAAGAAGA TTGTCACCCA
CCATGCTGAG GATGGTTCAG GTGTCGGTAG CGCAATCATT GCCGGTGCGT CTCTCCAGTA
TCGTCGTGAT TGAGGCAAAA AACTAATTTT GCATCTTCTT TTTTTTCCAG CAATGACCAA
GGCCAGGAAG GACTCTGGAT TCTTTGTCGA ATACTAATTT CCTCTTACGG TATTTCTACG
TGTCGGTCGG TTGCTGTGTA CCAGATAAAA ATTTCGAACG GAAACGTGGC CAAAACTTTT
CATTAGAGGG TTTTAAACAA AGATGTATGA ATGAATATTT AATT
 
Protein sequence
MSETPDQLAT RVQNLSTAPT TTQRTGSGSA VLENGTNIGS AAGRKASIPA QVDPERIIST 
SGSGRTSRRG SGLVMTPGGV QTVYHTRTNE DIEFPHAGKK TMADLLRKYE SLFTLTPQRM
RMIVHAIEET LDNGLQKNGQ VVPMIPTYVF GWPTGNEVGD FLALDLGGTN LRVCLVTLLG
SGKFEVTQTK YRLTEEQKQG EGQALLDFCA ECLNSFIRDT LGRTEKDGIL PLGFTFSYPC
SQDRIDHGVL IRWTKGFGAP NIEGYDVAAM FKDSLKRMDV PAELTALIND TTGTLIASNY
VDPHTKIAVI FGTGCNAAYM ETAGSIPKID YVGLPEEQGM AINCEWGAFD SFDHQHLPRT
KYDIIIDESS NKPGEQSFEK MIAGLYLGEI FRLVLCELID SGDLFLGQNT YKLEKAYAFD
TAFLSLMEAD VTEELLTIIG VFAHFFGLET TLEERQFFKK LAVLVGTRSA RLSACGIAAI
VSKKGYLEEG CAVGADGSLY NKYPNFADRV HEALTDIFGE SGKKIVTHHA EDGSGVGSAI
IAAMTKARKD SGFFVEY