Gene CNB02660 details


Gene Information

Locus tag: CNB02660
Symbol:
ID: 3255870
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006684
Strand:
Start bp: 783458
End bp: 786007
Gene Length: 2550 bp
Protein Length: 488 aa
Translation table:
GC content: 47%
IMG OID: 638254915
Product: hexokinase, putative
Protein accession: XP_568853
Protein GI: 58262886
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG5026] Hexokinase
TIGRFAM ID:
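
The gene length listed above is consistent with the start and end coordinates if both are read as 1-based and inclusive. That reading is an assumption about this record, not something it states. A minimal sketch of the arithmetic, with an illustrative helper name:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end are both 1-based and inclusive (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = span_length(783458, 786007)
print(gene_length)  # 2550
```

A 488 aa protein needs only 488 x 3 + 3 = 1467 coding bases including a stop codon, so most of the remaining span is presumably intronic or untranslated sequence, in line with the gene being marked as spliced.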


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.800331
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTTTGTTACA CACGTCTCTA TATCCGCAAT TCCGCCTTCG CTCCATACAT CCTCGCCATT 
CTCGCCTCCA TTCCCTCAAC CACAGCCTCC AAAACCCCTC TATCACTGTA TATCTTTCAC
TTCATCTTGT CACATATCTT GTCTACATCT CTTCCTTTGT GGATCCCGTA GCTCATCTCA
ACTGTCATCT TCAACAAACC CACATACGCA TCTCGATTTC TATACCATCC ACCAGAGCCT
CGAAATAACA CCACTATGAC CCTCCCCGAA GTTTGCAAGC AATTTGAACC TTATTTCATC
TTGGACGATG CGAAGCTCGT CGATATCGTC AAGCACTTCC GTAAGGAGAT GGAGGAAGGT
CTTGCGAGCT ATGGAAAAGA CATGGCTATG ATTCCTACCT TCGTTACTGG CGTTCCAGAT
GGTACTGAAG AAGGGTGAGT CTTTAGCGTT ACTAGGGCGA TTAGGGAATG GAGACAAGAA
ACTTTCGCTT TTCATGTTGG AAAAAAAAAG TTTGCTCGCT ATAGAATGAT CAGCTGATTT
TTTTGGGCAG CGTATTCTTG GCTTTGGATC TGGGAGGCAC CAATTTGTAA GTCATTGCAG
TACATGGCTT TTTTGAATCA CGCCAATCAT CGTCCCTTTT CGTGATGCTG TTGGAAGGCC
GATCGTGGAC GGAGGCTGAT TAACTGAATT TCTCCTGATG CCCGTTCAAT AATAGACGCG
TCTGTTTGAT TCTTCTGCAA GGCCACAACC AATTCAAGAT CAAACAGCAA AAGTATAAAG
TATCAGAGGA GCTTAAAACG GGCCAGGCGC GGGTGCTCTT TGGTGAGTAA ATGTTTAGGT
GTAATTTCTG AATCAGAGTA TCTAATGGTG GAGAGTTGCA GATTACATTG CAGAATCTGT
CGATAACTTC CTTACCGAAG TTGAGAGTCA CGAAGATATT GCTATACCCG CCACCGGCGA
GCCTATGCAT CTTGGGTTTA CCTTTAGGTG CGTTGCGTCC GCCGAGGAAC CTTATCTCAA
GCTGATGCAC ATCGCAGTTT CCCTGTGGAA CAAACTGCTA TTGACGCGGG CAAATTACTT
ACATGGACCA AGGGCTTCAA CACGAAGAAC GCTATCGGCC ACGATGTCGT TCGTCTTTTA
CAAGACGCTT TCGACCGCAA GCACATGCAT GTCCGATGCA GTGCATTGGT TAATGACGTA
AGTACCACCT TTTCTTCCAA ACTTCAGCTC ACAATACATC GTACAGACTG TGGGCACCCT
TCTTTCCCGC TCATATCAGA GTGGTCCTGC TCTCATCGGT GCCATTTTTG GCACTGGTAC
AAATGGTGCC TATATCGACA AGACACGAAC CATCAGCAAG TTGGGCAAAG AAAAAATTGA
GGACGCCGAA GAGGGCGGCG AGCATGCTGG AGAATTTATG GTCGTCAACA TGGAATGGGG
AGCGTTTGAC AACAAGGTCT GTGTAGTGCC CTTGTGATCA ATAGACTAGG GGTTGACACA
AAGCAGAGGC AGTGCCTACC TATCTCCATT TTTGATAATA AACTGGACCG AGAAAGCATA
AACCCGTGAG TTGGGCCATT TTTTCAAGTC AAAAAGATTA TGCTAATCAC TTTACATCAT
GCAGGCGGAA GCAAGCTTTC GAGAAACTGG TGTCCGGCAT GTGTAAGCCC ATGTTTCAAA
GAAAACGTCG TCCTTTTTAG GCCGCTGACA TTGTCAACAG ACCTAGGTGA AATCACTCGT
AACATGCTTC TCTACATGAT TGATTCCTCT CTCCTCTTCG GGGGTCACTC CAGTGAGATT
ATCAACACGC ATTATGGCTT CGACACTTCT TTCGTCTCTG GTATTGAAGG AATCTCTTCT
CCCGAGGAGG TAAAAAAGTT AATCGTCAAG GAACTCAAGG TTGACCCAAA GCACGTTACG
GATAAGTGTC CCGAGATTGT TCAGTGGGCG GTACGATTGG TTTCCGACCG AGCTTGCAAG
CTTGCCGCCT GCGCTATTGC AGCTGTTGTC CTTCATACAG GCAATGACAA GGCTCCAGAG
GGTGAGGAAG ACAAGGGTGT GGATGTTGGT GTGGATGGCA GTGTTGCCGA GTTCTTACCC
ATGTTTAGTG AGAGGGTGAT GGCCGCATTG AAGGCTATCC TCGGTGAGAA GAGTGCGGCA
AGGGTGAAGT TGGGACTGGC CAAGGATGGG AGTGGTGTCG GTGGTAAGTT GCCACGAGGA
TTTCTAGTCA CTCAGTGCAA GTAGTCGCTT ACGGGAAACG CAGCGGCTCT CACTGCACTC
CAGGCCAAGA AGGCATTAGA TCGGCGATCA GACAGGTCAA TCAAGTTCGT CCCTGGCGAA
CGTGGACCTT AAAATAAGAC ACCTAGCAAT TTTTTGTACC GTACCTTACT TTATCTACCT
GTTTCGCTCG CGCATGTGTG AATATGTGTC GTTTCGGCCT TTATTTTGGC GTTGTGAGAA
GTAGCAAACC AAAGATGGAG ATCATCTGTC ATCTTGAGCC TGGAATACCG CTGATGCCAT
ATTTTCTGAT GCATTAGCTT GTCAAACATT
 
Protein sequence
MTLPEVCKQF EPYFILDDAK LVDIVKHFRK EMEEGLASYG KDMAMIPTFV TGVPDGTEEG 
VFLALDLGGT NLRVCLILLQ GHNQFKIKQQ KYKVSEELKT GQARVLFDYI AESVDNFLTE
VESHEDIAIP ATGEPMHLGF TFSFPVEQTA IDAGKLLTWT KGFNTKNAIG HDVVRLLQDA
FDRKHMHVRC SALVNDTVGT LLSRSYQSGP ALIGAIFGTG TNGAYIDKTR TISKLGKEKI
EDAEEGGEHA GEFMVVNMEW GAFDNKRQCL PISIFDNKLD RESINPRKQA FEKLVSGMYL
GEITRNMLLY MIDSSLLFGG HSSEIINTHY GFDTSFVSGI EGISSPEEVK KLIVKELKVD
PKHVTDKCPE IVQWAVRLVS DRACKLAACA IAAVVLHTGN DKAPEGEEDK GVDVGVDGSV
AEFLPMFSER VMAALKAILG EKSAARVKLG LAKDGSGVGA ALTALQAKKA LDRRSDRSIK
FVPGERGP
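
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a small sketch of that cleanup, using one line of each sequence from this record as sample input. The helper names `join_blocks` and `gc_fraction` are illustrative and not part of any database tooling.

```python
def join_blocks(lines):
    """Concatenate sequence lines, dropping the spaces between blocks."""
    return "".join("".join(line.split()) for line in lines).upper()


def gc_fraction(seq):
    """Fraction of G and C characters in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Last line of the gene sequence and first line of the protein sequence above.
dna_tail = join_blocks(["ATTTTCTGAT GCATTAGCTT GTCAAACATT"])
protein_head = join_blocks(
    ["MTLPEVCKQF EPYFILDDAK LVDIVKHFRK EMEEGLASYG KDMAMIPTFV TGVPDGTEEG "]
)

print(len(dna_tail), len(protein_head))  # 30 60
print(round(gc_fraction(dna_tail), 2))   # 0.3
```

Passing every line of the gene sequence and of the protein sequence through `join_blocks` should give strings whose lengths can be compared against the 2550 bp and 488 aa values listed under Gene Information.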