Gene CNB04330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNB04330 
Symbol 
ID3255550 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006684 
Strand
Start bp1263282 
End bp1265375 
Gene Length2094 bp 
Protein Length560 aa 
Translation table 
GC content48% 
IMG OID638255078 
Productgalactokinase, putative 
Protein accessionXP_568932 
Protein GI58263044 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0351932 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACCG CAGAAATTCT TCATCTCTTA TCATCTCTTC AAACCAACAA TTACCAAGGC 
ATTCACCAGA AAAGCATGTC TGCCAAGGTC CCCATCACCG TCTTCAACTC TCTGCAAGAA
ATATATCCAT CTGCTTCGGC CGTTCTTCGC GAAGGGTGAG TCGTCCTTGA TAACAAAAGC
TCTGAAGCGC TCACAACATC GGCCCGATCG ACCGACCTTT TGACAACAAT AACAGACAAC
GATGGAACAC TCTTCTCACC CGGTTCCAAG AACGCTTTGG CGAGACTCCA ACATACATCG
TTCGTGCTCC TGGACGGGTC AATGTACTTG GAGAGCACAT CGATTACTCT CTTTTCGTGA
GTTTCGGTTT GACGTGCCTC CTTTGGAGCT GTCATTTTGA CGGAACCGCA ATCTTATCTT
GATGGCACCA TGATAGGATT ACGGCATCAT CAATTGTTAA TTCTTGTAAC TGATATTTGG
ATGTCTGATT CTTAGCCCGT ACTTCCTGCA GCTATTGAGC AAGATATTCT TTTCGCCCTT
CGCCCTACGC GTCCTGTTGC CGGTTCGAAT CCAACAGTAC GGTTAGAAAA CTATGACAAG
AAGTACAGTT ACCCAGGCTG CTCTTTCTCA CTTGTCCCTG GAGAAAATGG ATGGGACGTA
GGTCTTAACG CCGGAGGAGG GTGGGACAAG TACGTTAGGG CTGCTTTATT GGAGTGCTTG
GATGAGCTTT TCCCTGTTGG CAAAGGGGGT GGGAAACAAG AGGCAGTAGG GATGGATGTG
CTTGTTTCTG GGAGTATACC TCCTGGCTCT GGCTTGAGTG TAGGTAATTC TTCAACTCAT
TGAAACATAT CTTAAAGCTG ACGAAATTTC AGAGCTCCGC AGCAATGGTC GTTGGCTCCG
TCATCATGTT CCTAGTTGCC AACAATTTGG CGGCAGGTAA AACGAAAGAG GATGTTGTAC
AACTCGCCAT CAATTCCGAG CATCGCATGG GCTTGCGCAC AGGTGGCATG GACCAATCTG
TCTCTGCTCT TGCTTTGCCC AACAACCTGT TACACCTCTC TTTTCACCCA GGCTTGCTGC
CTGCACCTTT ACCTCTTCCC GGCAATGTAT CGTTGGTCAT CACAAACTCC CTTGCCCCGC
ATTCTTTGAC AGACTCTGCG CCGGAAGAGT ACAATTTGCG AGTCATTGAG ATCCTCATCG
CCACCCGTCT CATCTTACAT CACTGGAAGT TAGAGTCACA GTTCTACCGC AACCCTAGAC
CATGGTTAAG AGAAGTCTTG GGCGCTTGGG TGGGTGAGAA GGGTCATATG GGATGGGAAA
AGGAAGGTGA GGTGACGAAA AAGGCTCTTG GTGACATTGA ATGGATCAAG AGAGATGGAG
GGTGGACCAG GGAAGAGATG ATCAAGTATT CGGGCATGGA CGAAGAAGAG TTTAAGAAGA
GTTACCTCGA CTTCCTCGAG AGTAGGTGTT ATCATTTTGT ATATGTATCA AGCTGACAGG
TTGGGCGTTT ATAGTTCGAG CAGAAAAGTT CCACCTCTAT GAACGCTTGC ATCATACCCT
CACCGAATCT TTGCGCGTTC ACAAATTCGT GCACCTCTGC CAATCCATCT CCACTTCCAA
CCCTCTCCCT CCATCTTCTG ATACGCCTCT ACCTACAGCA AACGACATCC TCAGCCAACT
GGGCAAGTTA TTTGATGCAT CTCACGCTTC CATGCGAGAT ACATATGACT GCACCCACCC
TCTCGTGGAT TCGCTGCAAG AGCTGTGCTT AAAAAGTGGA GCAATTGGTT CCAGGATGAC
GGGTGGGGGA TGGGGTGGGT CAGTGGTCAG CCTTGTGGAA AGCTCACAAG TGCCCGAGTT
TTTGGAAAAA GTCAGAAAAG GATATGAAAA ATATGGTGAT TTGGAGGATG AAGAATGGGT
AGAGGTTGGT TTTGCTACAA TGCCGGGCCA TGGAGCTGGA GGTGAGCCTT GTAATCTCTG
TTGTCTATAG CATTTCGCTG ACTAGTGGAT AGTGTATGTT GTGGAGAATG GGATTAGAGT
GGAAAATGGG GCAGCATAAG GCTCTCGTCT TTATCACGAG CCTTCTTCCA GCAA
 
Protein sequence
MTTAEILHLL SSLQTNNYQG IHQKSMSAKV PITVFNSLQE IYPSASAVLR EGQRWNTLLT 
RFQERFGETP TYIVRAPGRV NVLGEHIDYS LFPVLPAAIE QDILFALRPT RPVAGSNPTV
RLENYDKKYS YPGCSFSLVP GENGWDVGLN AGGGWDKYVR AALLECLDEL FPVGKGGGKQ
EAVGMDVLVS GSIPPGSGLS SSAAMVVGSV IMFLVANNLA AGKTKEDVVQ LAINSEHRMG
LRTGGMDQSV SALALPNNLL HLSFHPGLLP APLPLPGNVS LVITNSLAPH SLTDSAPEEY
NLRVIEILIA TRLILHHWKL ESQFYRNPRP WLREVLGAWV GEKGHMGWEK EGEVTKKALG
DIEWIKRDGG WTREEMIKYS GMDEEEFKKS YLDFLEIRAE KFHLYERLHH TLTESLRVHK
FVHLCQSIST SNPLPPSSDT PLPTANDILS QLGKLFDASH ASMRDTYDCT HPLVDSLQEL
CLKSGAIGSR MTGGGWGGSV VSLVESSQVP EFLEKVRKGY EKYGDLEDEE WVEVGFATMP
GHGAGVYVVE NGIRVENGAA