Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNB04330 |
Symbol | |
ID | 3255550 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | - |
Start bp | 1263282 |
End bp | 1265375 |
Gene Length | 2094 bp |
Protein Length | 560 aa |
Translation table | |
GC content | 48% |
IMG OID | 638255078 |
Product | galactokinase, putative |
Protein accession | XP_568932 |
Protein GI | 58263044 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0153] Galactokinase |
TIGRFAM ID | [TIGR00131] galactokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0351932 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACCG CAGAAATTCT TCATCTCTTA TCATCTCTTC AAACCAACAA TTACCAAGGC ATTCACCAGA AAAGCATGTC TGCCAAGGTC CCCATCACCG TCTTCAACTC TCTGCAAGAA ATATATCCAT CTGCTTCGGC CGTTCTTCGC GAAGGGTGAG TCGTCCTTGA TAACAAAAGC TCTGAAGCGC TCACAACATC GGCCCGATCG ACCGACCTTT TGACAACAAT AACAGACAAC GATGGAACAC TCTTCTCACC CGGTTCCAAG AACGCTTTGG CGAGACTCCA ACATACATCG TTCGTGCTCC TGGACGGGTC AATGTACTTG GAGAGCACAT CGATTACTCT CTTTTCGTGA GTTTCGGTTT GACGTGCCTC CTTTGGAGCT GTCATTTTGA CGGAACCGCA ATCTTATCTT GATGGCACCA TGATAGGATT ACGGCATCAT CAATTGTTAA TTCTTGTAAC TGATATTTGG ATGTCTGATT CTTAGCCCGT ACTTCCTGCA GCTATTGAGC AAGATATTCT TTTCGCCCTT CGCCCTACGC GTCCTGTTGC CGGTTCGAAT CCAACAGTAC GGTTAGAAAA CTATGACAAG AAGTACAGTT ACCCAGGCTG CTCTTTCTCA CTTGTCCCTG GAGAAAATGG ATGGGACGTA GGTCTTAACG CCGGAGGAGG GTGGGACAAG TACGTTAGGG CTGCTTTATT GGAGTGCTTG GATGAGCTTT TCCCTGTTGG CAAAGGGGGT GGGAAACAAG AGGCAGTAGG GATGGATGTG CTTGTTTCTG GGAGTATACC TCCTGGCTCT GGCTTGAGTG TAGGTAATTC TTCAACTCAT TGAAACATAT CTTAAAGCTG ACGAAATTTC AGAGCTCCGC AGCAATGGTC GTTGGCTCCG TCATCATGTT CCTAGTTGCC AACAATTTGG CGGCAGGTAA AACGAAAGAG GATGTTGTAC AACTCGCCAT CAATTCCGAG CATCGCATGG GCTTGCGCAC AGGTGGCATG GACCAATCTG TCTCTGCTCT TGCTTTGCCC AACAACCTGT TACACCTCTC TTTTCACCCA GGCTTGCTGC CTGCACCTTT ACCTCTTCCC GGCAATGTAT CGTTGGTCAT CACAAACTCC CTTGCCCCGC ATTCTTTGAC AGACTCTGCG CCGGAAGAGT ACAATTTGCG AGTCATTGAG ATCCTCATCG CCACCCGTCT CATCTTACAT CACTGGAAGT TAGAGTCACA GTTCTACCGC AACCCTAGAC CATGGTTAAG AGAAGTCTTG GGCGCTTGGG TGGGTGAGAA GGGTCATATG GGATGGGAAA AGGAAGGTGA GGTGACGAAA AAGGCTCTTG GTGACATTGA ATGGATCAAG AGAGATGGAG GGTGGACCAG GGAAGAGATG ATCAAGTATT CGGGCATGGA CGAAGAAGAG TTTAAGAAGA GTTACCTCGA CTTCCTCGAG AGTAGGTGTT ATCATTTTGT ATATGTATCA AGCTGACAGG TTGGGCGTTT ATAGTTCGAG CAGAAAAGTT CCACCTCTAT GAACGCTTGC ATCATACCCT CACCGAATCT TTGCGCGTTC ACAAATTCGT GCACCTCTGC CAATCCATCT CCACTTCCAA CCCTCTCCCT CCATCTTCTG ATACGCCTCT ACCTACAGCA AACGACATCC TCAGCCAACT GGGCAAGTTA TTTGATGCAT CTCACGCTTC CATGCGAGAT ACATATGACT GCACCCACCC TCTCGTGGAT TCGCTGCAAG AGCTGTGCTT AAAAAGTGGA GCAATTGGTT CCAGGATGAC GGGTGGGGGA TGGGGTGGGT CAGTGGTCAG CCTTGTGGAA AGCTCACAAG TGCCCGAGTT TTTGGAAAAA GTCAGAAAAG GATATGAAAA ATATGGTGAT TTGGAGGATG AAGAATGGGT AGAGGTTGGT TTTGCTACAA TGCCGGGCCA TGGAGCTGGA GGTGAGCCTT GTAATCTCTG TTGTCTATAG CATTTCGCTG ACTAGTGGAT AGTGTATGTT GTGGAGAATG GGATTAGAGT GGAAAATGGG GCAGCATAAG GCTCTCGTCT TTATCACGAG CCTTCTTCCA GCAA
|
Protein sequence | MTTAEILHLL SSLQTNNYQG IHQKSMSAKV PITVFNSLQE IYPSASAVLR EGQRWNTLLT RFQERFGETP TYIVRAPGRV NVLGEHIDYS LFPVLPAAIE QDILFALRPT RPVAGSNPTV RLENYDKKYS YPGCSFSLVP GENGWDVGLN AGGGWDKYVR AALLECLDEL FPVGKGGGKQ EAVGMDVLVS GSIPPGSGLS SSAAMVVGSV IMFLVANNLA AGKTKEDVVQ LAINSEHRMG LRTGGMDQSV SALALPNNLL HLSFHPGLLP APLPLPGNVS LVITNSLAPH SLTDSAPEEY NLRVIEILIA TRLILHHWKL ESQFYRNPRP WLREVLGAWV GEKGHMGWEK EGEVTKKALG DIEWIKRDGG WTREEMIKYS GMDEEEFKKS YLDFLEIRAE KFHLYERLHH TLTESLRVHK FVHLCQSIST SNPLPPSSDT PLPTANDILS QLGKLFDASH ASMRDTYDCT HPLVDSLQEL CLKSGAIGSR MTGGGWGGSV VSLVESSQVP EFLEKVRKGY EKYGDLEDEE WVEVGFATMP GHGAGVYVVE NGIRVENGAA
|
| |