Gene CNF01520 details

Gene Information

Locus tag: CNF01520
Symbol:
ID: 3258034
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006691
Strand:
Start bp: 451477
End bp: 453345
Gene Length: 1869 bp
Protein Length: 534 aa
Translation table:
GC content: 56%
IMG OID: 638257277
Product: UDP-N-acetylglucosamine diphosphorylase, putative
Protein accession: XP_571302
Protein GI: 58268292
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4284] UDP-glucose pyrophosphorylase
TIGRFAM ID:
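
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which matches the listed gene length), and the helper names gene_length and coding_length are hypothetical, not part of any database API. The remainder it computes is the part of the gene span that does not code for protein, which for a spliced gene would be introns plus any untranslated flanking sequence.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the Gene Information table.
start_bp, end_bp = 451477, 453345
protein_aa = 534

length = gene_length(start_bp, end_bp)   # span of the gene on the replicon
cds = coding_length(protein_aa)          # coding nucleotides incl. stop codon
non_coding = length - cds                # introns and/or untranslated sequence

print(f"gene length: {length} bp")
print(f"coding length: {cds} bp")
print(f"non-coding remainder: {non_coding} bp")
```

A non-zero remainder is what one would expect for a gene flagged as spliced; the split between intron and untranslated sequence cannot be derived from these fields alone.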


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AACACTCGGC CCGACCCACA ACACTGCATT CCCATCTTCT CTGCAGCTTT CTCGTCCGTC 
GTCCACTTTG CGCCCGTCCA CAGCACCATG ACCGTCCAGC CAGCACCAGA CCCTGCGCTC
CTCGCCCACC TCAGAGACCT CTACGCCGCC GCCAACCAGG CCCATGTATT CGCATTCTAC
GACTCGCTCT CGCCCTCCGA CCAGGCCGCC TTGCTCGGCC AGCTCGCCTC CATCGACGTC
CACCGCGTCA ACCGTATCTA CTCCACAGCG ATCGCAGCTG CTGAAGCTCT CACGCCGTCC
AAGGAGAACA GCAACATCTT TGGCGGCGGA CAGCCGAACC ACATCGGCGA AGGCGCGAAC
GGTAACCTTG TAGGCAACGA GACTGTCCAG GGTTCCTTGC CCATCAAGGA GGAGGCCATG
CCCTTGCCTG AGGAGGCATG CGCGACTGTG CTTAACAACG CTTCCGAAGA AGCTCAATGG
CGCGACGCCG GTTTGAAGGC GATTGCCGAC AACCAGGTCG CCGTCCTCCT CATGGCCGGT
GGACAGGGCA CCCGTCTCGG CTCTGCGCTC CCCAAGGGAC TGTACGATAT CAAGTTGCCC
AGTGGACAGA CTTTGTTCGA ATACCAGGCC AAGAGGATCT GCAAGCTCGA GAGGCTGGCG
GAAGAAAAGG CGGGCAAGGA GAAGGGTAGT GTCACCATTC GGTGGTACGT GATGACCAGT
GGTCCCACCC GGGTCGAGAC GGAAAAGTAC TTCAAGGCGA AAGGCTTCTT TGGGTTGAGA
GAAGAAAATG TCATCTTTTT TGAGCAAGGC AAGTCTTTGT GATTATCCAT CTCCTTGAAG
CAAACAACTA ACATGCCACA GGCGTACTCC CCGCCCTTGA CAACGACGGC AAGCTTCTTC
TTTCAACACC TAGCTCTGTA TCCGTTGCTC CCGACGGCAA CGGTGGTCTC TACGCCGCCC
TCCGTCGCCC TCTCTCCCCC TCATCCTCCC GCACGGTCCT CTCCGATCTC CGCGAGCACA
ATGTCCAATA CGTCCACGCC TACTGCGTCG ACAACTGCCT CGTCCGTGTT GCCGACCCCG
TCTTCATTGG CTGCTGCTTG TCTCGCAATG CCTCGGCCGG TGCCAAGGTT GTGCGCAAGA
CCATCCCCAC AGAGAGTGTG GGTGTCCTCG CGGCCAAGGG TAACGCTTTT GCCGTGGTGG
AGTACTCTGA GCTGAGCAAG GAAAAGGCCG AGCAGAGGAC TGCGGACGGT CAGCTGGCTT
TCCGTGCTGC CAACATTGCA AACCACTTTT ATACCACCGC CTTCCTCGAG TCGGTTGAAG
AAATGGAAAA GCATATGGCG TTCCACATTG CTCGAAAGAA GATCCCCACC GTCGACCTTT
CCACTGGCGA GCTTATCAAG CCTTCTGAGC CCAACGGCAT GAAACTTGAG CTTTTCGTCT
TTGACGTCTT CCCATTCACC AAGAGTCTCT GTGTACTCGA AGTCGACCGT GCCGAAGAAT
TCTCCCCGCT CAAGAATGCG CCCGGGAGCA AGGCCGACTG CCCCGAAACC AGCCGCAGGG
ATTTGCTCGC TCAGCAAAAA AGGTGGTTGA TCGCAAGCGG TGCCGAGGTT GCCGATGATG
TCGAGATTGA GGTCAGCCCC GAGGTCAGTT ATGCCGGTGA AGGCTTGAAC TGGATCGAGG
GCAAAAAGTT TACCAAGAGC GGAGTGTTGA ACGGTCGGAA TGATTTAGAG AAGCTTACCG
CGTAAAGGGA CAATTCTTTT TCTTTTCTTC TTCTTAGCAT TACGACGCAT CTGATTCAAT
AATGGACAAT GTCTCATATG TTTGTGTCAT TTTTTTATAC ATGTCTATTA ATTTCCAATG
CATAGAAGT
 
Protein sequence
MTVQPAPDPA LLAHLRDLYA AANQAHVFAF YDSLSPSDQA ALLGQLASID VHRVNRIYST 
AIAAAEALTP SKENSNIFGG GQPNHIGEGA NGNLVGNETV QGSLPIKEEA MPLPEEACAT
VLNNASEEAQ WRDAGLKAIA DNQVAVLLMA GGQGTRLGSA LPKGLYDIKL PSGQTLFEYQ
AKRICKLERL AEEKAGKEKG SVTIRWYVMT SGPTRVETEK YFKAKGFFGL REENVIFFEQ
GVLPALDNDG KLLLSTPSSV SVAPDGNGGL YAALRRPLSP SSSRTVLSDL REHNVQYVHA
YCVDNCLVRV ADPVFIGCCL SRNASAGAKV VRKTIPTESV GVLAAKGNAF AVVEYSELSK
EKAEQRTADG QLAFRAANIA NHFYTTAFLE SVEEMEKHMA FHIARKKIPT VDLSTGELIK
PSEPNGMKLE LFVFDVFPFT KSLCVLEVDR AEEFSPLKNA PGSKADCPET SRRDLLAQQK
RWLIASGAEV ADDVEIEVSP EVSYAGEGLN WIEGKKFTKS GVLNGRNDLE KLTA