Gene CNC03700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC03700 
Symbol 
ID3256565 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp1167390 
End bp1169622 
Gene Length2233 bp 
Protein Length503 aa 
Translation table 
GC content47% 
IMG OID638255591 
ProductUTP-glucose-1-phosphate uridylyltransferase, putative 
Protein accessionXP_569599 
Protein GI58264886 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4284] UDP-glucose pyrophosphorylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTCCGTCCCC GTATCATCTT CCCATCTTCT CTTTCGTTCT CCCTTTGTTC CTTCTTTTTC 
ACAAATAACA ATGGCTACTC TCGAACCCAA GCAGGACCCC CAGGCCTCCG CCAGGCAGAG
GGGGCACTCA GCAATGGTAA GCACAGTAGG CACGTTCCGA CAAAGCTGTT GCTGATCAAG
ACATCCACGT CTTTTTCCTT CGTTCTCTAC AACACTTGTA CCCCATTCTT GCCTTCAACA
GGACTTTAAA TCTGCCACCA CTGGTGTCGC TGCCAAGACC ATGCGAAACG AGCTCAACAG
AATGGTCGCA AACGAGCAAG ACCCTGCTAA GAAGAAAGTG AGTTGTTGGG TCCTTTGTTG
AGCACGTTTT CGTGCGCTTT TCTTAGGATA ATAAACAGTC TCTTGTTAAA AGAAATTCGT
GTAAAGTTGC TGACTATGGG CTTGTAAAGA TGTTTGAAGC TGAGATGCAG TCCTTCTTCA
TCCTCTTCAA TCGTTTCCTC ACTGAGCGCG CTAAGGGCGA AAAGCTGTAG GTGCACTGCA
CATTATTTGA ATCAATGAAT TGACCACATT ACACAGCGAC TGGGACAAGA TCAACCCTCC
CAAGCCTGAG CAGGTCCGCC CTTACGAAGT CCTTCCCAAT GTTGACCCTT CAATCCTCAA
CAAGCTTGCT GTTCTCAAGC TCAACGGTGG TCTCGGCACT ACTATGGGCT GTGTCGGCCC
CAAGTCAATT ATTGAAGTCA GGGATAACAT GACTTTCCTC GACCTTTCTG TTCGACAAAT
TGAGGTAAGC AAAGCCTCTT GTGTTTCAAG GATGCTGCTA ATAGGGAGCA ATTGTCGCAC
AGCACTTGAA CGAAAAGTAC AATGTGAATG TGCCCTTCAT CCTCATGAAC TCTTTCAACA
CCGATGAGGA CACAGCTAGG ATCATCCAAA AGTACCAGAA CCACAACATC AATATCCTCA
CTTTCAACCA ATCTCGATAC CCCCGTGTTG ACAAGGAATC TTTGCTCCCT TGTCCTCGAG
AATCTTCAAG TGATAAGAGC AACTGGTACC CTCCCGGACA CGGTGACATC TTTGATGCTT
TGACGTAAGT TCTCCACTGC TGACTTGTAT CGAGTATAAT TGACACCTCA CGCCTCCCTC
AGCAACTCAG GCCTTCTTGA CAAGCTCATC GCTGCAGGCA AGGAGTACAT CTTCATCTCC
AACGTCGACA ATCTTGGTGC TGTCGTCGAT CTCAACATCT TCCAGACCAT GATTGACGCT
CAGGCCGAGT ATGTCATGGA AGTCACTGAC AAGACCAAGG CCGACGTCAA AGGTGGTACC
ATCATTGACT ACGATGGCAA GCCTAGGTTG CTCGAGGTTG CTCAAGTTCC CAAGGATCAC
CTTGATGAAT TCTGCAGCAC TCGAAAATTC AAGATTTGTA GGTGTACATT CTATGAAACG
AAGTGAAAGC TGATTCAGAA CAGTCAACAC CAACAACATT TGGTGTAACT TGCGAGCCAT
CAAGAGGATC ATGGACGAGG ATGCGCTCAA CCTGGAAATC ATTGTCAACA ACAAGGTTAC
CGACGATGGT CTAGCCGTTA TCCAACTCGA AACTGCCATC GGTGCTGCTA TCAAGGTGAT
TGGCTACATT GATGAGAAAA TCCCAACATT TTGCTGACCT CATACAGCAC TTCGACTCTG
CCATCGGCAT CAACGTTCCT CGATCACGAT TTTTGCCTGT AAAGTCTTGC TCGTAAGTGC
TCATCATTGG GCATAGAGTT TGGAGCTAAT TGAGATACAG TGATCTTCTT CTCATCAAGT
CCAAGCTCTA CAATCTTGAG CACGGTGTTT TGACCATGGA CAGGTCCCGA GAATTTGGAG
GCACCCCTGT TGTCAAGCTT GGTGGCGAGT TCAAGAAGGT TGCCAACTTT GAGAAGCGAT
TCAAGTCTAT CCCCAACATC ACCGAGCTCG ACCATCTTAC TGTTTCTGGC GATGTCTGGT
TCGGTAAGAG CGTGAGGCTT GCTGGTACTT GTATCATTGT CGCCACTGAG GGCAACAAGA
TCATGATCCC CGACGGTACC AACCTCGAGA ACAAGTTGAT TACTGGTAAC CTTTCAATCA
TTGACCATTA AGCATGAGGA TGTAGGTTTG AGAGGTTGTG GCGGTGTCTA ATGTTTCCTC
TTCTGTCCGG AGGGTATTGT GGGAATGGCT GTATAAATAA AATTCGTCAT GCATCCCAAA
GATATGACTG ATT
 
Protein sequence
MATLEPKQDP QASARQRGHS AMDFKSATTG VAAKTMRNEL NRMVANEQDP AKKKMFEAEM 
QSFFILFNRF LTERAKGEKL DWDKINPPKP EQVRPYEVLP NVDPSILNKL AVLKLNGGLG
TTMGCVGPKS IIEVRDNMTF LDLSVRQIEH LNEKYNVNVP FILMNSFNTD EDTARIIQKY
QNHNINILTF NQSRYPRVDK ESLLPCPRES SSDKSNWYPP GHGDIFDALT NSGLLDKLIA
AGKEYIFISN VDNLGAVVDL NIFQTMIDAQ AEYVMEVTDK TKADVKGGTI IDYDGKPRLL
EVAQVPKDHL DEFCSTRKFK IFNTNNIWCN LRAIKRIMDE DALNLEIIVN NKVTDDGLAV
IQLETAIGAA IKHFDSAIGI NVPRSRFLPV KSCSDLLLIK SKLYNLEHGV LTMDRSREFG
GTPVVKLGGE FKKVANFEKR FKSIPNITEL DHLTVSGDVW FGKSVRLAGT CIIVATEGNK
IMIPDGTNLE NKLITGNLSI IDH