Gene CNH00470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNH00470 
Symbol 
ID3259233 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006693 
Strand
Start bp1054109 
End bp1055316 
Gene Length1208 bp 
Protein Length315 aa 
Translation table 
GC content48% 
IMG OID638258439 
Productconserved hypothetical protein 
Protein accessionXP_572239 
Protein GI58270166 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3386] Gluconolactonase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCAAGA AGTTCGTCGT CGACAAGCCC CTCCTCCAGC TTAACGGTAC TTTAGGCGAA 
GGTGAGCGAT CATTTGTATT TAGTATGGTC AACTGCTTAA TGTCTCTGGC GTGCAGGCTG
CGTCTGGGAT ACCAGGACTC AGCGTCTCTA TTTCGTCGAC ATTGATCAAT ATAAGCTCTA
CACCTACGAG CCGTCTTCTG GCAAGTATGG ATATGAGTCA TTTGACAAGA AGGTCACTGC
TTTGGCATCC CTCGAGAATG GAGAGGGTGT GAGTCGGTAG TCGCACACTC ATCCATGAAG
ATCAACTAAT TCAAAATAGT TGATTGCTGC TGTTGAAGAT GGTTTCGCCT ACATTTCCTT
TGACAGCCTT CCCTTCCCAC CGACTTCTTC CAAGCAGTCG CTTATCCCTA TTTCCTCTGG
TGCCAGTCTC AACTGCAGGG AAAAGCGATT CAACGATGGT GCTGTAGACC CCGCTGGGCG
ATTTTTAGCC GGTACATTGG GATTCGAGCA CGGTAGCAAA AATGGGAAGA TGTACTCTCT
GCAAGCGGAA AAAGATGGGA GCTACAGTGC TCCGTTGATT CTTGATGGGA TCACTTGTAC
GAATGGTATG GGATGGACTG AAGACGCAAA GACTTTGTAA GTATGAAGCA ATACCACATA
ACAGAAACAT TCTAAGCTAT CCTTAATAGC TATTTCACAG ACAGCTGGAT CAAAGAAATT
GCGAAGTTCG ACTACGACAT TGTATGTGCA AGACACCCTG ATTTGAAGAT TCGTTAACTC
TTTTCGGTAG ACGACGGGGA AGCTCAGCAA CCGCCGAGTT TTCTCCAACT TTGACGGCTA
CGGTGAACCT GATGGCATGT GCATGGATTC TGAAGGCGGT ATCTGGACAT GTCGGTGGGC
TTCAGGAAAG GTCTTGCGCC TTACGCCTGA CGGCGAGATT GATGTCGAGA TTGACTTCCC
TACTGCTTGG CACATTACCT GTTGCATCTT TGGCGGTAAG TCGAGTATAA GGCATTGGGC
TCAGCAGTGG TACTGATCTA CACGTAGGTG AAAACCTTGA CGAACTCTAT GTTACTTCGG
CCGCCTCTGA CTACATCGGC GATAACCTTC CCGACCGTAA GAACGGTGGC GATTTGTTCG
TTGTGAAGGG CCTTGGATTC AGGGGAATTG AGCGAGGCAG GTTCAAGGGT ACCATTCCCA
ACAAATAG
 
Protein sequence
MFKKFVVDKP LLQLNGTLGE GCVWDTRTQR LYFVDIDQYK LYTYEPSSGK YGYESFDKKV 
TALASLENGE GLIAAVEDGF AYISFDSLPF PPTSSKQSLI PISSGASLNC REKRFNDGAV
DPAGRFLAGT LGFEHGSKNG KMYSLQAEKD GSYSAPLILD GITCTNGMGW TEDAKTFYFT
DSWIKEIAKF DYDITTGKLS NRRVFSNFDG YGEPDGMCMD SEGGIWTCRW ASGKVLRLTP
DGEIDVEIDF PTAWHITCCI FGGENLDELY VTSAASDYIG DNLPDRKNGG DLFVVKGLGF
RGIERGRFKG TIPNK