Gene CNC04790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC04790 
Symbol 
ID3256356 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp1446505 
End bp1447814 
Gene Length1310 bp 
Protein Length349 aa 
Translation table 
GC content47% 
IMG OID638255697 
Producthistidinol-phosphatase, putative 
Protein accessionXP_569759 
Protein GI58265206 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1387] Histidinol phosphatase and related hydrolases of the PHP family 
TIGRFAM ID[TIGR01856] histidinol phosphate phosphatase HisJ family 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAACAGTCTG TCTGCTCTGC TTGAGCTGCA TAAATTTCCA TTACCGTTCG GAATTTTTGT 
TCCCGCCCCA GCTTCACATA TTCCGCCTTT TCCGTCCAGC CTATCTTCCC TAAATGCCTC
ACAGCCACCA TTCTCATTCG GGACAATTCT GCCGACATGC CAAAGACAAT CTCGAAGATG
TTATTCTTGA AGCGATCCGA CAAGGCTTCC AGTCATTTGG ACTGAGTGAG CATGCTCCTA
GATGGCGGGT CGAAGATCTT TTCCCAGAAG AAGTAGGTTT GCGGTAGCGT ACTGGGTGGG
AGGCTCATTA TAGTCTTGCC TACAGGCCGA TTTATGCCCC TCTGACCTCC TCTCGACATA
TGAAGACTTT TTGAAGACAG CATTGATCCT ACGCTCGAAA TACGACTCCC AAATTTCCTT
GCTGGTGTCT TTAGAGACAG ATTATATCAC TCCCCTCGAT TCAGAAAAAT TGACCTCATT
TCTTGTCGAG CATACGGAGA TCGACTATAT CGTTGGCAGT GTCCACCACG TCAATGGTGT
GTCCATCGAT TTTGACCGAC CAACATGGCT GAGGGCTGTT AAACTTGCCA AGGAGGGTAG
AATAGGTAAA ACAATGGACC CCGGACCGCC ACCAACACTT GAGTTGGGTG ATCCAAATGA
TCCTGAGCTC ATGACGACCT ACACCCCAGA TCTCTTGTCT GTCCAACCTT TCTTTGAAGC
ATATTTCGAC GCTCAATATG ATCTCATTGT AAAGCATCAA CCAGAGGTTC TAGGTCATAT
TGACCTATGC TCCTTATGGA TACCAAATAT CAGCTTGATG GAGCAGGAAC CTGTCTGGCA
AAAGGTAATA AGGAATGTCA AGGCGGTAAT CGCCTACGGA GGTCTATTTG AAGCGAACGC
TGCGGCTATT AGGAAAGGTT GGAAGACAAG TTATCCATGC AGGGATATAC TCCAAGTGAG
TATCTATTTT GCTAACAGGA ACATCTAGAC GAAATGATTA AATACTGTTT GGCCTTGCAG
TTGATCCAGG AACTGGGGGG AAGAGTTTGT TTGTCTGATG ATTCACATGG CATTTCTTAT
GTGGGACTCA ACTACCTTAA GATGAGAGAC TATCTAAAGG GCATGGGTCT TGAGCGTACA
TGGTACCTTG TTTCATCGAG TCGCCGCCAG ACTGGAGATT ACACCGTTGG TGAGCGGGGC
CGGGTTGCAG CTAGGCCGTT GGATGGGTGG TATGATCACC CTTTCTGGGC GAAACTATCG
GATGCTCAAC GACGTAAATG AGTAAATGGA TGGGTTAGGA AATGAGTAGC
 
Protein sequence
MPHSHHSHSG QFCRHAKDNL EDVILEAIRQ GFQSFGLSEH APRWRVEDLF PEEADLCPSD 
LLSTYEDFLK TALILRSKYD SQISLLVSLE TDYITPLDSE KLTSFLVEHT EIDYIVGSVH
HVNGVSIDFD RPTWLRAVKL AKEGRIGKTM DPGPPPTLEL GDPNDPELMT TYTPDLLSVQ
PFFEAYFDAQ YDLIVKHQPE VLGHIDLCSL WIPNISLMEQ EPVWQKVIRN VKAVIAYGGL
FEANAAAIRK GWKTSYPCRD ILQLIQELGG RVCLSDDSHG ISYVGLNYLK MRDYLKGMGL
ERTWYLVSSS RRQTGDYTVG ERGRVAARPL DGWYDHPFWA KLSDAQRRK