Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNC04790 |
Symbol | |
ID | 3256356 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | + |
Start bp | 1446505 |
End bp | 1447814 |
Gene Length | 1310 bp |
Protein Length | 349 aa |
Translation table | |
GC content | 47% |
IMG OID | 638255697 |
Product | histidinol-phosphatase, putative |
Protein accession | XP_569759 |
Protein GI | 58265206 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1387] Histidinol phosphatase and related hydrolases of the PHP family |
TIGRFAM ID | [TIGR01856] histidinol phosphate phosphatase HisJ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAACAGTCTG TCTGCTCTGC TTGAGCTGCA TAAATTTCCA TTACCGTTCG GAATTTTTGT TCCCGCCCCA GCTTCACATA TTCCGCCTTT TCCGTCCAGC CTATCTTCCC TAAATGCCTC ACAGCCACCA TTCTCATTCG GGACAATTCT GCCGACATGC CAAAGACAAT CTCGAAGATG TTATTCTTGA AGCGATCCGA CAAGGCTTCC AGTCATTTGG ACTGAGTGAG CATGCTCCTA GATGGCGGGT CGAAGATCTT TTCCCAGAAG AAGTAGGTTT GCGGTAGCGT ACTGGGTGGG AGGCTCATTA TAGTCTTGCC TACAGGCCGA TTTATGCCCC TCTGACCTCC TCTCGACATA TGAAGACTTT TTGAAGACAG CATTGATCCT ACGCTCGAAA TACGACTCCC AAATTTCCTT GCTGGTGTCT TTAGAGACAG ATTATATCAC TCCCCTCGAT TCAGAAAAAT TGACCTCATT TCTTGTCGAG CATACGGAGA TCGACTATAT CGTTGGCAGT GTCCACCACG TCAATGGTGT GTCCATCGAT TTTGACCGAC CAACATGGCT GAGGGCTGTT AAACTTGCCA AGGAGGGTAG AATAGGTAAA ACAATGGACC CCGGACCGCC ACCAACACTT GAGTTGGGTG ATCCAAATGA TCCTGAGCTC ATGACGACCT ACACCCCAGA TCTCTTGTCT GTCCAACCTT TCTTTGAAGC ATATTTCGAC GCTCAATATG ATCTCATTGT AAAGCATCAA CCAGAGGTTC TAGGTCATAT TGACCTATGC TCCTTATGGA TACCAAATAT CAGCTTGATG GAGCAGGAAC CTGTCTGGCA AAAGGTAATA AGGAATGTCA AGGCGGTAAT CGCCTACGGA GGTCTATTTG AAGCGAACGC TGCGGCTATT AGGAAAGGTT GGAAGACAAG TTATCCATGC AGGGATATAC TCCAAGTGAG TATCTATTTT GCTAACAGGA ACATCTAGAC GAAATGATTA AATACTGTTT GGCCTTGCAG TTGATCCAGG AACTGGGGGG AAGAGTTTGT TTGTCTGATG ATTCACATGG CATTTCTTAT GTGGGACTCA ACTACCTTAA GATGAGAGAC TATCTAAAGG GCATGGGTCT TGAGCGTACA TGGTACCTTG TTTCATCGAG TCGCCGCCAG ACTGGAGATT ACACCGTTGG TGAGCGGGGC CGGGTTGCAG CTAGGCCGTT GGATGGGTGG TATGATCACC CTTTCTGGGC GAAACTATCG GATGCTCAAC GACGTAAATG AGTAAATGGA TGGGTTAGGA AATGAGTAGC
|
Protein sequence | MPHSHHSHSG QFCRHAKDNL EDVILEAIRQ GFQSFGLSEH APRWRVEDLF PEEADLCPSD LLSTYEDFLK TALILRSKYD SQISLLVSLE TDYITPLDSE KLTSFLVEHT EIDYIVGSVH HVNGVSIDFD RPTWLRAVKL AKEGRIGKTM DPGPPPTLEL GDPNDPELMT TYTPDLLSVQ PFFEAYFDAQ YDLIVKHQPE VLGHIDLCSL WIPNISLMEQ EPVWQKVIRN VKAVIAYGGL FEANAAAIRK GWKTSYPCRD ILQLIQELGG RVCLSDDSHG ISYVGLNYLK MRDYLKGMGL ERTWYLVSSS RRQTGDYTVG ERGRVAARPL DGWYDHPFWA KLSDAQRRK
|
| |