Gene CND06120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCND06120 
Symbol 
ID3257456 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006686 
Strand
Start bp1686446 
End bp1689185 
Gene Length2740 bp 
Protein Length852 aa 
Translation table 
GC content52% 
IMG OID638256552 
Producthistidinol dehydrogenase, putative 
Protein accessionXP_570519 
Protein GI58266726 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0139] Phosphoribosyl-AMP cyclohydrolase
[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase
[TIGR03188] phosphoribosyl-ATP pyrophosphohydrolase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CATCCACATA CAAAATGAGC ACTCCCCCTT TCCTCCCCCT TGTCACCTCG CAGGACACCA 
CTCTCCTTCC CTCTCTCGCT CTTATCACCC CTGTCCTCAT CCCCTCCGAC CACCTCGAAC
AAATCCGCCA ATCATTGCCT GCAAATGCGT CATACTACGT CCAAGCAAAC GACAACGACG
ACTTAATCGC TCTCCTCGAC GGTGGCGCCG AGAAGCTCGT CGTTACCCCC CAACAATTAG
GGGCTGGTGG CGCGGGTATC CCCAAGGAAA GACTTATTCT CCAAGTCTCT GAGGAAGAAC
TTTCTACTTC CAAGCGTTTT GCCCAGCAGA CTGGTGGTAT CCTCATCATC TCTTGTGTCC
CTCACAATGC CAAATCGCTC GCGCTCCCTG GCGTGGACGT CTTTCTCCAA CTTCCTGAAG
TGCAACCTCT TCGGATCTTA AACCTCATCA GATCCTCTCG CCCTTCTTCC TACGTCATTC
CTTCTTCATA CCTTTCCATT GAGTCCTCCA CTACCGCCGA GAAAATCTCC ATTCCCGAAG
CCTTCCTTGC CCCTATCATT TCTGACCGCC CCGATGGTCT TTTCCCCACT ATCGTGTCCT
CTTACAGCCA CTCTACCACC CCTCTTGGTT TGGTCTACTC TTCCATTGAA AGCGTCAAGG
AGTCGATCCT TACACAGAAG GGCGTTTACC AATCTAGAAA GCACGGTTTA TGGAGGAAGG
GCGAGACTAG CGGTGCGGTG CAACAGGTCA CCGGCATCAA GCTCGACTGT GATAATGATG
CTTTGATCTT TGAGGTTGTC CAGCACGGTT CTGGTTTCTG CCACCTCCCT CAATCAACAT
GTTTTGGTAA CCTTTCCGGT ATCGCCAAGC TCTCCGACAC CCTCACGTCC CGTCTTGCTT
CTGCTCCCGA AGGCTCCTAC ACCAAGCGAC TTTTCACAGA TGAAAAGCTT TTGAGAAGCA
AGATCATGGA GGAAGCCGAG GAGCTCTGTG ACGCCCAAAC TAAAGAAGAG GTTGCGTTTG
AGGCGGCTGA TTTGGTCTAC TTTGCCTTGA CAAGATGTGT CAGCAAAGGC GTGAGCTGGA
GAGACGTTGA GGCAGCTTTA GACAAGAAGG CGTTGAAGGT GACAAGAAGG AAGGGTGATG
CCAAACCCAA GTGGGAGGAA AAGACCAGAG AGGTTGTAAA TGAAAACGGA GAAGCCAAAC
CTACCGTCCC CGAACCAACC AAGCTTCCCG AGACCGAGTC TGAAGATGTC CCTATCAAGA
TGCGAGCCGT TACCCTCTCT ACTCTCTCCG TCCTTGAGCA AAAAGACCTT CTCCTCCGAC
CCGTTCTCAA CTCACTGGCC ATGATTGACA AGGTCAAACC CATTGTCGAG CGCGTCCGAC
AGGAAGGCGA CGCCGGTTTG AAAGCCATGA CCAAGCAATT CGACCGTGCC GATCTTTCAT
CCAACGTTCT TCTCCCTCCC TTCGAGACCC CAGGAGAAGA TGTACTGCCC AAGGATGTGA
GAGAAGCGAT TGATGTAGCG TACAACAATG TCAAAGAATT CCACCAAGCT CAAAACGAAA
AGGAGCCGCT TGTGGTGGAG ACTATGCCTG GCGTCACTTG TTCTCGATTC GCTCGACCCA
TCGCCCGAGT TGGTGTCTAC GTACCCGGTG GTACTGCCAT CCTCCCTTCT ACGGCTATTA
TGCTCGGCGT CCCTGCCCAA GTTGCCGGCT GTAAAACCAT TGTCCTCGCT ACCCCTCCCC
GACAAGACGG ATCCATCTCC CCTGAAGTTT TGTACGTCGC CAAGCTTACG GGTGTTACTT
GTATCTTGAA GGCTGGTGGT GCTCAGGCTG TAGGCGCCAT GGCCTATGGA ACGGATGAGG
TTCCCAAGGT GGATAAGATC TTTGGTCCTG GTAACCAGTG GGTTACTGCG GCCAAGATGT
TGGTGCAGAA TGATACGGAT GCATTGGTAG CTATCGACAT GCCTGCTGGT CCATCCGAAG
TCCTTGTAAG TTATATCTAC TCCTGTTTCC TCTTCCTCTG CCAGGGGACA CATTGTGGCA
AAACTAAGCG ATATGTAACA GGTCATCGCC GACTACACTG CCAACCCCGT CTTCGTTGCT
TCCGACCTCC TCTCTCAAGC CGAACACGGC GTCGACTCCC AAGTCATCCT CCTCGCCATT
AATCTCACCC CTGAGCACCT CGCTGCTATC GAAGCCGAAA TTGATCGACA AGCCCGGGCG
CTCCCCCGCG TCAAGATTGC GAGGGAGGCG ATCAAGAAGA GTGTCACCGT TGAGGTCAAG
GATTTGGAAG AGGCTGTCAA GTTTAGCAAT GAATATGCTC CTGAGCACTT GATTTTGCAC
CTTGAGAAGG CGGAAGAGGT TGTGGCTGAG ATTGAGAATG CGGGTAGCGT GTTTGTTGGT
CCTTTCTCTC CAGAATCGTA AGTCTTGCCA TCTATCGCTT AAAGCATTTA TACACTCAAA
CTGGCCCCGG GACAAATGCT AATGCTCCTT TTCCTTTAAT TCATCTAGAT GCGGTGATTA
TGCCTCTGGT ACCAACCACA CCCTCCCGAC GAACGGTTTT GCTCGTCAAT TCTCTGGTGT
CAACACTCTT TCTTTCCAAA AACACATTAC TTCCCAGATC GTCAGTGCGG AAGGGCTGAA
GAAGTTGGGT CCGTATGTTA TCAGGTTGGC TGAGAGGGAA GGGTTGGAGG CGCATGCGAA
TGCTGTGAGG GTTAGGTTGG CAGAGTTGAA CAAACAATAA
 
Protein sequence
MSTPPFLPLV TSQDTTLLPS LALITPVLIP SDHLEQIRQS LPANASYYVQ ANDNDDLIAL 
LDGGAEKLVV TPQQLGAGGA GIPKERLILQ VSEEELSTSK RFAQQTGGIL IISCVPHNAK
SLALPGVDVF LQLPEVQPLR ILNLIRSSRP SSYVIPSSYL SIESSTTAEK ISIPEAFLAP
IISDRPDGLF PTIVSSYSHS TTPLGLVYSS IESVKESILT QKGVYQSRKH GLWRKGETSG
AVQQVTGIKL DCDNDALIFE VVQHGSGFCH LPQSTCFGNL SGIAKLSDTL TSRLASAPEG
SYTKRLFTDE KLLRSKIMEE AEELCDAQTK EEVAFEAADL VYFALTRCVS KGVSWRDVEA
ALDKKALKVT RRKGDAKPKW EEKTREVVNE NGEAKPTVPE PTKLPETESE DVPIKMRAVT
LSTLSVLEQK DLLLRPVLNS LAMIDKVKPI VERVRQEGDA GLKAMTKQFD RADLSSNVLL
PPFETPGEDV LPKDVREAID VAYNNVKEFH QAQNEKEPLV VETMPGVTCS RFARPIARVG
VYVPGGTAIL PSTAIMLGVP AQVAGCKTIV LATPPRQDGS ISPEVLYVAK LTGVTCILKA
GGAQAVGAMA YGTDEVPKVD KIFGPGNQWV TAAKMLVQND TDALVAIDMP AGPSEVLVIA
DYTANPVFVA SDLLSQAEHG VDSQVILLAI NLTPEHLAAI EAEIDRQARA LPRVKIAREA
IKKSVTVEVK DLEEAVKFSN EYAPEHLILH LEKAEEVVAE IENAGSVFVG PFSPESCGDY
ASGTNHTLPT NGFARQFSGV NTLSFQKHIT SQIVSAEGLK KLGPYVIRLA EREGLEAHAN
AVRVRLAELN KQ