Gene Information |
Locus tag | CND06120 |
Symbol | |
ID | 3257456 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006686 |
Strand | + |
Start bp | 1686446 |
End bp | 1689185 |
Gene Length | 2740 bp |
Protein Length | 852 aa |
Translation table | |
GC content | 52% |
IMG OID | 638256552 |
Product | histidinol dehydrogenase, putative |
Protein accession | XP_570519 |
Protein GI | 58266726 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0139] Phosphoribosyl-AMP cyclohydrolase; [COG0141] Histidinol dehydrogenase |
TIGRFAM ID | [TIGR00069] histidinol dehydrogenase; [TIGR03188] phosphoribosyl-ATP pyrophosphohydrolase |
|
|
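The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the source record. It assumes the Start bp and End bp coordinates are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return codons * 3


# Values copied from the Gene Information table.
start_bp, end_bp, protein_aa = 1686446, 1689185, 852

span = gene_length(start_bp, end_bp)   # matches the listed Gene Length
cds = coding_length(protein_aa)        # coding bases including the stop codon
non_coding = span - cds                # bases in the span that are not translated

print(span, cds, non_coding)  # 2740 2559 181
```

The 181 bp surplus agrees with the "Is gene spliced | Yes" field: the genomic span holds more bases than the 852-residue protein requires. The remainder is intron sequence, untranslated sequence, or both.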
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATCCACATA CAAAATGAGC ACTCCCCCTT TCCTCCCCCT TGTCACCTCG CAGGACACCA CTCTCCTTCC CTCTCTCGCT CTTATCACCC CTGTCCTCAT CCCCTCCGAC CACCTCGAAC AAATCCGCCA ATCATTGCCT GCAAATGCGT CATACTACGT CCAAGCAAAC GACAACGACG ACTTAATCGC TCTCCTCGAC GGTGGCGCCG AGAAGCTCGT CGTTACCCCC CAACAATTAG GGGCTGGTGG CGCGGGTATC CCCAAGGAAA GACTTATTCT CCAAGTCTCT GAGGAAGAAC TTTCTACTTC CAAGCGTTTT GCCCAGCAGA CTGGTGGTAT CCTCATCATC TCTTGTGTCC CTCACAATGC CAAATCGCTC GCGCTCCCTG GCGTGGACGT CTTTCTCCAA CTTCCTGAAG TGCAACCTCT TCGGATCTTA AACCTCATCA GATCCTCTCG CCCTTCTTCC TACGTCATTC CTTCTTCATA CCTTTCCATT GAGTCCTCCA CTACCGCCGA GAAAATCTCC ATTCCCGAAG CCTTCCTTGC CCCTATCATT TCTGACCGCC CCGATGGTCT TTTCCCCACT ATCGTGTCCT CTTACAGCCA CTCTACCACC CCTCTTGGTT TGGTCTACTC TTCCATTGAA AGCGTCAAGG AGTCGATCCT TACACAGAAG GGCGTTTACC AATCTAGAAA GCACGGTTTA TGGAGGAAGG GCGAGACTAG CGGTGCGGTG CAACAGGTCA CCGGCATCAA GCTCGACTGT GATAATGATG CTTTGATCTT TGAGGTTGTC CAGCACGGTT CTGGTTTCTG CCACCTCCCT CAATCAACAT GTTTTGGTAA CCTTTCCGGT ATCGCCAAGC TCTCCGACAC CCTCACGTCC CGTCTTGCTT CTGCTCCCGA AGGCTCCTAC ACCAAGCGAC TTTTCACAGA TGAAAAGCTT TTGAGAAGCA AGATCATGGA GGAAGCCGAG GAGCTCTGTG ACGCCCAAAC TAAAGAAGAG GTTGCGTTTG AGGCGGCTGA TTTGGTCTAC TTTGCCTTGA CAAGATGTGT CAGCAAAGGC GTGAGCTGGA GAGACGTTGA GGCAGCTTTA GACAAGAAGG CGTTGAAGGT GACAAGAAGG AAGGGTGATG CCAAACCCAA GTGGGAGGAA AAGACCAGAG AGGTTGTAAA TGAAAACGGA GAAGCCAAAC CTACCGTCCC CGAACCAACC AAGCTTCCCG AGACCGAGTC TGAAGATGTC CCTATCAAGA TGCGAGCCGT TACCCTCTCT ACTCTCTCCG TCCTTGAGCA AAAAGACCTT CTCCTCCGAC CCGTTCTCAA CTCACTGGCC ATGATTGACA AGGTCAAACC CATTGTCGAG CGCGTCCGAC AGGAAGGCGA CGCCGGTTTG AAAGCCATGA CCAAGCAATT CGACCGTGCC GATCTTTCAT CCAACGTTCT TCTCCCTCCC TTCGAGACCC CAGGAGAAGA TGTACTGCCC AAGGATGTGA GAGAAGCGAT TGATGTAGCG TACAACAATG TCAAAGAATT CCACCAAGCT CAAAACGAAA AGGAGCCGCT TGTGGTGGAG ACTATGCCTG GCGTCACTTG TTCTCGATTC GCTCGACCCA TCGCCCGAGT TGGTGTCTAC GTACCCGGTG GTACTGCCAT CCTCCCTTCT ACGGCTATTA TGCTCGGCGT CCCTGCCCAA GTTGCCGGCT GTAAAACCAT TGTCCTCGCT ACCCCTCCCC GACAAGACGG ATCCATCTCC CCTGAAGTTT TGTACGTCGC CAAGCTTACG GGTGTTACTT 
GTATCTTGAA GGCTGGTGGT GCTCAGGCTG TAGGCGCCAT GGCCTATGGA ACGGATGAGG TTCCCAAGGT GGATAAGATC TTTGGTCCTG GTAACCAGTG GGTTACTGCG GCCAAGATGT TGGTGCAGAA TGATACGGAT GCATTGGTAG CTATCGACAT GCCTGCTGGT CCATCCGAAG TCCTTGTAAG TTATATCTAC TCCTGTTTCC TCTTCCTCTG CCAGGGGACA CATTGTGGCA AAACTAAGCG ATATGTAACA GGTCATCGCC GACTACACTG CCAACCCCGT CTTCGTTGCT TCCGACCTCC TCTCTCAAGC CGAACACGGC GTCGACTCCC AAGTCATCCT CCTCGCCATT AATCTCACCC CTGAGCACCT CGCTGCTATC GAAGCCGAAA TTGATCGACA AGCCCGGGCG CTCCCCCGCG TCAAGATTGC GAGGGAGGCG ATCAAGAAGA GTGTCACCGT TGAGGTCAAG GATTTGGAAG AGGCTGTCAA GTTTAGCAAT GAATATGCTC CTGAGCACTT GATTTTGCAC CTTGAGAAGG CGGAAGAGGT TGTGGCTGAG ATTGAGAATG CGGGTAGCGT GTTTGTTGGT CCTTTCTCTC CAGAATCGTA AGTCTTGCCA TCTATCGCTT AAAGCATTTA TACACTCAAA CTGGCCCCGG GACAAATGCT AATGCTCCTT TTCCTTTAAT TCATCTAGAT GCGGTGATTA TGCCTCTGGT ACCAACCACA CCCTCCCGAC GAACGGTTTT GCTCGTCAAT TCTCTGGTGT CAACACTCTT TCTTTCCAAA AACACATTAC TTCCCAGATC GTCAGTGCGG AAGGGCTGAA GAAGTTGGGT CCGTATGTTA TCAGGTTGGC TGAGAGGGAA GGGTTGGAGG CGCATGCGAA TGCTGTGAGG GTTAGGTTGG CAGAGTTGAA CAAACAATAA
|
Protein sequence | MSTPPFLPLV TSQDTTLLPS LALITPVLIP SDHLEQIRQS LPANASYYVQ ANDNDDLIAL LDGGAEKLVV TPQQLGAGGA GIPKERLILQ VSEEELSTSK RFAQQTGGIL IISCVPHNAK SLALPGVDVF LQLPEVQPLR ILNLIRSSRP SSYVIPSSYL SIESSTTAEK ISIPEAFLAP IISDRPDGLF PTIVSSYSHS TTPLGLVYSS IESVKESILT QKGVYQSRKH GLWRKGETSG AVQQVTGIKL DCDNDALIFE VVQHGSGFCH LPQSTCFGNL SGIAKLSDTL TSRLASAPEG SYTKRLFTDE KLLRSKIMEE AEELCDAQTK EEVAFEAADL VYFALTRCVS KGVSWRDVEA ALDKKALKVT RRKGDAKPKW EEKTREVVNE NGEAKPTVPE PTKLPETESE DVPIKMRAVT LSTLSVLEQK DLLLRPVLNS LAMIDKVKPI VERVRQEGDA GLKAMTKQFD RADLSSNVLL PPFETPGEDV LPKDVREAID VAYNNVKEFH QAQNEKEPLV VETMPGVTCS RFARPIARVG VYVPGGTAIL PSTAIMLGVP AQVAGCKTIV LATPPRQDGS ISPEVLYVAK LTGVTCILKA GGAQAVGAMA YGTDEVPKVD KIFGPGNQWV TAAKMLVQND TDALVAIDMP AGPSEVLVIA DYTANPVFVA SDLLSQAEHG VDSQVILLAI NLTPEHLAAI EAEIDRQARA LPRVKIAREA IKKSVTVEVK DLEEAVKFSN EYAPEHLILH LEKAEEVVAE IENAGSVFVG PFSPESCGDY ASGTNHTLPT NGFARQFSGV NTLSFQKHIT SQIVSAEGLK KLGPYVIRLA EREGLEAHAN AVRVRLAELN KQ
|
| |
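The sequences above are shown in space-separated blocks of ten characters. As a minimal sketch (not the database's own tooling), the helper functions below strip that formatting, compute GC content, and locate the first ATG. The function names are hypothetical, and the example uses only the first two blocks of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def first_start_codon(seq: str) -> int:
    """0-based index of the first ATG, or -1 if none is present."""
    return seq.find("ATG")


# First two ten-base blocks of the gene sequence, copied from the record.
excerpt = clean_sequence("CATCCACATA CAAAATGAGC")

print(len(excerpt))                # 20
print(first_start_codon(excerpt))  # 14
print(gc_content(excerpt))         # 40.0
```

In this excerpt the first ATG starts at 0-based index 14, so 14 bases precede it. The codons from that point (ATG AGC ...) correspond to the leading M and S of the protein sequence. Running `gc_content` on the full cleaned gene sequence would be the way to compare against the listed 52% figure.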