Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNH03520 |
Symbol | |
ID | 3259195 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | + |
Start bp | 93459 |
End bp | 96162 |
Gene Length | 2704 bp |
Protein Length | 621 aa |
Translation table | |
GC content | 48% |
IMG OID | 638258131 |
Product | histone deacetylase 1 (hd1), putative |
Protein accession | XP_572517 |
Protein GI | 58270722 |
COG category | [B] Chromatin structure and dynamics [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGTCTACAAT TGACTCGTCT ATAGAGGAGC TAAATAGACA TCCGGAGCCT CCAATAACAG CGCACTACTC CTCCTGCACT TTCAAGTAGA TCTCCGCGAT ACACAAGCAA TTACCTCCCA CCTCCACCTA CAGCCACAAT GCTGTCCGCC CAGCCACCGA TAGACCCATC CCCACCTCCA TCTTCCTCCT CCTCTCGCAG GGTAGCATAC TATTACGATC AGGATGTGGG AAACTACAAC TACTATCTGG GTCATCCCAT GAAGCCCCAC CGAATAAGGA TGGCGCACAA TTTGATTGTC AACTACGGGA TGTGTGATGA GGAAGGGCAG GAGCATGGCC CCGCAGAAGT ATGGGGCGAA GGAAAGAGGG CAGTGAATGA TGAGATTGCC CAAAATTGGG GGGGAGGCGA GATGGGCGAT GCAGAAGTGA AGTGGGAAAA GGCGGCGCTT AGGGGATCAA GGAGTAAGAC GATGCAGGTT TTTGTAAGTG TTGCTCGCCT TCTTTGGAGA AGTATGACCA ACCAAATTTT TGTAGAAACC TCGAAGGGCT ACCAAAGAAG AAATGACTCG GTTTCATACA GATGAATATA TAGACCTGCT AGAGGCAGTT ACCCCGGAGA CTGCGGATGC TTTAACAGGG GGCGGAACGA GGTGTAAGTC TCTCCTGCAG CATTTTCCCG TACATTTCAG ACCATTGACA TGCCATCTAG GCCTTATTGG TGAGGACTGT CCCGCATTCG ACGGTTTATT CGAGTTCTGC ACCATCTCTG CAGGTGGATC GTTAGGTACG ATTTTGCCCC TTCTGTACTA GATTTGAGCA CTTATCCGAT TGCAATAGGT GCTGCTGAAC GTCTGAACGC CGGTGCTGCC GACATTGTCA TCAATTGGGC TGGTGGTCTG CATCACGCCA AGAAGACTGA AGCAAGTGGT TTCTGCTACG TGAATGACAT GTATGTTCCT GCCCTTTTCT TAAGGTTTCT TAGCTTACAT GTTTGCAGTG TGTTGGGTAT TCTTGAGCTT CTCAGAATAC ATCCCCGAGT TCTCTACATT GATGTTGACG TACATCACGG AGACGGTGTA GAGGAAGCGT TCTACGTAAC GGACAGGGTC ATGACTTGTA GTTTCCACCG ATTTGGAGAA TTCTTTCCTG GTACAGGTGA TGTCAGAGTG AGTTCTTTGT TAGGTGTATA AAGATCTTTT GCTGATAAAA TCGACAGGAT GTTGGTATGA AGAAAGGAAA AGGCTACGCC GTGAACGTCC CACTCCGAGA CGGTATCACC GATGACAGTT TCCAATCAAT ATTCAAGCCT GTGAGTATCC CATCGCGACT CTTTCTAACA GAAGCTGATG TTAAATCAGG TGATTGACCG TATCATGTCA CACTTCCGAC CCAGCGCTGT CGTCCTTCAA ATGGGTGCCG ATTCTATATC AGGCGACAAA CTCGGTGGCT TCAACCTAAC TTTGGAAGGT AGGTTTTCTT TTGCATGTGC TGCCCGAGAA ATAAGAATAA ATAGCTCATG TTCTCCTTCT GAAATAGGTC ATGCGGAGTG TGCCAGATTT ATCAAGAGTT TCAATGTACC TGTTATGATG GTTGGTGGGG GTGGATATAC TGTGTAAGTC CAGCTGATAC TCTGTGCAGA GTTACGATAG CTGATGAATG ATATAGTAAA AACGTTGCAA GGGCTTGGAC TAAGGAGACT GCCATTATGT GCGGTGTGGA CCTTGCTGAG GACTTGCCTT ACAATCAATT GTGCGTCTTA CATTCATCTC TTGTGATGAG GAAAATGTTA ACAGATAAAA TAGCTTGGAA TATTACGGAC CTCGATACAA GCTTGAAGTT TTGCCCACCA ATGCCGTAGA CCATAATCCC CCCGAATACC TGGAGCGTAT CAAGTAAGCT TCCGGTTCCT TCTGGTATGA GCAGCAGCTA ACATTTTGTG AATCAGGAAT CAAGTGTTTG AGAACCTCCG AAGTCTTCCA TTTGCTCCTT CTGCTCAAAT GCGTTCTGTC CCAAGTAAGA CTATTGGCCA GGTTTTGGGT ATAACGGATG GCGGTGACGA GGAACCCGAA GATGAGATTG ATCAGCGTAT AAAGAGTGAG CCAACCATCC CGCCCGAAAC CACATCTTCA TCAGAAAACT CATGAAGATC TCCGTTTTTA TAGAACTGTT GAAACGGCGC CGAGCCAATG TCCTAGATGT CGACGCCTCT TCATCTTCCG AATCCGAAAC CGACACCCCA GCCTTCTCCC GTCCCGGACG AGCCAGACGT CAAGGTGGCA ATTTTTCCGG CCGTGGTCGC TCATCCCGTC TCGCCCAACT GCAACGTAAC CGCGAAAGAA GGGAAGCAGC GCAGGAAGCG TCCAATGAAG TGGAAGATGA TCCTTGTGGG GTGTTGGGGA AGAAGAAGAG GAGTTTCTTC AAGGTGGTTC CCAAGTTGAA TGGTGTGGGA GTTGAGCCGA TAGACGCGAC GGGTTTCGGA ATCGGTGTGG TGACCAATGG ACTCAATGGT GTAGGAGGAT TGAAGGATCG GTTGGGGATA CCGGTGGTAG AGCAGTGGAA AGGGGACGGG ATCGTGTCGA GGGGTGGTAC GCCAGCGAGT ATAGCGTGAA AAGGCATGTG CCTCATATCA AAATGAGGGA TGTTGCAATC TTTTTTTGGT TTCAGTTTTG CGTTCATACT TCATGGTCAG TATGCATTAT TAGT
|
Protein sequence | MLSAQPPIDP SPPPSSSSSR RVAYYYDQDV GNYNYYLGHP MKPHRIRMAH NLIVNYGMCD EEGQEHGPAE VWGEGKRAVN DEIAQNWGGG EMGDAEVKWE KAALRGSRSK TMQVFKPRRA TKEEMTRFHT DEYIDLLEAV TPETADALTG GGTRCLIGED CPAFDGLFEF CTISAGGSLG AAERLNAGAA DIVINWAGGL HHAKKTEASG FCYVNDIVLG ILELLRIHPR VLYIDVDVHH GDGVEEAFYV TDRVMTCSFH RFGEFFPGTG DVRDVGMKKG KGYAVNVPLR DGITDDSFQS IFKPVIDRIM SHFRPSAVVL QMGADSISGD KLGGFNLTLE GHAECARFIK SFNVPVMMVG GGGYTVKNVA RAWTKETAIM CGVDLAEDLP YNQFLEYYGP RYKLEVLPTN AVDHNPPEYL ERIKNQVFEN LRSLPFAPSA QMRSVPSKTI GQVLGITDGG DEEPEDEIDQ RIKKLLKRRR ANVLDVDASS SSESETDTPA FSRPGRARRQ GGNFSGRGRS SRLAQLQRNR ERREAAQEAS NEVEDDPCGV LGKKKRSFFK VVPKLNGVGV EPIDATGFGI GVVTNGLNGV GGLKDRLGIP VVEQWKGDGI VSRGGTPASI A
|
| |