Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNC02060 |
Symbol | |
ID | 3256683 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | - |
Start bp | 576439 |
End bp | 579436 |
Gene Length | 2998 bp |
Protein Length | 659 aa |
Translation table | |
GC content | 47% |
IMG OID | 638255428 |
Product | histone deacetylase 1-1 (hd1), putative |
Protein accession | XP_569971 |
Protein GI | 58265630 |
COG category | [B] Chromatin structure and dynamics [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.300789 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTCGGTTCG TCTCTTGCAA CTGCCATCAT CCTTCGTTCT GGAATACCGT CCAGCACCCC ACAACAGCCC TTCTGTATAA TTCACAGAAA TAAACGCCAC CATGTCCTCG GCTGCTATTC ATCCTCCTCT CGACTTCTCG CCCGTTCGCC CAGATGGCCG GCGCGTAGCT TACTACTATG ACCACGACGT AGGCAACTAC CATTTTGGAC TCGGCCATCC AATGAAGCCT CATCGGATTA GAATGACACA TAACCTTGTA GTTAACTATG GACTGGCCGA CGACTATGAA GCTATGGAGG AAGAGGGCAG AAGAAAGGTG AATGCTCGGA TGGGGATGGA GAATGATGAA GCAAGATGGG TGAATGCCGA GATGCGAGGG TCGAGAGCAA AACGGATGCA GATCTTTGTG AGTTTAATCA GTACTTTATT AAGTGTCTTC TTGGTCTAGG CATGAAATCA AGCGTAAAAG GGCTGACAGT GAATCCGTCT GCAGAGACCT CGTCGAGCAA CCAAAACAGA TATGACGAGA TTCCATACCG ATGAGTACAT CGAGCTGCTT GAGAGCGTAC TACCGGAAAA TGCGGATGCT CTCACTGGAA ATAGGTCAAG AGGTAAGTAA GAAACCAGCT TTATGGATAG CGCCGTCTGA CAAGCCAATA TCGCTCGATA ACCGAAAGGC TTAACTGGGT CCGATTGTCC GGCAGTCGAA GGTATTTTCG AATTCAGCTC TATTTCTGCA GGTGGATCCA TTGGTGGGTT TTCTTTGATC TTCATCCAGA TTGCTTATTC ATAAAAATAT TATCTCAGGT GCTGCTGAGA AGCTCAATGA AGGTATAGCC GATATCGCAA TCAATTGGGC TGGTGGCCTA CACCACGCCA AAAAAACTGA AGCTAGTGGG TTTTGTTACG TCAACGATAT TGTCCTAGGA ATCCTCGAAC TTTTGCGGTG CGTTATCTCG TAGAGATTTT GTTTCCAGTC CCTAACATTT GGCAGAGTTA ACTCGCGGGT GCTCTACATC GATATCGATG TACATCACGG CGACGGAGTT GAAGAAGCGT TTTACTCTAC AGACAGAGTT ATGACTTGTT CTTTTCATCT CTTTGGTAAC TTCTTCCCAG GAACTGGGAC TCTCAAGGTG TTTGCGAACT ACTTCCATTG CACAGCTCAC CGTGCTGATA CAGAGCGCCG CAGGATGTTG GCCTGGGTAA AGGTAAAGGC TACGCCGTCA ATGTTCCACT GAGAGAAGGT ATCACAGACG AAGGGTTTCA CAGCATCTTT AAACCTGCAA GTCTAGTATT TTCCGGAGGA TCTGGGGGCG AAGTTGTTTA CCATAATGAA TAGGTTATTG CCGAAATCAT GGAGCATTAC CGCCCATGTG TGGTTGTATT GCAAGGCGGA GCGGACAGTA TGTCAGGAGA CAAGTTAGGG AGGTTGAACC TGTCGGACAA AGGTCGGTCT TCTTAATTGC AAATCAGGTT TCAACAGCTG ATATAACGAA TAGGGCATGC TGAGTGTGCC AAGTTTTTGA GGACTTTCAG TGTACCATTG ATGTTACTTG GTGGCGGGGG GTATACGACG TGAGTGTAAA ATATCTACTC GTAGTCCAGT GAGCCTGATG CAAATGTGGA TTTCAGGAAA AATGTGGCAA GGGCGTGGAC TAGAGAAACG GCTATCGCAT GTGGACAAGA GCTATCGGAA GATTTACCAA GCAATCAATA GTATGGCATA CCGCTACTGT CGATTCCCTT ACGCTGACCG ATTTCTGAAG TATGGAATAT TACGGGCCTC GATACAAACT AGAAGTATTA CCTTCCAACG TTGAAGATTT CAATACACCA GAATATCTCG AGGATCTTAA GTACGTCTGC CAGGCTTACA CTCACATTAC AGTCTCTGAA GTATCCATCA CACAGGCGCC AGATTTCCAA CCACTTGAAA AATCTCCCTT TCGCTCCTAG CGCTCAGATG CGACAAATTA CTGGTAGCAA TGTGAGTCAA GCAGTAGGGC TTAGCAATGA ATGGGAGACA GACGATCCGG AGGATCAAAT TGATCAGAGA CTCAAAAGTA CGCCGTCCTG CTAAGTCTTC CACCATCCAT CGCTCACTAA CGCTATAGAA CTGTTCGCAA GTAAGAATCT GAACGGTACC TATACTCAAG AGAGTGACGC TTTGTTGAGT GACTTAACAA GCATTTCAAG AATTAGACGC CAAGGTGGTC CCAAAAAACT GCCTAGGTCT TCAGGAACAT GTGGCAGAGC AAGGAAACGA TATCTTGAAG ATATTGCGAT GGGAGAAGAT CCTTGTTGTC TTCTGCCTGC CATGACCAGC AAAAAAAATG AGAGAAAAAC GATTGCGAAG AGCGGAAGTT CAGCGCCTCC AACATCAGCG CCAGAATGGC ACGCACATTT AACCGTGCAT GAAATGAAAA GCAGACCTAG TCTAGGTGAG ATCCTTCAGC AACCGTCGCC AGGCAGTGAG GACGAATGTG ATCGGAATCG TTGTATCACT GAGGGGTCGC AGCTACATAT GATAGCGCAG GGAAGGGGAA AAAGGAGTTT CTTTTCGCGC AAAGGTCTGC CACCCTCAAT TATATCTCCT CAGGCGATAA TAGGAGGATC AACGGTAAAA GTCGACAGGC TGATCAGAGC GAGGATGGCG GCGGCGACCA ATAATTCTCC GGTAGTGGTG GAATGGGATC GACAACAGGA GTAATGGACA ATTATGGTTT AATTATGAGT GTTCCGTTAG AAAGGGTTGC ACGTGTACAT CATTGTCAAC ACTTGTAGTA AGTAATATGT AGTAGGGCAT GAGGTACGTT GTGGGTAGGG AGCATAACTC GTTTGAAGTG GGCAAGCTAT GATAGTTGTA CTTGGGCGGC TCAGGCCGAG GGAGAGAGCG GAGTCTATTA TTAATTATCC AAGATTCGAG TGTCTCGGCG GCGAGGCGAC CTCAAGTTTT CAAAGCCGAT CCGCATTCTA CGTAGCTACT GGCACAGC
|
Protein sequence | MSSAAIHPPL DFSPVRPDGR RVAYYYDHDV GNYHFGLGHP MKPHRIRMTH NLVVNYGLAD DYEAMEEEGR RKVNARMGME NDEARWVNAE MRGSRAKRMQ IFRPRRATKT DMTRFHTDEY IELLESVLPE NADALTGNRS RGLTGSDCPA VEGIFEFSSI SAGGSIGAAE KLNEGIADIA INWAGGLHHA KKTEASGFCY VNDIVLGILE LLRVNSRVLY IDIDVHHGDG VEEAFYSTDR VMTCSFHLFG NFFPGTGTLK DVGLGKGKGY AVNVPLREGI TDEGFHSIFK PVIAEIMEHY RPCVVVLQGG ADSMSGDKLG RLNLSDKGHA ECAKFLRTFS VPLMLLGGGG YTTKNVARAW TRETAIACGQ ELSEDLPSNQ YMEYYGPRYK LEVLPSNVED FNTPEYLEDL KRQISNHLKN LPFAPSAQMR QITGSNVSQA VGLSNEWETD DPEDQIDQRL KKLFASKNLN GTYTQESDAL LSDLTSISRI RRQGGPKKLP RSSGTCGRAR KRYLEDIAMG EDPCCLLPAM TSKKNERKTI AKSGSSAPPT SAPEWHAHLT VHEMKSRPSL GEILQQPSPG SEDECDRNRC ITEGSQLHMI AQGRGKRSFF SRKGLPPSII SPQAIIGGST VKVDRLIRAR MAAATNNSPV VVEWDRQQE
|
| |