Gene CNK01390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNK01390 
Symbol 
ID3254640 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006680 
Strand
Start bp407784 
End bp409494 
Gene Length1711 bp 
Protein Length457 aa 
Translation table 
GC content53% 
IMG OID638253628 
Producthistone acetylation-related protein, putative 
Protein accessionXP_567818 
Protein GI58260816 
COG category[B] Chromatin structure and dynamics 
COG ID[COG5034] Chromatin remodeling protein, contains PhD zinc finger 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.410495 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCCATATAAT TCGCCCTACA CGCACCCGTA CCCGTATAGA ACCACACTAC AATGCCACCC 
AAGCTCACAG CTCTGCAACC TCCATCCCCC TCACCTCCAC CCCCCACAAA CGACGCTCAG
GACCTCCTCG TCATACAGGA TATCATGGAT ACTCTCGACC AGATCCCACC AGAGCTCACA
AGAGTTCACA GCGACCTAAA TGAGCTTGGT GCTGTACTAT ACTGTAAGCG GATATCCTTT
ACGACCTTTT CGCTGGGCGC AAAAGCTGAT TTAGCCGGAT GTAGCTACTC TTGTCAGCCT
AGAGAAGAAG CTCTACACGC TAATTGACTG GATCCAAGAT CCCAATGTCA CACCAGAAAA
ACGGTTTGAG CTCCTGCAGG AGATTGCAGA GGAGGCCGCA AGGTACAAGC TCGGCGGTGA
TGATAAGATT CGAGTAGCTG CCGGTGCTTG TGATGGCGTA AGTGTTTGTT CTCCCTTGGG
TCGCGCAACA TTTTTGCTCA TCCCCCGGAT CCAGATTTTG AACCACCAGA AACATATCTC
CAATCTCCTT GCATCTTCCA CCTTGCTCGT TCCATCCCCA CCTTCTCCCT ACTCTCAATC
CCTCACTCTT CCCTTCCCTC AACCGGTTAC TAATTCTCGC CGCGTTGCCC GGGCTGCCAA
TTCGCCGTTC GGCAGTCGGG GCTATACCGC AAACGGGGGA CCGTCCGAGA CCAAGGTGGG
TGATACGCCT AGCAAAAAGA AGAGGAGTCG AGTGCAACAG TTGGGGGCAA GGGATGATGA
TGAGTCGTCG AGTGCGGGCG GGGAGAAGAA GAAGCCTGTT AAGCGAAGAA AGCAGTACGT
CTATCTTGTA TGGTCCCCAC ACGCCAACTG GTCTGATGGG CAAGCAGAAA CAGAGCGACT
TCACCTACCG ACTCTGTCGT CTCTAACTCT GGTTTTGGCG GGAAACCCAT CGAACCCCGT
ACTGCGCGTC AACTCGCCGC TGCTGCCAAC AGGGCCCGCC GTGAAGCCGA TGACGGAGGC
TCCGACACTG AATCACGAAC AGGCAACGAC GAAAAGCGCA GCAACGTACC CATGCAACCC
TCTTTTTCAA TCGACTCAAA GCGCACCGAT GGTTTGGGCT TGGATACAGG CAGCAGAGAA
GGCAGCGGTG TGAGAAGTGC AAACGTAACG CCTACTCTTG GCTACGCCGC GGTCATGCCC
CCTACGGCTG AACTCAAGCG TCCATCAAGG CGAGGCGGTA AGCGTTCAAG CACAAATATT
CCTGCTGCCG AAGAATTCGA GGAGGAAGAG TCCGAGGGAT ACGGTGACGA CGATAGGTCG
AAGAGAGGAG AGAGGTATGC CGAGATGGAG ATGGGTGAGG AGACTGGAGG TGCTGGGGAT
GATCTAGATT CAAAGGTGTA CTGCACTTGT AGGCAGGTCA GTTATGGCGA GATGATTGGA
TGTGACGATG ACGATTGTGA GATTGAATGG GTGAGTCCAT GTCTTTTATA GTTGGCTGTG
GCGTTGCTGA CAGCTCTGGG CGATTTAGTA TCACATTGGG TGTCTCGGTC TGGATAAAAC
TCCTGCAGGG AACTGGATCT GTCCCAGGTG TATTGAGAGG CGAAAGAAAC AGCCACGGGG
CAAGAAGGGG ACTCGAGGCA AGGCCCGAAA GTAAAAGCTA CTATCGTTAC TGGCACATAC
ACTCTTATTC CTTAAAGCAG GTTGTGATAT T
 
Protein sequence
MPPKLTALQP PSPSPPPPTN DAQDLLVIQD IMDTLDQIPP ELTRVHSDLN ELGAVLYSTL 
VSLEKKLYTL IDWIQDPNVT PEKRFELLQE IAEEAARYKL GGDDKIRVAA GACDGILNHQ
KHISNLLASS TLLVPSPPSP YSQSLTLPFP QPVTNSRRVA RAANSPFGSR GYTANGGPSE
TKVGDTPSKK KRSRVQQLGA RDDDESSSAG GEKKKPVKRR KQNRATSPTD SVVSNSGFGG
KPIEPRTARQ LAAAANRARR EADDGGSDTE SRTGNDEKRS NVPMQPSFSI DSKRTDGLGL
DTGSREGSGV RSANVTPTLG YAAVMPPTAE LKRPSRRGGK RSSTNIPAAE EFEEEESEGY
GDDDRSKRGE RYAEMEMGEE TGGAGDDLDS KVYCTCRQVS YGEMIGCDDD DCEIEWYHIG
CLGLDKTPAG NWICPRCIER RKKQPRGKKG TRGKARK