Gene CNG00240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNG00240 
Symbol 
ID3258627 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006692 
Strand
Start bp56454 
End bp59665 
Gene Length3212 bp 
Protein Length940 aa 
Translation table 
GC content48% 
IMG OID638257638 
Productconserved hypothetical protein 
Protein accessionXP_571756 
Protein GI58269200 
COG category[B] Chromatin structure and dynamics 
COG ID[COG5027] Histone acetyltransferase (MYST family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0793418 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGACCC CAGCGCATCA TGGCCACCGA CTACATGGCC ACGGGGCAGT CATCAGAGAG 
GACTTCTGTA GCTTCTGCGG AGGTACAGAC GCCATTAACA AGCAAGGCGT CCAGGAAACC
ATGGTCAGCT GCGCGGCCTG TGGACGGAGT GGCCATCCGA CATGCCTGAA CATGCTTACT
CCAAAGCTCA GGAAGCGGGT AATGATGTAT GACTGGCACT GTATAGAGTG CAAGACGTGT
GAGCAGTGTG CAATCAAAGG TGACGATGTG AGTACTATCA TCTCGTCCCG CAAGCACATT
GCGCTGATGA AAGTAGTCGC GGTTGATGTT CTGCGATACA TGTGATCGTG GATGGCATAG
CTACTGCCTG AACCCGTACG TGTATTGCTA GAAATATGCA AATGAGTACT GACTTATTGC
AGGCCACTGG CAAAACCACC AAAAGGTATG TTGTTGACAG TAGATACGTG TGTATTGACG
CGTTATTGAC GCGAACTGGC TGTGTAATAG GTTCATGGCA TTGTCCAAAA TGTTTATCAC
CACCTGCAGT CTCATCAGGA TCTATTAGCA ACCCCAGATC AGCTACCCGA CCCTCAAAGT
TACACCCACG CCCTTCAAAA CCAGGCAAAG CTCGACCAGC CAACACTCCC AACACCTCTA
ATAATCGTCG TCGGCCAAAA CAATCATTAG CAGGAGACGA TGCCTTGTTT ACTAGCCACC
GTATCAAAGT CAAGGTACCG AATCCTAATT ATCAATACAG GGATTCGGAA GAAGGAAGAG
GAACACCTAT GATTGTACGA CTGAAAGTAC CCAAAAGACC AGTCGAAGAA GAGCCGGAAG
AAAAGAAAAT ACCGTATGGA GGGGTCATAA CAGGTGACGA CGCAGATACT ACTCGGACAA
AGATAACAGA AGCAGATAAG GAAGCGTACC AGATGGCAAA GAATGCAGCA GAAAAGCAAC
TCGGTGGTCC TGTCCCTACA AGGGAAACGC CAGGGCCCGG CTCACCTCTG CCCATGGCAT
CACCTAGCGG TAAAACTACA CCTTCTTCAA AGTTCCCAGC CACAAGCAGA CCTCTCCGAG
ACCGACTACT CCACCAAACC TTACCCGACG CGTACCCATT CCCTTCCACA CCAGGCACAA
CGCAAGAAGT GGTTCCTTGG ACAGGGAGCG CAAGATTAGA GAAAATCAAA ACTATTAGGT
TTGGGCCGTA TGATATCAAC ACATGGTACT CTGCGCCATA TCCCGAAGAG TATGCATATG
TGCCGGATGG GAGGTTGTGG TTGTGCGAGT TTTGTTTAAA GTATATGAAA AGCGGATTTG
CTGCGACGCG GCATAGGGTA TGTCTCTTCA ACATCATAGT GCGATGTGCA CTGACGCGGT
ATATAGTTGA AATGCAAATC AAGACATCCG CCGGGAGATG AGATCTATCG CGAAGGTGCT
GTCTCAGTTT TTGAAGTGGA TGGACGCAAA AACAAGGTAG GTCTCCTGTC CTTTCCATTT
TATCTCATTA ACTCCTTTAC TAGATCTACT GTCAAAATCT TTGTCTTCTC GCCAAGATGT
TTCTCGATCA CAAAACGCTC TATTACGACG TCGAACCGTT TCTTTTCTAT GTCATGACCG
AAGTCGATGA ATTAGGCGCT CGATTTGTCG GATATTTTTC AAAGGAGAAG CGGAGTATGG
ACAACAACGT TAGTTGTATC ATGACCCTGC CGGTGCGACA ACGTAAAGGA TGGGGTCAGC
TTTTGATTGA TTTCAGTGGG TCATCATTGC CTTTTTTTGT GAGGAGCAAT ACACTAAGTG
TGTACAGGTT ATCTCCTATC GAAGAAAGAA GGACGAACAG GTTCGCCTGA AAAACCACTT
TCTGGCCTAG GAGCCGTCTC ATACAAATCC TACTGGCGTC TCACTGTTTT CAAATACCTC
CTCAACGCCA TCTCTCCATC TTTCAACCAT ACTCTCGAAC TACCCCCCGT CCCAGACGCC
ACACCTGGTC CTACATCCGA ATTGGACTTC AACTCTAACA CAGAAACCAA ACCCACTCCT
CCTCGCATAA CATCCAAAGA CATATCCAAA GCTACTAGCA TGACGCTAGA AGATATTTTC
ACCACGCTAT CTGCTGAAGG AATGATCAAT GTTCTGGATG ACCTGACGGT CGATGCGATA
GGGAAAACAC CAAACAGTGC TCGAACAAGA GGTCGGAGCC GGGGTCGTCC GAATGTAAAT
CGTCGCAAGG CAGATTTGAA TGGCTCAGGT ACGCTTGATC CCCAAATACA TCAGGATGAA
GACGATCATG TCAAGCTACC AAAACGGTAT GAGATTTTGC TGGATAAAGC GTATCTTCAA
GCGGTGGTGG AGAAACATGA GAAAAAGGGG TACTTGAAGC TTGCACCGGA GAGATTGAAG
TATCACCCAT TCTTGGTTGC TAGACAGACT GAGGAACCGG AGAGTGGTGG AAAGAAAGAG
GATGAGAACG AGGAAAATGG GAATGAAAAA GATAAGGAAA GGGAGGATGA GAGCGAAAAT
GGAGTCGTGA GGTTTGTGGA CTCAATTGCC GAAGCAGCGC ATATCATCGC CCAATCGCAC
GCGCATTTTC ATACTGTTCC ATCCTCTACA ATATCCCGCA CATCGCATCA TCCCGTCTAT
CCTACACCAA CCCCGAATCA CAACTCGATC CCCAATCGCA ATACTACATC ACCCGCGAGA
AATTTGCGTA AACGCAAGTC TGATGTTGTG CTCGAAACAC CCGTGAGAAA GTTGAGAAGT
CGGGATAACA TTCGAGAAGG AATGGGAACG TCGCCGAGGA CATTGCGTAT GAGGAATGCT
GTGGCGCTGG GCGAAGGAAT TGCAGAGGGG GAAGAGGAAA ACCAAGGGCT TTCCAAGTTG
AATCAATTAC CTGTCAAGCC TGTTACCCAA TTCGCTGATG GCGAGATTCC TATCGACCCA
GCGCTTCTCG AAGAAAGTGT CACTCTTGAC CCTAGTATGT ATGGTAGTGT CGGTGAAGAA
GTCTACGACA ACGATGCAGA AGGGGAAGAA TATATCGGCG AAGATGACGA TGCAGAAGGA
GAAGAGTATA TTGGGGAAGA GGAAGATGCA GAAGGTGAGG CGGATGAGGA GTATATTGAC
TATGATGTTT GATTAGGTCA TTCTTTTGTA CAAGTTTCGG GCTTTGGGTT GCATTGACCC
TATATATAAT CGTCGTGCAT GGTATGAATG TG
 
Protein sequence
MQTPAHHGHR LHGHGAVIRE DFCSFCGGTD AINKQGVQET MVSCAACGRS GHPTCLNMLT 
PKLRKRVMMY DWHCIECKTC EQCAIKGDDS RLMFCDTCDR GWHSYCLNPP LAKPPKGSWH
CPKCLSPPAV SSGSISNPRS ATRPSKLHPR PSKPGKARPA NTPNTSNNRR RPKQSLAGDD
ALFTSHRIKV KVPNPNYQYR DSEEGRGTPM IVRLKVPKRP VEEEPEEKKI PYGGVITGDD
ADTTRTKITE ADKEAYQMAK NAAEKQLGGP VPTRETPGPG SPLPMASPSG KTTPSSKFPA
TSRPLRDRLL HQTLPDAYPF PSTPGTTQEV VPWTGSARLE KIKTIRFGPY DINTWYSAPY
PEEYAYVPDG RLWLCEFCLK YMKSGFAATR HRLKCKSRHP PGDEIYREGA VSVFEVDGRK
NKIYCQNLCL LAKMFLDHKT LYYDVEPFLF YVMTEVDELG ARFVGYFSKE KRSMDNNVSC
IMTLPVRQRK GWGQLLIDFS YLLSKKEGRT GSPEKPLSGL GAVSYKSYWR LTVFKYLLNA
ISPSFNHTLE LPPVPDATPG PTSELDFNSN TETKPTPPRI TSKDISKATS MTLEDIFTTL
SAEGMINVLD DLTVDAIGKT PNSARTRGRS RGRPNVNRRK ADLNGSGTLD PQIHQDEDDH
VKLPKRYEIL LDKAYLQAVV EKHEKKGYLK LAPERLKYHP FLVARQTEEP ESGGKKEDEN
EENGNEKDKE REDESENGVV RFVDSIAEAA HIIAQSHAHF HTVPSSTISR TSHHPVYPTP
TPNHNSIPNR NTTSPARNLR KRKSDVVLET PVRKLRSRDN IREGMGTSPR TLRMRNAVAL
GEGIAEGEEE NQGLSKLNQL PVKPVTQFAD GEIPIDPALL EESVTLDPSM YGSVGEEVYD
NDAEGEEYIG EDDDAEGEEY IGEEEDAEGE ADEEYIDYDV