Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNG00240 |
Symbol | |
ID | 3258627 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | + |
Start bp | 56454 |
End bp | 59665 |
Gene Length | 3212 bp |
Protein Length | 940 aa |
Translation table | |
GC content | 48% |
IMG OID | 638257638 |
Product | conserved hypothetical protein |
Protein accession | XP_571756 |
Protein GI | 58269200 |
COG category | [B] Chromatin structure and dynamics |
COG ID | [COG5027] Histone acetyltransferase (MYST family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0793418 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGACCC CAGCGCATCA TGGCCACCGA CTACATGGCC ACGGGGCAGT CATCAGAGAG GACTTCTGTA GCTTCTGCGG AGGTACAGAC GCCATTAACA AGCAAGGCGT CCAGGAAACC ATGGTCAGCT GCGCGGCCTG TGGACGGAGT GGCCATCCGA CATGCCTGAA CATGCTTACT CCAAAGCTCA GGAAGCGGGT AATGATGTAT GACTGGCACT GTATAGAGTG CAAGACGTGT GAGCAGTGTG CAATCAAAGG TGACGATGTG AGTACTATCA TCTCGTCCCG CAAGCACATT GCGCTGATGA AAGTAGTCGC GGTTGATGTT CTGCGATACA TGTGATCGTG GATGGCATAG CTACTGCCTG AACCCGTACG TGTATTGCTA GAAATATGCA AATGAGTACT GACTTATTGC AGGCCACTGG CAAAACCACC AAAAGGTATG TTGTTGACAG TAGATACGTG TGTATTGACG CGTTATTGAC GCGAACTGGC TGTGTAATAG GTTCATGGCA TTGTCCAAAA TGTTTATCAC CACCTGCAGT CTCATCAGGA TCTATTAGCA ACCCCAGATC AGCTACCCGA CCCTCAAAGT TACACCCACG CCCTTCAAAA CCAGGCAAAG CTCGACCAGC CAACACTCCC AACACCTCTA ATAATCGTCG TCGGCCAAAA CAATCATTAG CAGGAGACGA TGCCTTGTTT ACTAGCCACC GTATCAAAGT CAAGGTACCG AATCCTAATT ATCAATACAG GGATTCGGAA GAAGGAAGAG GAACACCTAT GATTGTACGA CTGAAAGTAC CCAAAAGACC AGTCGAAGAA GAGCCGGAAG AAAAGAAAAT ACCGTATGGA GGGGTCATAA CAGGTGACGA CGCAGATACT ACTCGGACAA AGATAACAGA AGCAGATAAG GAAGCGTACC AGATGGCAAA GAATGCAGCA GAAAAGCAAC TCGGTGGTCC TGTCCCTACA AGGGAAACGC CAGGGCCCGG CTCACCTCTG CCCATGGCAT CACCTAGCGG TAAAACTACA CCTTCTTCAA AGTTCCCAGC CACAAGCAGA CCTCTCCGAG ACCGACTACT CCACCAAACC TTACCCGACG CGTACCCATT CCCTTCCACA CCAGGCACAA CGCAAGAAGT GGTTCCTTGG ACAGGGAGCG CAAGATTAGA GAAAATCAAA ACTATTAGGT TTGGGCCGTA TGATATCAAC ACATGGTACT CTGCGCCATA TCCCGAAGAG TATGCATATG TGCCGGATGG GAGGTTGTGG TTGTGCGAGT TTTGTTTAAA GTATATGAAA AGCGGATTTG CTGCGACGCG GCATAGGGTA TGTCTCTTCA ACATCATAGT GCGATGTGCA CTGACGCGGT ATATAGTTGA AATGCAAATC AAGACATCCG CCGGGAGATG AGATCTATCG CGAAGGTGCT GTCTCAGTTT TTGAAGTGGA TGGACGCAAA AACAAGGTAG GTCTCCTGTC CTTTCCATTT TATCTCATTA ACTCCTTTAC TAGATCTACT GTCAAAATCT TTGTCTTCTC GCCAAGATGT TTCTCGATCA CAAAACGCTC TATTACGACG TCGAACCGTT TCTTTTCTAT GTCATGACCG AAGTCGATGA ATTAGGCGCT CGATTTGTCG GATATTTTTC AAAGGAGAAG CGGAGTATGG ACAACAACGT TAGTTGTATC ATGACCCTGC CGGTGCGACA ACGTAAAGGA TGGGGTCAGC TTTTGATTGA TTTCAGTGGG TCATCATTGC CTTTTTTTGT GAGGAGCAAT ACACTAAGTG TGTACAGGTT ATCTCCTATC GAAGAAAGAA GGACGAACAG GTTCGCCTGA AAAACCACTT TCTGGCCTAG GAGCCGTCTC ATACAAATCC TACTGGCGTC TCACTGTTTT CAAATACCTC CTCAACGCCA TCTCTCCATC TTTCAACCAT ACTCTCGAAC TACCCCCCGT CCCAGACGCC ACACCTGGTC CTACATCCGA ATTGGACTTC AACTCTAACA CAGAAACCAA ACCCACTCCT CCTCGCATAA CATCCAAAGA CATATCCAAA GCTACTAGCA TGACGCTAGA AGATATTTTC ACCACGCTAT CTGCTGAAGG AATGATCAAT GTTCTGGATG ACCTGACGGT CGATGCGATA GGGAAAACAC CAAACAGTGC TCGAACAAGA GGTCGGAGCC GGGGTCGTCC GAATGTAAAT CGTCGCAAGG CAGATTTGAA TGGCTCAGGT ACGCTTGATC CCCAAATACA TCAGGATGAA GACGATCATG TCAAGCTACC AAAACGGTAT GAGATTTTGC TGGATAAAGC GTATCTTCAA GCGGTGGTGG AGAAACATGA GAAAAAGGGG TACTTGAAGC TTGCACCGGA GAGATTGAAG TATCACCCAT TCTTGGTTGC TAGACAGACT GAGGAACCGG AGAGTGGTGG AAAGAAAGAG GATGAGAACG AGGAAAATGG GAATGAAAAA GATAAGGAAA GGGAGGATGA GAGCGAAAAT GGAGTCGTGA GGTTTGTGGA CTCAATTGCC GAAGCAGCGC ATATCATCGC CCAATCGCAC GCGCATTTTC ATACTGTTCC ATCCTCTACA ATATCCCGCA CATCGCATCA TCCCGTCTAT CCTACACCAA CCCCGAATCA CAACTCGATC CCCAATCGCA ATACTACATC ACCCGCGAGA AATTTGCGTA AACGCAAGTC TGATGTTGTG CTCGAAACAC CCGTGAGAAA GTTGAGAAGT CGGGATAACA TTCGAGAAGG AATGGGAACG TCGCCGAGGA CATTGCGTAT GAGGAATGCT GTGGCGCTGG GCGAAGGAAT TGCAGAGGGG GAAGAGGAAA ACCAAGGGCT TTCCAAGTTG AATCAATTAC CTGTCAAGCC TGTTACCCAA TTCGCTGATG GCGAGATTCC TATCGACCCA GCGCTTCTCG AAGAAAGTGT CACTCTTGAC CCTAGTATGT ATGGTAGTGT CGGTGAAGAA GTCTACGACA ACGATGCAGA AGGGGAAGAA TATATCGGCG AAGATGACGA TGCAGAAGGA GAAGAGTATA TTGGGGAAGA GGAAGATGCA GAAGGTGAGG CGGATGAGGA GTATATTGAC TATGATGTTT GATTAGGTCA TTCTTTTGTA CAAGTTTCGG GCTTTGGGTT GCATTGACCC TATATATAAT CGTCGTGCAT GGTATGAATG TG
|
Protein sequence | MQTPAHHGHR LHGHGAVIRE DFCSFCGGTD AINKQGVQET MVSCAACGRS GHPTCLNMLT PKLRKRVMMY DWHCIECKTC EQCAIKGDDS RLMFCDTCDR GWHSYCLNPP LAKPPKGSWH CPKCLSPPAV SSGSISNPRS ATRPSKLHPR PSKPGKARPA NTPNTSNNRR RPKQSLAGDD ALFTSHRIKV KVPNPNYQYR DSEEGRGTPM IVRLKVPKRP VEEEPEEKKI PYGGVITGDD ADTTRTKITE ADKEAYQMAK NAAEKQLGGP VPTRETPGPG SPLPMASPSG KTTPSSKFPA TSRPLRDRLL HQTLPDAYPF PSTPGTTQEV VPWTGSARLE KIKTIRFGPY DINTWYSAPY PEEYAYVPDG RLWLCEFCLK YMKSGFAATR HRLKCKSRHP PGDEIYREGA VSVFEVDGRK NKIYCQNLCL LAKMFLDHKT LYYDVEPFLF YVMTEVDELG ARFVGYFSKE KRSMDNNVSC IMTLPVRQRK GWGQLLIDFS YLLSKKEGRT GSPEKPLSGL GAVSYKSYWR LTVFKYLLNA ISPSFNHTLE LPPVPDATPG PTSELDFNSN TETKPTPPRI TSKDISKATS MTLEDIFTTL SAEGMINVLD DLTVDAIGKT PNSARTRGRS RGRPNVNRRK ADLNGSGTLD PQIHQDEDDH VKLPKRYEIL LDKAYLQAVV EKHEKKGYLK LAPERLKYHP FLVARQTEEP ESGGKKEDEN EENGNEKDKE REDESENGVV RFVDSIAEAA HIIAQSHAHF HTVPSSTISR TSHHPVYPTP TPNHNSIPNR NTTSPARNLR KRKSDVVLET PVRKLRSRDN IREGMGTSPR TLRMRNAVAL GEGIAEGEEE NQGLSKLNQL PVKPVTQFAD GEIPIDPALL EESVTLDPSM YGSVGEEVYD NDAEGEEYIG EDDDAEGEEY IGEEEDAEGE ADEEYIDYDV
|
| |