Gene Information |
Locus tag | CNK01150 |
Symbol | |
ID | 3254393 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | + |
Start bp | 345377 |
End bp | 348319 |
Gene Length | 2943 bp |
Protein Length | 717 aa |
Translation table | |
GC content | 45% |
IMG OID | 638253605 |
Product | polycomb protein e(z), putative |
Protein accession | XP_567801 |
Protein GI | 58260782 |
COG category | |
COG ID | |
TIGRFAM ID | |
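The Start bp, End bp and Gene Length fields above agree with each other if the coordinates are read as 1-based and inclusive. The record does not state that convention, so it is an assumption here. A minimal sketch of the arithmetic, with a hypothetical helper name `span_length`, follows. Because the gene is marked as spliced, the 2943 bp span presumably includes introns, which would explain why it exceeds the 3 × 717 = 2151 bases needed to encode the 717 aa protein.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = span_length(345377, 348319)

# Bases needed to encode the reported protein length (stop codon excluded).
coding_bases = 717 * 3

print(gene_length, coding_bases)
```

The difference between the two numbers is only an upper bound on intron content, since the record does not list exon boundaries.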
| |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCTCATGCC TCCCCACCAA CCCGGTGATC TCGAACCCCC CCCGAAAGTT CCACCAGAAA TTTATAACAC CGTTCGCCAA ACATGGGTTC AAACTTGGAA AGACTTTTAC GCCTGGAAAC CTCCCTCCAC TCTCCCGGTC GAAGATGGCT GGGTGAGTAT GCGAGCAGAG GAAGAGGATA AGAACCTTGT TAAAGATCAT GAGGATTTCG GTAAGCGCAT GGCCGCACTA CTGGAGAAGT TTGAGCAGGA AAGTGGGAGA GAAGGAAAGT TGGCAGAGGT CGAGTTTGAA GGTGCTGCCG AACTTAAGCC TCTACAGAAT ACCAACAAAT CGCCAATCGA ATGGAACGTA CATGGTAGAC CGATTCGTCC TCTCCCACCC ATCTACAGCC TTACACCCAC TATTGATCCC GTCCCCGAGT ACGCCTTTTG TATCTACACC CCTCGCAGCA TCTTGTCACC TGATGAAGTT ATTATGCCCT TTATGCCTAC CTTTGACGAT GACACGCTCG ACATACCAGG ATTCGAGACT GAAAAAGAAA AATATTCGTC TCTTTTTACT AGCTGCTTAT GGGACTTACC GGGACGAGAT GCTGATGTGG ATATTATCAT GTTTGAAACG TTGAAAAGGC TGGAGAAGCT AGAGGTAGAG AAAGAAGATA TCGACAGGAC AAGGATATTA CCAAAAGAGT GTCTCTACGT GGAAAGTCTC GATTTGAGAA GAGATCTACC TCCATTTCCA TTACCTCCTC AAAATATCGA CTCAGTCACA GGAAGGAAGT TGCCGGATGG CCTAAATCAT GTTGTAGGAG CAAAGAGGAA ATGGGAGGAA CCGTTAGAGC TTATTGAAGA GGATTTCGAA GATGATCTAG AAGGGTTTGA GGAAGCATGT TGCCATTATC CTACTTGCAC AAGCGTCATG TGTATTAGGC ATAGTATGTC AATATTTTTC CCTCGGCTTG GCTCAACTAA CGCGAAGCTC AGCCGGCATG TATTTTGACG GTTTAACACG TAATACAGCC AGAGGCTCAA GCTTTAACGC CGGAAATCAG CGTCTACCTC GTATCTCATC AGTACTCCCT ACCCTCTCTG AACCTTGCTC CCCCACATGT TACTCATTAA CTAGTAATGA AGCGACATTA GGGGAAAGAC TGTTGTCCAT CCGACTGGAG AAGGAATGGT CGGAATCGGA CAAGCAACAG CTTATAGATA TACTTTCTGC ATACGAAGGC TCAAGACTCG AAAGAATATG TGGATTGAAA GATGTCTTCA ACCGAACTTG TGCAGAGGTG GGGTTTTCAA ACTGTATACA TTCATTTGCG TGGCTGACCA AATTTAAGGT CGCGCGGCAA GTCACCAGTA TCTTACAAGA TCGATCGAGA GGACCAAGTG TGCATGAGCC AGGTGCTGAC ATGGAATGTT CAACTCCTTC ATCTACATCT GGATCGTTAA GGCCACTTTT ACAAAGGTCA AAGTCCAGTA AAGCCCAATT GAGTACGTTG AAGGTTGTAT CTTTTGGCAG AGAAAGCTAA TAGGTTGTAG TACCGATAAC CAATCGTTTG CCCGAGTTCG TGGAGTGCGA ACATGAAGGC GAGTGTCTTC CAGGTGTTTG TAGTTGCGCC AATGGCAAGT TGCCATGTGG GCGACACTGC TCTGTAGGTT CTCTTCCCCC TTTTTCGTTC CCTTCTTGCT TATTAACGAG CCCAAAGTGT CCTTCCACGT GTACAAGGCG TCATCGCGGT TGCAACTGCC GCCGTATAGC GATACAAGAA GGCAGACCTG TAAGAGACGG CAAAATATGT 
ATTAATGGTA AATGTCCTTG TATAAGGAGT TTCAGAGAAT GTGACAAGGA GCTATGCGGC AGTTGCGGAG CTGCGTACGT TTTATGGCTT GTATCTTATC CATGGGTGCA CTGACCTTCG TTGGTAACTG TTAGTGAGGA GCTGGTACAA GACGAGGAGA TTCTCAAGAC GACAGGTAGT TTCGGAAAGG ATGGGGAATG GGTTGAGAAC AAGGATAAAA TGGCTCAAGG TCAAACATTC ATCAGCTGCG GAAATATAGC TTTACAAAAA GCAAAGTGGC CGGTCAGTAC ATCGTTCAGT TAATCTTGGT TATTGATCAT TAACTATCTG TTCGTAGAAG CTGAGAGTAG GGATAAGCAA GGTAGCAGGG TACGGGTTAT TTGCCGACGA AGATATCGGC CAACATGTGC CTGTAGGGGG TAAGTGATGT TGATGCTGCT TATGGGTACA AATAGCTGCT CATCTGTATA GAATATGTGG GGGAGTATAT TTCTGAATGG GAAGGTGATA ATCGAAAGTG AGTAGGCACC ACACATGATA TATTTTCACA GTCGCTGATA GCTCGATCCA GTTTCGCTGA AGTATGTCTC TAAAAAAGTT CCCTTCTACC CTTATCCTAC TAATCTTAAC TCATTCGCAG TCTATTAATA AACGGCGATA TCAATTCACC ATCAACCCAC AATTTATCAT TGATGCTGGT TTCTTTGGTA ACCACACGCG GTTCATCAAC TCTGCTCAAG GAAATAATGT GAATTGTGTC GCCCATCGTG AGTTTCGGTT AATATTATTA GGACTTTGAG CCTACTGACC CGTTTGGCAG AGAGAGCAGT GGGTCACGAG CTCAGGATAT TGTTTCTGAC AAGTGAGTCT GACATCTGAT TCAAAGTGAA GCTAATACTT TTGCCTGTTT AGCGAGACCC ATCAGACGAC ATGAGGAAAT CCATTTCAAC TATGGGTACG CAAAAAACCC TGTAAATACT GGTGAAAAAG CTGACCTTTT CTTAAGAGAC GACTTTTGGG ATAATCATTA GACCGGTTTG CAGGCTCAAT GAGAGGCGTA GAAGTGACAG AGGACGAAAG GAGGAAAGGA ATGTCGGTAG GCCCATCGCG CTGAGAAAAT GGCATTGGCA AAATAGATGT TGC
Protein sequence | MPPHQPGDLE PPPKVPPEIY NTVRQTWVQT WKDFYAWKPP STLPVEDGWV SMRAEEEDKN LVKDHEDFGK RMAALLEKFE QESGREGKLA EVEFEGAAEL KPLQNTNKSP IEWNVHGRPI RPLPPIYSLT PTIDPVPEYA FCIYTPRSIL SPDEVIMPFM PTFDDDTLDI PGFETEKEKY SSLFTSCLWD LPGRDADVDI IMFETLKRLE KLEVEKEDID RTRILPKECL YVESLDLRRD LPPFPLPPQN IDSVTGRKLP DGLNHVVGAK RKWEEPLELI EEDFEDDLEG FEEACCHYPT CTSVMCIRHT RGSSFNAGNQ RLPRISSVLP TLSEPCSPTC YSLTSNEATL GERLLSIRLE KEWSESDKQQ LIDILSAYEG SRLERICGLK DVFNRTCAEV ARQVTSILQD RSRGPSVHEP GADMECSTPS STSGSLRPLL QRSKSSKAQL IPITNRLPEF VECEHEGECL PGVCSCANGK LPCGRHCSCP STCTRRHRGC NCRRIAIQEG RPVRDGKICI NGKCPCIRSF RECDKELCGS CGAAEELVQD EEILKTTGSF GKDGEWVENK DKMAQGQTFI SCGNIALQKA KWPKLRVGIS KVAGYGLFAD EDIGQHVPVG EYVGEYISEW EGDNRNFAES INKRRYQFTI NPQFIIDAGF FGNHTRFINS AQGNNVNCVA HQRAVGHELR ILFLTTRPIR RHEEIHFNYG DDFWDNH
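Both sequences are displayed in space-separated blocks of ten characters, so the spacing has to be removed before lengths or base composition can be checked against the fields above. The following is a minimal sketch of that cleanup and of a GC fraction calculation. The function names are illustrative, and the example input is only the first three blocks of the gene sequence, so its result is not expected to match the 45% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three ten-base blocks of the gene sequence shown above.
example = "CTCTCATGCC TCCCCACCAA CCCGGTGATC"
seq = clean_sequence(example)

print(len(seq), round(gc_fraction(seq), 2))
```

Applied to the full gene sequence, `len(seq)` can be compared with the Gene Length field and `gc_fraction(seq)` with the GC content field.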