Gene CNK01150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNK01150 
Symbol 
ID3254393 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006680 
Strand
Start bp345377 
End bp348319 
Gene Length2943 bp 
Protein Length717 aa 
Translation table 
GC content45% 
IMG OID638253605 
Productpolycomb protein e(z), putative 
Protein accessionXP_567801 
Protein GI58260782 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTCTCATGCC TCCCCACCAA CCCGGTGATC TCGAACCCCC CCCGAAAGTT CCACCAGAAA 
TTTATAACAC CGTTCGCCAA ACATGGGTTC AAACTTGGAA AGACTTTTAC GCCTGGAAAC
CTCCCTCCAC TCTCCCGGTC GAAGATGGCT GGGTGAGTAT GCGAGCAGAG GAAGAGGATA
AGAACCTTGT TAAAGATCAT GAGGATTTCG GTAAGCGCAT GGCCGCACTA CTGGAGAAGT
TTGAGCAGGA AAGTGGGAGA GAAGGAAAGT TGGCAGAGGT CGAGTTTGAA GGTGCTGCCG
AACTTAAGCC TCTACAGAAT ACCAACAAAT CGCCAATCGA ATGGAACGTA CATGGTAGAC
CGATTCGTCC TCTCCCACCC ATCTACAGCC TTACACCCAC TATTGATCCC GTCCCCGAGT
ACGCCTTTTG TATCTACACC CCTCGCAGCA TCTTGTCACC TGATGAAGTT ATTATGCCCT
TTATGCCTAC CTTTGACGAT GACACGCTCG ACATACCAGG ATTCGAGACT GAAAAAGAAA
AATATTCGTC TCTTTTTACT AGCTGCTTAT GGGACTTACC GGGACGAGAT GCTGATGTGG
ATATTATCAT GTTTGAAACG TTGAAAAGGC TGGAGAAGCT AGAGGTAGAG AAAGAAGATA
TCGACAGGAC AAGGATATTA CCAAAAGAGT GTCTCTACGT GGAAAGTCTC GATTTGAGAA
GAGATCTACC TCCATTTCCA TTACCTCCTC AAAATATCGA CTCAGTCACA GGAAGGAAGT
TGCCGGATGG CCTAAATCAT GTTGTAGGAG CAAAGAGGAA ATGGGAGGAA CCGTTAGAGC
TTATTGAAGA GGATTTCGAA GATGATCTAG AAGGGTTTGA GGAAGCATGT TGCCATTATC
CTACTTGCAC AAGCGTCATG TGTATTAGGC ATAGTATGTC AATATTTTTC CCTCGGCTTG
GCTCAACTAA CGCGAAGCTC AGCCGGCATG TATTTTGACG GTTTAACACG TAATACAGCC
AGAGGCTCAA GCTTTAACGC CGGAAATCAG CGTCTACCTC GTATCTCATC AGTACTCCCT
ACCCTCTCTG AACCTTGCTC CCCCACATGT TACTCATTAA CTAGTAATGA AGCGACATTA
GGGGAAAGAC TGTTGTCCAT CCGACTGGAG AAGGAATGGT CGGAATCGGA CAAGCAACAG
CTTATAGATA TACTTTCTGC ATACGAAGGC TCAAGACTCG AAAGAATATG TGGATTGAAA
GATGTCTTCA ACCGAACTTG TGCAGAGGTG GGGTTTTCAA ACTGTATACA TTCATTTGCG
TGGCTGACCA AATTTAAGGT CGCGCGGCAA GTCACCAGTA TCTTACAAGA TCGATCGAGA
GGACCAAGTG TGCATGAGCC AGGTGCTGAC ATGGAATGTT CAACTCCTTC ATCTACATCT
GGATCGTTAA GGCCACTTTT ACAAAGGTCA AAGTCCAGTA AAGCCCAATT GAGTACGTTG
AAGGTTGTAT CTTTTGGCAG AGAAAGCTAA TAGGTTGTAG TACCGATAAC CAATCGTTTG
CCCGAGTTCG TGGAGTGCGA ACATGAAGGC GAGTGTCTTC CAGGTGTTTG TAGTTGCGCC
AATGGCAAGT TGCCATGTGG GCGACACTGC TCTGTAGGTT CTCTTCCCCC TTTTTCGTTC
CCTTCTTGCT TATTAACGAG CCCAAAGTGT CCTTCCACGT GTACAAGGCG TCATCGCGGT
TGCAACTGCC GCCGTATAGC GATACAAGAA GGCAGACCTG TAAGAGACGG CAAAATATGT
ATTAATGGTA AATGTCCTTG TATAAGGAGT TTCAGAGAAT GTGACAAGGA GCTATGCGGC
AGTTGCGGAG CTGCGTACGT TTTATGGCTT GTATCTTATC CATGGGTGCA CTGACCTTCG
TTGGTAACTG TTAGTGAGGA GCTGGTACAA GACGAGGAGA TTCTCAAGAC GACAGGTAGT
TTCGGAAAGG ATGGGGAATG GGTTGAGAAC AAGGATAAAA TGGCTCAAGG TCAAACATTC
ATCAGCTGCG GAAATATAGC TTTACAAAAA GCAAAGTGGC CGGTCAGTAC ATCGTTCAGT
TAATCTTGGT TATTGATCAT TAACTATCTG TTCGTAGAAG CTGAGAGTAG GGATAAGCAA
GGTAGCAGGG TACGGGTTAT TTGCCGACGA AGATATCGGC CAACATGTGC CTGTAGGGGG
TAAGTGATGT TGATGCTGCT TATGGGTACA AATAGCTGCT CATCTGTATA GAATATGTGG
GGGAGTATAT TTCTGAATGG GAAGGTGATA ATCGAAAGTG AGTAGGCACC ACACATGATA
TATTTTCACA GTCGCTGATA GCTCGATCCA GTTTCGCTGA AGTATGTCTC TAAAAAAGTT
CCCTTCTACC CTTATCCTAC TAATCTTAAC TCATTCGCAG TCTATTAATA AACGGCGATA
TCAATTCACC ATCAACCCAC AATTTATCAT TGATGCTGGT TTCTTTGGTA ACCACACGCG
GTTCATCAAC TCTGCTCAAG GAAATAATGT GAATTGTGTC GCCCATCGTG AGTTTCGGTT
AATATTATTA GGACTTTGAG CCTACTGACC CGTTTGGCAG AGAGAGCAGT GGGTCACGAG
CTCAGGATAT TGTTTCTGAC AAGTGAGTCT GACATCTGAT TCAAAGTGAA GCTAATACTT
TTGCCTGTTT AGCGAGACCC ATCAGACGAC ATGAGGAAAT CCATTTCAAC TATGGGTACG
CAAAAAACCC TGTAAATACT GGTGAAAAAG CTGACCTTTT CTTAAGAGAC GACTTTTGGG
ATAATCATTA GACCGGTTTG CAGGCTCAAT GAGAGGCGTA GAAGTGACAG AGGACGAAAG
GAGGAAAGGA ATGTCGGTAG GCCCATCGCG CTGAGAAAAT GGCATTGGCA AAATAGATGT
TGC
 
Protein sequence
MPPHQPGDLE PPPKVPPEIY NTVRQTWVQT WKDFYAWKPP STLPVEDGWV SMRAEEEDKN 
LVKDHEDFGK RMAALLEKFE QESGREGKLA EVEFEGAAEL KPLQNTNKSP IEWNVHGRPI
RPLPPIYSLT PTIDPVPEYA FCIYTPRSIL SPDEVIMPFM PTFDDDTLDI PGFETEKEKY
SSLFTSCLWD LPGRDADVDI IMFETLKRLE KLEVEKEDID RTRILPKECL YVESLDLRRD
LPPFPLPPQN IDSVTGRKLP DGLNHVVGAK RKWEEPLELI EEDFEDDLEG FEEACCHYPT
CTSVMCIRHT RGSSFNAGNQ RLPRISSVLP TLSEPCSPTC YSLTSNEATL GERLLSIRLE
KEWSESDKQQ LIDILSAYEG SRLERICGLK DVFNRTCAEV ARQVTSILQD RSRGPSVHEP
GADMECSTPS STSGSLRPLL QRSKSSKAQL IPITNRLPEF VECEHEGECL PGVCSCANGK
LPCGRHCSCP STCTRRHRGC NCRRIAIQEG RPVRDGKICI NGKCPCIRSF RECDKELCGS
CGAAEELVQD EEILKTTGSF GKDGEWVENK DKMAQGQTFI SCGNIALQKA KWPKLRVGIS
KVAGYGLFAD EDIGQHVPVG EYVGEYISEW EGDNRNFAES INKRRYQFTI NPQFIIDAGF
FGNHTRFINS AQGNNVNCVA HQRAVGHELR ILFLTTRPIR RHEEIHFNYG DDFWDNH