Gene CNK02950 details

Gene Information

Locus tag: CNK02950
Symbol:
ID: 3254500
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006680
Strand:
Start bp: 865595
End bp: 866596
Gene Length: 1002 bp
Protein Length: 280 aa
Translation table:
GC content: 50%
IMG OID: 638253786
Product: endopeptidase, putative
Protein accession: XP_567890
Protein GI: 58260960
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0638] 20S proteasome, alpha and beta subunits
TIGRFAM ID:
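
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that Gene Length covers the whole locus, introns included (the record marks the gene as spliced). Neither assumption is stated in the record itself, and the helper names are illustrative only.

```python
# Values copied from the record above.
start_bp = 865595
end_bp = 866596
protein_length_aa = 280


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein, optionally plus a stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_length = span_length(start_bp, end_bp)
cds_length = coding_length(protein_length_aa)
non_coding = gene_length - cds_length

print(gene_length)  # 1002
print(cds_length)   # 843
print(non_coding)   # 159
```

Under these assumptions the span is 1002 bp, matching the reported Gene Length, while 280 codons plus a stop codon account for 843 bp. The remaining 159 bp would then be intronic sequence.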


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.0631891
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCATCT CAAACGTAAG TTTCCACGGC GGACGGAGGA CACCATATCC AGCGCTGACA 
TAACGCAGCA CCCCGCCCTC TGGAAACAGC CCGCTCCAGC AAACTCTGCT TTCAACGACT
ACAACACCTT TCCCCTCGGA CAAACGCAGC GCCACAACAC TTCTTCCCAC CATGGTCCCA
TGTCCCACAC CCAACAACCT CTTGTCACCG GTACTTCTGT TCTCGGCATC AAGTTTGACA
AGGGCGTGAT GATTGCGGCT GATAACCTCG GTTCATACGG TTCTCTTGCG AGGTTTAGAG
ATATCCAGCG TCTTCATCCT CTGGGGAAAC ATACCCTTTT GGGTGTGGCG GGCGACATGT
CTGATTATCA GTGGTTGAAA AGGGAGCTCG ATGGACTCTT GTATGTCCAT CTGTGGAAAA
CGGCGTAAGA TGTGATGCTG ACAGAATTAT GTAGACGAGA GGAATCTGCT CTCTCCCTGA
CCGATTCCCA CCCATCGCTT TCTCCTTCCA ATATTTACAC TCTTCTCTCC AATCTCTTCT
ACGCTCGTCG AAGCAAAGTT GACCCCATCT GGAACGCCGT CCTCGTCGGT GGTTGGGACG
ACACCAAAAA AGAAAGTTTC CTCGCATATG TCGATTTGCT TGGTACAACT TATTCTGCGC
CCACACTCGC GACGGGCTTT GGAGCCCATC TCGCGCAACC GCTATTGAGG GAAGCATATG
AAGCAAAGGC GGGGATTGAT GGCAAGGGGC CATTGTTGAC GCAGGAGGAG GCGGAGAAAT
TGATTGATGA TTGTATGAAG GTGTTGTTCT ACAGGGATGC GAGAAGTATC AACAAGGTCA
GTTTATATCT CTTATGGTTG AAAACATTTG TCTGATAAGA GGAAATAGTA CCAAGTCGCT
ACTATCACAG ATGAAGGTGT CAAGATCAGT GACTCTAGAT CAGCTCCTAC AGAATGGAAG
TTTGCAGAGG GTTTGAGAGG GTACGGGGCG CAGACCCAGT AG
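
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. The sketch below shows one way to do that and to compute a GC percentage. The record does not say how its GC content figure was calculated or rounded, so the plain G+C fraction used here is an assumption, and the function names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, assuming a plain G+C fraction."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used as a short example.
first_line = "ATGGCCATCT CAAACGTAAG TTTCCACGGC GGACGGAGGA CACCATATCC AGCGCTGACA"

print(len(clean_sequence(first_line)))
print(gc_percent(first_line))
```

Passing the full gene sequence to `gc_percent` in the same way gives a value that can be compared with the GC content reported in the Gene Information block.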
 
Protein sequence
MAISNHPALW KQPAPANSAF NDYNTFPLGQ TQRHNTSSHH GPMSHTQQPL VTGTSVLGIK 
FDKGVMIAAD NLGSYGSLAR FRDIQRLHPL GKHTLLGVAG DMSDYQWLKR ELDGLLREES
ALSLTDSHPS LSPSNIYTLL SNLFYARRSK VDPIWNAVLV GGWDDTKKES FLAYVDLLGT
TYSAPTLATG FGAHLAQPLL REAYEAKAGI DGKGPLLTQE EAEKLIDDCM KVLFYRDARS
INKYQVATIT DEGVKISDSR SAPTEWKFAE GLRGYGAQTQ