Gene CNM01550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNM01550 
Symbol 
ID3255091 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006682 
Strand
Start bp449942 
End bp451897 
Gene Length1956 bp 
Protein Length464 aa 
Translation table 
GC content46% 
IMG OID638254308 
Productendopeptidase, putative 
Protein accessionXP_568336 
Protein GI58261852 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1222] ATP-dependent 26S proteasome regulatory subunit 
TIGRFAM ID[TIGR01242] 26S proteasome subunit P45 family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.865518 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGATAAACTT CTCTATATCC TGTACACCAT GTCTGCCCCA GCCCAAGACC CTCCTCCTCC 
TAACCCTCCG GCCGGAGACA GCAGCAAGCC ACAAGAATCA TCAGACGTCC CTCAGCAAAC
AACCCAGGAG ATTGAAAGCG CAGAGCAGCA ACAGCAGCAG GAAGTCGAGC AGCCCAAGGA
GGATACGTTT GAAGATGTCC CGGAACATGT CGTGAAGGTT GGTTATTTTG ATTGTTGCTT
GCTTCCAATA TGGATTAGTT TGCTCATTTG AATCTCCGCG ACAGTCTGAT GCGCAAGAGA
TCAAGATGCA GACACGTATG ATCGATAACG AAATCAAGAT GATGCGACAG GAAAACCTCC
GGCTGTCACA CGAGAGGGAG CAGATGGTTG AGAAGATTGC GGATAATACG ACAAAGATCA
AGCAGAACAA GGTTCTGCCT TACCTTGTCT CCAAGGTCGT CGAGGTATGT CCTTCATGTT
ATATCATTTC GGAGCGAGAA GAAGTGACTT GCTGATACAA GAGTGTCTGA ACAGATTCTT
GATGTGGACT CTGAAGAGCA AGAAGGTGCT ACTCATAATG AACAAAACGC TAAAAAGTCA
AAATGTGCTG TTATCAAAAC ATCAACGCGT CAGACAGTAT TTTTGCCCAT TATCGGGTTG
GTTCCGCACA ATCAGCTTGC GCCTGGAGAC TTGATTGGTG TCAACAAGGA TTCTTATCTG
GTTCTGGACA AACTTCCAGA TGGTGGGTCT ATCTCCTGTC ATTTGGGTGC GAAGCTGATT
GGGAGCAGAG TACGACGCGA GGGTAAAGGC GATGGAAGTG GATGAAAGAC CGACTGAGAC
ATACACGGAT ATTGGTGGTT TAGACAAACA GATTGAAGAA TTAGTAGAGG CCATGTTAGT
TCACCCATAT CATCGTGGTG GAAAAGCTAA CAGCTAGGCA GCGTCTTGCC CATGCAACAA
GCAGACAAGT TCAAGACTCT TGGTATCACC CCTCCAAAAG GATGCCTTAT GTACGGTCCG
CCTGGTACCG GTAAAACCCT GCTCGCCCGA GCCTGTGCCG CTCAGACCAA CGCTTGCTAC
CTCAAACTCG CCGGTCCCGC TCTAGTCCAG ATGTACCTCG GTGACGGTGC CAAACTCGTC
CGCGACGCTT TCGAACTTGC CAAGCAAAAA GCGCCTGCTA TCATTTTTAT CGACGAGTTG
GATGCGATTG GAACAAAGAG GTTTGACAGC GATAAAAGTG GTGACAGGGA AGTGCAGAGG
ACAATGTTGG AGTTGTTGAA CCAACTGGAT GGTTTTTCGA GTGATAGTCG AATCAAGGTG
TGTACTTTTT TAGCTCCTTG TGGAGTCGAA AGGAAAAAAA GGAAAGAAAT TGACAATGAG
TCAAGGTCAT TGCCGCTACA AACCGAATTG ATATCCTTGA CCCTGCCCTT CTCCGATCAG
GCCGTTTAGA TAGGAAGATT GAATTCCCTC TGCCAAATGA GTCTGCTCGA GAGCATATTT
TGCAAATTCA CTCTCGAAAG CTTAATCACC ACGGCGTCAA GTGAGTATCT GTTACTTTTA
GATCAGCGTG GAATGAATCG CTGACATTTT TCAAGCTTTG AGGAATTGGC TAGATCAACA
GAGGATATGA ACGGTGCGCA ATTGAAGGCT GTTTGTGTTG AAGCGGGCAT GGTACGCCTT
TATCCATAAT CTGCCTTTAA TCAAAACTTG AAACTGACGA TAATAATATA GTTGGCTCTT
CGACAAAACG CTACACAACT GACACACGAG CATTTCCATG GGGGTATTTT GGAAGTTCAA
GCACGCAAAG CCAAGGAACA CCACGTAAGT GCATTTGCCT GCCATGGATC CGAGGGATGG
GATTGGCTGA CATGGAACAG TACTTTGCAT AATGACGAGA ATTTGGTTGA TAGTAGATAT
GATGTTTGCA TATGTAGATG TATTCGATTG ATTATG
 
Protein sequence
MSAPAQDPPP PNPPAGDSSK PQESSDVPQQ TTQEIESAEQ QQQQEVEQPK EDTFEDVPEH 
VVKSDAQEIK MQTRMIDNEI KMMRQENLRL SHEREQMVEK IADNTTKIKQ NKVLPYLVSK
VVEILDVDSE EQEGATHNEQ NAKKSKCAVI KTSTRQTVFL PIIGLVPHNQ LAPGDLIGVN
KDSYLVLDKL PDEYDARVKA MEVDERPTET YTDIGGLDKQ IEELVEAIVL PMQQADKFKT
LGITPPKGCL MYGPPGTGKT LLARACAAQT NACYLKLAGP ALVQMYLGDG AKLVRDAFEL
AKQKAPAIIF IDELDAIGTK RFDSDKSGDR EVQRTMLELL NQLDGFSSDS RIKVIAATNR
IDILDPALLR SGRLDRKIEF PLPNESAREH ILQIHSRKLN HHGVNFEELA RSTEDMNGAQ
LKAVCVEAGM LALRQNATQL THEHFHGGIL EVQARKAKEH HYFA