Gene CNE03020 details


Gene Information

Locus tag: CNE03020
Symbol:
ID: 3257827
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006687
Strand:
Start bp: 857750
End bp: 859539
Gene Length: 1790 bp
Protein Length: 438 aa
Translation table:
GC content: 47%
IMG OID: 638256885
Product: endopeptidase, putative
Protein accession: XP_570849
Protein GI: 58267386
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1222] ATP-dependent 26S proteasome regulatory subunit
TIGRFAM ID: [TIGR01242] 26S proteasome subunit P45 family
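
The gene length listed above is consistent with the start and end positions if both are read as 1-based, inclusive coordinates. That reading is an assumption here, since the record does not state its coordinate convention. A minimal sketch of the arithmetic, with `gene_length` as a hypothetical helper name:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints count as bases, hence the +1.
    return end_bp - start_bp + 1


# Start bp and End bp as listed in this record.
print(gene_length(857750, 859539))  # 1790
```

Under a 0-based, half-open convention the same coordinates would give 1789, which does not match the listed 1790 bp.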


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0528314
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATTTCCTCCT TTCCCTCAAA ACTCACCAAT CCTCTACACT CTAGCCAACA AGTAGTATTG 
TACAATGGTA TGTTCAGGTT GTTACTGTCT GATGAGTGCT GACAGTTGAT CTTCCATAAG
GGTCAAGCAC CATCCAGCGG CGCTGGAAAC AATAAGAAAG GCAACTCCAA AGACAACAAA
GACAAGGTTG GTTCTTGTGT TTCGATGTAG CATCGCAAGA GATTGATTTG CTGACTTACT
TCGGTAGCCT AAGTGGGAGC CCCCTGTGCC TACTCGTATC GGTAAGAAGA AGAGACGCGG
ACCCGATGCG TCGTCCCGAC TTCCAGCTGT ATATCCCACC ACTCGATGCA AACTCAAGTT
ATTGAAGATG GAGAGGATAC AAGACTACCT TCTCATGGAA GAGGAATTTG TGTCTAATCA
GGCATCGCAG TCTGGTGAAG ATAGGACGGC GGCAGATCGA ACTCGGGTGG ACGAGCTTCG
TGGCTCGCCT ATGGGAGTCG GCACTTTGGA AGAGATCATT GATGACGATC ACGCCATTGT
GTCTTCCGGA GGTGGATCTG AATACTATGT TGGGATCATG TCCTTCGTTG ATAAGGACCT
GCTTGAACCC GGTTGCTCAG TTCTCCTTCA CCACAAGACG CACGCTGTTG TGGGAGTGCT
TGCCGATGAC ACCGATCCCA TGGTCTCTGT CATGAAACTT GATAAAGCCC CCACTGAAAG
CTATGCTGAT ATTGGAGGCC TGGAAAGTCA AATTCAAGAA ATCAAAGAGT CAGTTGAACT
TCCACTTACA CACCCCGAAC TTTATGAAGA GATGGGCATC AGACCGCCCA AAGGTGTTAT
CTTGTATGGA GTACCTGGTA CTGGAAAGAC TCTTCTTGCA AAAGCCGTCG CCAATCAGAC
ATCTGCAACA TTTCTTCGTA TAGTCGGCTC CGAACTCATT CAAAAGTATC TTGGCGATGG
TCCAAAGCTT GTTCGGGAAC TATTCAGGGT GGCTGAGGAA AACGCCCCAA GCATCGTCTT
CATCGATGAG ATTGATGCCA TCGGGACAAA AAGATATGAC TCGACATCCA GTGGCGAACG
GGAGATACAG CGTACAATGC TGGAGCTGTT AAATCAGCTG GACGGTTTTG ATACGCGGGG
TGATGTTAAA GTCATAATGG CAACCAACAA GGTGAGTCAT CTTCGCTTAC AAGATTTTAA
CCCGCGTTGT TGACTAGTTT TTAGATTGAA AATCTCGACC CTGCGCTGAT TCGTCCAGGT
CGAATAGACA GGAAAATTGG CAAGTTTGCC CATAGCCATA GCTTACCTCG TGCTGACTTA
GATTCTCATC AGAATTCCCT CTTCCTGATA CAAAGACTAA GAGACACATC TTTAAATTAC
ACACATCGCG AATGTCTTTG GCAGATGATG TGGACATCGA GGAATTAGTC ATGACCAAGG
ATGAACTATC TGGGGCAGAC ATTAAAGCCG TTTGCAGTGA GTGTTTCATG TCGTGACCAT
CGCTGACAAT TAGCCGAGGC CGGTTTGCTG GCATTGAGGG AAAGGCGAAT GAGGGTTACT
CGTACCGTAG GTTCATGTCT AAATATAGGA CATCTGTTGC TGTTGCTGAT TTTTGGGGGA
TACCAGGATT TCACCACAGC ACGAGAAAAA GTCCTGTATG GAAAAGACGA GAATACTGTA
AGTCTATCTG CCTTTAACAC CGCTGACGCT GTAGCCCGCA GGCCTGTATC TCTAAGCGTA
CGGAGACTGG AGATTTTGGT TTATGTAGTA GACTGCATGC ACTTCAATCA
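
The GC content field in this record can be cross-checked against the gene sequence above. The sketch below assumes GC content means the percentage of G and C bases over all bases in the sequence, with the spaces and line breaks of the 10-base blocks ignored. The record does not say how its 47% figure was computed or rounded, and `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    # Remove the spaces and newlines used to lay the sequence out in blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, used only as sample input; paste the
# full sequence to compare against the value listed in the record.
first_line = "ATTTCCTCCT TTCCCTCAAA ACTCACCAAT CCTCTACACT CTAGCCAACA AGTAGTATTG"
print(round(gc_content(first_line), 1))
```

The same whitespace stripping can be used to confirm the listed sequence lengths, for example with `len("".join(sequence.split()))`.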
 
Protein sequence
MGQAPSSGAG NNKKGNSKDN KDKPKWEPPV PTRIGKKKRR GPDASSRLPA VYPTTRCKLK 
LLKMERIQDY LLMEEEFVSN QASQSGEDRT AADRTRVDEL RGSPMGVGTL EEIIDDDHAI
VSSGGGSEYY VGIMSFVDKD LLEPGCSVLL HHKTHAVVGV LADDTDPMVS VMKLDKAPTE
SYADIGGLES QIQEIKESVE LPLTHPELYE EMGIRPPKGV ILYGVPGTGK TLLAKAVANQ
TSATFLRIVG SELIQKYLGD GPKLVRELFR VAEENAPSIV FIDEIDAIGT KRYDSTSSGE
REIQRTMLEL LNQLDGFDTR GDVKVIMATN KIENLDPALI RPGRIDRKIE FPLPDTKTKR
HIFKLHTSRM SLADDVDIEE LVMTKDELSG ADIKAVCTEA GLLALRERRM RVTRTDFTTA
REKVLYGKDE NTPAGLYL