Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNE03020 |
Symbol | |
ID | 3257827 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006687 |
Strand | + |
Start bp | 857750 |
End bp | 859539 |
Gene Length | 1790 bp |
Protein Length | 438 aa |
Translation table | |
GC content | 47% |
IMG OID | 638256885 |
Product | endopeptidase, putative |
Protein accession | XP_570849 |
Protein GI | 58267386 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1222] ATP-dependent 26S proteasome regulatory subunit |
TIGRFAM ID | [TIGR01242] 26S proteasome subunit P45 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0528314 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATTTCCTCCT TTCCCTCAAA ACTCACCAAT CCTCTACACT CTAGCCAACA AGTAGTATTG TACAATGGTA TGTTCAGGTT GTTACTGTCT GATGAGTGCT GACAGTTGAT CTTCCATAAG GGTCAAGCAC CATCCAGCGG CGCTGGAAAC AATAAGAAAG GCAACTCCAA AGACAACAAA GACAAGGTTG GTTCTTGTGT TTCGATGTAG CATCGCAAGA GATTGATTTG CTGACTTACT TCGGTAGCCT AAGTGGGAGC CCCCTGTGCC TACTCGTATC GGTAAGAAGA AGAGACGCGG ACCCGATGCG TCGTCCCGAC TTCCAGCTGT ATATCCCACC ACTCGATGCA AACTCAAGTT ATTGAAGATG GAGAGGATAC AAGACTACCT TCTCATGGAA GAGGAATTTG TGTCTAATCA GGCATCGCAG TCTGGTGAAG ATAGGACGGC GGCAGATCGA ACTCGGGTGG ACGAGCTTCG TGGCTCGCCT ATGGGAGTCG GCACTTTGGA AGAGATCATT GATGACGATC ACGCCATTGT GTCTTCCGGA GGTGGATCTG AATACTATGT TGGGATCATG TCCTTCGTTG ATAAGGACCT GCTTGAACCC GGTTGCTCAG TTCTCCTTCA CCACAAGACG CACGCTGTTG TGGGAGTGCT TGCCGATGAC ACCGATCCCA TGGTCTCTGT CATGAAACTT GATAAAGCCC CCACTGAAAG CTATGCTGAT ATTGGAGGCC TGGAAAGTCA AATTCAAGAA ATCAAAGAGT CAGTTGAACT TCCACTTACA CACCCCGAAC TTTATGAAGA GATGGGCATC AGACCGCCCA AAGGTGTTAT CTTGTATGGA GTACCTGGTA CTGGAAAGAC TCTTCTTGCA AAAGCCGTCG CCAATCAGAC ATCTGCAACA TTTCTTCGTA TAGTCGGCTC CGAACTCATT CAAAAGTATC TTGGCGATGG TCCAAAGCTT GTTCGGGAAC TATTCAGGGT GGCTGAGGAA AACGCCCCAA GCATCGTCTT CATCGATGAG ATTGATGCCA TCGGGACAAA AAGATATGAC TCGACATCCA GTGGCGAACG GGAGATACAG CGTACAATGC TGGAGCTGTT AAATCAGCTG GACGGTTTTG ATACGCGGGG TGATGTTAAA GTCATAATGG CAACCAACAA GGTGAGTCAT CTTCGCTTAC AAGATTTTAA CCCGCGTTGT TGACTAGTTT TTAGATTGAA AATCTCGACC CTGCGCTGAT TCGTCCAGGT CGAATAGACA GGAAAATTGG CAAGTTTGCC CATAGCCATA GCTTACCTCG TGCTGACTTA GATTCTCATC AGAATTCCCT CTTCCTGATA CAAAGACTAA GAGACACATC TTTAAATTAC ACACATCGCG AATGTCTTTG GCAGATGATG TGGACATCGA GGAATTAGTC ATGACCAAGG ATGAACTATC TGGGGCAGAC ATTAAAGCCG TTTGCAGTGA GTGTTTCATG TCGTGACCAT CGCTGACAAT TAGCCGAGGC CGGTTTGCTG GCATTGAGGG AAAGGCGAAT GAGGGTTACT CGTACCGTAG GTTCATGTCT AAATATAGGA CATCTGTTGC TGTTGCTGAT TTTTGGGGGA TACCAGGATT TCACCACAGC ACGAGAAAAA GTCCTGTATG GAAAAGACGA GAATACTGTA AGTCTATCTG CCTTTAACAC CGCTGACGCT GTAGCCCGCA GGCCTGTATC TCTAAGCGTA CGGAGACTGG AGATTTTGGT TTATGTAGTA GACTGCATGC ACTTCAATCA
|
Protein sequence | MGQAPSSGAG NNKKGNSKDN KDKPKWEPPV PTRIGKKKRR GPDASSRLPA VYPTTRCKLK LLKMERIQDY LLMEEEFVSN QASQSGEDRT AADRTRVDEL RGSPMGVGTL EEIIDDDHAI VSSGGGSEYY VGIMSFVDKD LLEPGCSVLL HHKTHAVVGV LADDTDPMVS VMKLDKAPTE SYADIGGLES QIQEIKESVE LPLTHPELYE EMGIRPPKGV ILYGVPGTGK TLLAKAVANQ TSATFLRIVG SELIQKYLGD GPKLVRELFR VAEENAPSIV FIDEIDAIGT KRYDSTSSGE REIQRTMLEL LNQLDGFDTR GDVKVIMATN KIENLDPALI RPGRIDRKIE FPLPDTKTKR HIFKLHTSRM SLADDVDIEE LVMTKDELSG ADIKAVCTEA GLLALRERRM RVTRTDFTTA REKVLYGKDE NTPAGLYL
|
| |