Gene Information |
Locus tag | CNM01550 |
Symbol | |
ID | 3255091 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006682 |
Strand | + |
Start bp | 449942 |
End bp | 451897 |
Gene Length | 1956 bp |
Protein Length | 464 aa |
Translation table | |
GC content | 46% |
IMG OID | 638254308 |
Product | endopeptidase, putative |
Protein accession | XP_568336 |
Protein GI | 58261852 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1222] ATP-dependent 26S proteasome regulatory subunit |
TIGRFAM ID | [TIGR01242] 26S proteasome subunit P45 family |
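
The Gene Length field above is consistent with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. That reading is an assumption (the record does not state its coordinate convention), and the helper name `span_length` is hypothetical. A minimal sketch of the arithmetic:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Start bp and End bp as listed in this record.
gene_length = span_length(449942, 451897)
print(gene_length)  # 1956, matching the Gene Length field
```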
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.865518 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGATAAACTT CTCTATATCC TGTACACCAT GTCTGCCCCA GCCCAAGACC CTCCTCCTCC TAACCCTCCG GCCGGAGACA GCAGCAAGCC ACAAGAATCA TCAGACGTCC CTCAGCAAAC AACCCAGGAG ATTGAAAGCG CAGAGCAGCA ACAGCAGCAG GAAGTCGAGC AGCCCAAGGA GGATACGTTT GAAGATGTCC CGGAACATGT CGTGAAGGTT GGTTATTTTG ATTGTTGCTT GCTTCCAATA TGGATTAGTT TGCTCATTTG AATCTCCGCG ACAGTCTGAT GCGCAAGAGA TCAAGATGCA GACACGTATG ATCGATAACG AAATCAAGAT GATGCGACAG GAAAACCTCC GGCTGTCACA CGAGAGGGAG CAGATGGTTG AGAAGATTGC GGATAATACG ACAAAGATCA AGCAGAACAA GGTTCTGCCT TACCTTGTCT CCAAGGTCGT CGAGGTATGT CCTTCATGTT ATATCATTTC GGAGCGAGAA GAAGTGACTT GCTGATACAA GAGTGTCTGA ACAGATTCTT GATGTGGACT CTGAAGAGCA AGAAGGTGCT ACTCATAATG AACAAAACGC TAAAAAGTCA AAATGTGCTG TTATCAAAAC ATCAACGCGT CAGACAGTAT TTTTGCCCAT TATCGGGTTG GTTCCGCACA ATCAGCTTGC GCCTGGAGAC TTGATTGGTG TCAACAAGGA TTCTTATCTG GTTCTGGACA AACTTCCAGA TGGTGGGTCT ATCTCCTGTC ATTTGGGTGC GAAGCTGATT GGGAGCAGAG TACGACGCGA GGGTAAAGGC GATGGAAGTG GATGAAAGAC CGACTGAGAC ATACACGGAT ATTGGTGGTT TAGACAAACA GATTGAAGAA TTAGTAGAGG CCATGTTAGT TCACCCATAT CATCGTGGTG GAAAAGCTAA CAGCTAGGCA GCGTCTTGCC CATGCAACAA GCAGACAAGT TCAAGACTCT TGGTATCACC CCTCCAAAAG GATGCCTTAT GTACGGTCCG CCTGGTACCG GTAAAACCCT GCTCGCCCGA GCCTGTGCCG CTCAGACCAA CGCTTGCTAC CTCAAACTCG CCGGTCCCGC TCTAGTCCAG ATGTACCTCG GTGACGGTGC CAAACTCGTC CGCGACGCTT TCGAACTTGC CAAGCAAAAA GCGCCTGCTA TCATTTTTAT CGACGAGTTG GATGCGATTG GAACAAAGAG GTTTGACAGC GATAAAAGTG GTGACAGGGA AGTGCAGAGG ACAATGTTGG AGTTGTTGAA CCAACTGGAT GGTTTTTCGA GTGATAGTCG AATCAAGGTG TGTACTTTTT TAGCTCCTTG TGGAGTCGAA AGGAAAAAAA GGAAAGAAAT TGACAATGAG TCAAGGTCAT TGCCGCTACA AACCGAATTG ATATCCTTGA CCCTGCCCTT CTCCGATCAG GCCGTTTAGA TAGGAAGATT GAATTCCCTC TGCCAAATGA GTCTGCTCGA GAGCATATTT TGCAAATTCA CTCTCGAAAG CTTAATCACC ACGGCGTCAA GTGAGTATCT GTTACTTTTA GATCAGCGTG GAATGAATCG CTGACATTTT TCAAGCTTTG AGGAATTGGC TAGATCAACA GAGGATATGA ACGGTGCGCA ATTGAAGGCT GTTTGTGTTG AAGCGGGCAT GGTACGCCTT TATCCATAAT CTGCCTTTAA TCAAAACTTG AAACTGACGA TAATAATATA GTTGGCTCTT CGACAAAACG CTACACAACT GACACACGAG CATTTCCATG GGGGTATTTT GGAAGTTCAA 
GCACGCAAAG CCAAGGAACA CCACGTAAGT GCATTTGCCT GCCATGGATC CGAGGGATGG GATTGGCTGA CATGGAACAG TACTTTGCAT AATGACGAGA ATTTGGTTGA TAGTAGATAT GATGTTTGCA TATGTAGATG TATTCGATTG ATTATG
|
Protein sequence | MSAPAQDPPP PNPPAGDSSK PQESSDVPQQ TTQEIESAEQ QQQQEVEQPK EDTFEDVPEH VVKSDAQEIK MQTRMIDNEI KMMRQENLRL SHEREQMVEK IADNTTKIKQ NKVLPYLVSK VVEILDVDSE EQEGATHNEQ NAKKSKCAVI KTSTRQTVFL PIIGLVPHNQ LAPGDLIGVN KDSYLVLDKL PDEYDARVKA MEVDERPTET YTDIGGLDKQ IEELVEAIVL PMQQADKFKT LGITPPKGCL MYGPPGTGKT LLARACAAQT NACYLKLAGP ALVQMYLGDG AKLVRDAFEL AKQKAPAIIF IDELDAIGTK RFDSDKSGDR EVQRTMLELL NQLDGFSSDS RIKVIAATNR IDILDPALLR SGRLDRKIEF PLPNESAREH ILQIHSRKLN HHGVNFEELA RSTEDMNGAQ LKAVCVEAGM LALRQNATQL THEHFHGGIL EVQARKAKEH HYFA
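
A figure like the GC content field can be estimated from a nucleotide sequence written in the space-separated blocks of ten used for the gene sequence above. The sketch below is one plausible way to do that; the function name `gc_percent` is hypothetical, and the record does not say exactly how its GC content value was derived, so the result for the full sequence may differ slightly from the listed value.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence, used only as a small example.
print(gc_percent("CGATAAACTT CTCTATATCC"))  # 35.0
```

Passing the complete gene sequence string instead of the two-block example gives the value for the whole gene.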