Gene Information |
Locus tag | CNN00870 |
Symbol | |
ID | 3255467 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006683 |
Strand | - |
Start bp | 280108 |
End bp | 282117 |
Gene Length | 2010 bp |
Protein Length | 423 aa |
Translation table | |
GC content | 44% |
IMG OID | 638254503 |
Product | endopeptidase, putative |
Protein accession | XP_568638 |
Protein GI | 58262456 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5159] 26S proteasome regulatory complex component |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | TGCTTTCATC CAGTTACTAT ACCCATCGTA TCCTTCGCCA CGACATTCTG AGGCTTTAGC AATCGAACAC ATTCTCTTAC ATTAGATCAT CTTGTCATGT CCGTCGACAC ACCCACCTCT GAGAAACTCG ACCAAGCGGC GAGTGTCTTT GACAAAGATC CGACCACTGC AGAGCGACTC TATAAGGAAA TATTGCAAGA TGACAGTCAA CGTGAGTTCC TCCTCTATTG ATGCGCTTTA GCCTCCAGGG GTCCTGATAG AAGCCTTCTT CTTCATTAGC TGGAAACGAA GACCTTTTGA GAGACAAAGA GGTCGCTCTC ATCAAGTTGG GCACTCTCTA CAGAGATTCA AGGTATTTAT CTATCCGTAC AATGATCTAT AGAACCTTTC TGACACTTGT CTGTGACGAC AATTTTGGTA GCATGCTCGA CAAGTTGTCT CAGTTGATAA CGGACTCTAG AACTTTCATG TCACATATCG CAAAAGCCAA AACGACTAAG CTAGGTGAGC TTGCTTCCCC AGCCTTTTAA GTTACGACAT TAATCAGTCA ATTCCAAAGT GCGCACACTC CTGGACCTTT TCCCTCAAGA TTCAAAGGAT ATGCAGATGA AGGTCATTCA AGAGAATATA GACTGGGCCC GCACAGAAAA GAGGGTTTTC TTGCGCCAAA GCCTGGAGAT AAAACTCATT AACGTGTGAG GCCTCTTTGT TTGAGAAGAC TATGTTAGAT CTTCTTAGCT GACTACAATA CCAATCAGCT TGTTGGATGC CGAGAAATAC CAAGAGGCCT TGACTATCAC CCAAACTCTT CTCAAAGAAC TCAAAAAATT CGATGATAAG ATTATCCTGA CCGAGGTGTA TTTATTGGAG TCGCGTGCTG CCCATCACAT GCACAATCAC GCGCTGGCGA AGACGGCACT AACCTCAGCT CGCACAACTG CCAACAGTGT ATACTGTCCG CCTACGCTTC AGGCTCAACT TGACCTGCAA TCTGGAGTTA TCATGGCGGA GGACAAGGAT TACAAAACAG CGTACTCCTA CTTCTTTGAA GCCTTTGAAG GTTTCTGTCA ATCCGCCGAG AGAGACAATA GAGCACTGAG CGCCTTGAAA TACATGCTAT TGTGCAAGAT TATGATCGGA TCCGTGAGTA TGATGTTTGG CTAACTCGCC CATATGCTGA TGAAGTCTGG ATTCCCCCAG CCTAACGACG TCTTCTCGTT GTTATCATTG AAAAGCGCCG CCCCCTACAT AGGCAAAGAT GTGGACGCGA TGAAAGCAAT TGCGACGGCT CTTGAGGAAC GCAGTCTTGA TCTTTTCAAG ACAGCTTTGC AAAATTATTC CGACCGTAGG TCTGCGTAGT GTGAAGACTT AACAGCTAAC GGCTCCTCCA GAATTGCAGA AAGACGAAAT CATTCGTTCC CATCTCTCTT ATCTTTATGA CACGCTCTTA GAACAGAATC TTATCAGAGT CATTGAACCC TATTCTGCAG TCGAACTGTC ATGGATAGCT TCAGAAGTGG GGCAGAGCCT GCAAGTCATT GAAGACAAGT GAGTTTTTGT TTTCGTTCTC TTTTTACAAA GACCGACAAC TATTTAACCT ATTTTTCATC CAGGTTGAGT CAAATGATTT TGGACCAAAA GTTCTGTGGT ATTTTGAATG AACGCATGGG TACTCTCGAA GTTCATGATG ATTATTCAAA TGAGGTTAGT CGTTTTCCTA ATATCACGAA CAAAGACAGA AATGTGTGTT GACATAGCAT TCTTATAGGG GATATGTTCA ATGGCGTTGG GCACTCTGAA 
GCATATCAGC GACGTTGTGA ATGGCCTAAA TGATAAGGTT CGTTCATCTG AACATGTTGC GTCTAGACAG CAATAGCTGA TTTCTCTTAT CGTTGTCAGG CCGCGCAGAT GGTTTAATCA CCACAAGCAC TAGAACGAAC AGGGTTACCA GTAAAAGCAA ATTGGGCTTG CTATCTACAT CCACTAGACC ATTTTTATAA CATCTAGTGG
Protein sequence | MSVDTPTSEK LDQAASVFDK DPTTAERLYK EILQDDSQPG NEDLLRDKEV ALIKLGTLYR DSSMLDKLSQ LITDSRTFMS HIAKAKTTKL VRTLLDLFPQ DSKDMQMKVI QENIDWARTE KRVFLRQSLE IKLINVLLDA EKYQEALTIT QTLLKELKKF DDKIILTEVY LLESRAAHHM HNHALAKTAL TSARTTANSV YCPPTLQAQL DLQSGVIMAE DKDYKTAYSY FFEAFEGFCQ SAERDNRALS ALKYMLLCKI MIGSPNDVFS LLSLKSAAPY IGKDVDAMKA IATALEERSL DLFKTALQNY SDQLQKDEII RSHLSYLYDT LLEQNLIRVI EPYSAVELSW IASEVGQSLQ VIEDKLSQMI LDQKFCGILN ERMGTLEVHD DYSNEGICSM ALGTLKHISD VVNGLNDKAA QMV
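The length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive (the record does not state this, but the listed Gene Length is consistent with it) and that the coding sequence ends in a single stop codon not counted in Protein Length. The function names are hypothetical and chosen only for illustration.

```python
# Values below are copied from the record; helper names are hypothetical.

def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_bases(protein_length_aa: int) -> int:
    """Coding bases for a protein, assuming one uncounted stop codon."""
    return (protein_length_aa + 1) * 3


gene_length = span_length(280108, 282117)   # Start bp, End bp
cds_length = coding_bases(423)              # Protein Length in aa
non_coding = gene_length - cds_length       # bases not accounted for by the CDS

print(gene_length, cds_length, non_coding)
```

Under these assumptions the coordinates reproduce the listed Gene Length of 2010 bp, and the coding portion accounts for fewer bases than the full span. The remainder would be introns and any untranslated flanking sequence, which fits "Is gene spliced | Yes", although the record itself gives no such breakdown.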