Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNB05540 |
Symbol | |
ID | 3255893 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | - |
Start bp | 1557731 |
End bp | 1558874 |
Gene Length | 1144 bp |
Protein Length | 299 aa |
Translation table | |
GC content | 51% |
IMG OID | 638255196 |
Product | proteasome subunit alpha type 1, putative |
Protein accession | XP_569297 |
Protein GI | 58264282 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0638] 20S proteasome, alpha and beta subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.000947044 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCAGAA ACTCGTACGA CTCGGATAAC ACTACCTTCT CCCCCCAGGG AAAGCTTTTT CAAGTAGAGT ACGCTCTCGA AGCCGTCAAA CAGGGTTCAG CAGCCATCGG CCTCAGGTCC AACACCCACG CCGTCTTGCT TACTCTCAAG GTGTGTTGCT CTCTCGTGGG GTCGTGGCAG CTACAATGAA GGGAACGCTT ACAAGGAGCA ATGATGTAGC GATCAACTGG CGAGCTTGCG ACATATCAGA AGAAGCTCAT CAGGATTGAC GATCACGTTG GTGTCGCCAT TGCTGGTTTG ACCAGCGATG CTCGTGTCTT GAGGTATGTC TTTAACATTT CTTCTTAGAA GACGACATGC TCTCTTTTAA ATTATCAGGG TTGCTGACTG CCTGTAATAT AGCAATTATA TGCGACAAAG GGCTATGCAA TCTAGGATGA CATACGGTCG CGCCACGCCT GTCGCTCGTC TCGTCCAAAG TATCGCCGAC CGCGCTCAAA CAAACACTCA AGAGTATGGG CGAAGACCGT ACGGCGTTGG ATTCCTTGTT ATCGGAAAGG ACGTAGGTTG AACCCTTCAT CTCTCCCTCT GATTTCCTTT CTATTTTTTC TTTTCTTTTC GAACGCCGAG CTAATTCACA CATGTCGTGT CATAACAGGA AACCGGCCCT CACCTCTTTG AATTCTCCCC AGCCGGCACG GCTTTTGAAT ACTATGCGCA CTCCATCGGT GCCCGCTCCC AATCGGCAAA GACATACCTT GAAAAAAACT ATCATCTGTT CCCCAATGCC TCACTTGAAG AGTTGATCAA CCATGGTCTT TCGGCTTTGC ATGATACCCT TCAACAGGAC AAACATCTCT CCTCTTTGAA TACTTCTATA GCCATTATCG GTCCTGCCGA GGGACAAGGA GTGGAGGATG TGAGCAAATC AGCAGCGGCA CAGAGAGGTG GATTTAGGGT GTGGGAGAAT GAAGGTGTGG AAGGGATTTT GAGAGGATGG AGGAGGAGTA GGGGGGAGCC AGAGGAGGGG CCAGAGGCTG AAGGCGAGTC TCAAGCTGAG GCTTCAGCTG AGGGCCAGAA TGAAGGTGGG GCGGGACAGC AAGAGGGACA GCTCCCGGCG GAGGACGTGA CGATGCAAGA GTGA
|
Protein sequence | MFRNSYDSDN TTFSPQGKLF QVEYALEAVK QGSAAIGLRS NTHAVLLTLK RSTGELATYQ KKLIRIDDHV GVAIAGLTSD ARVLSNYMRQ RAMQSRMTYG RATPVARLVQ SIADRAQTNT QEYGRRPYGV GFLVIGKDET GPHLFEFSPA GTAFEYYAHS IGARSQSAKT YLEKNYHLFP NASLEELINH GLSALHDTLQ QDKHLSSLNT SIAIIGPAEG QGVEDVSKSA AAQRGGFRVW ENEGVEGILR GWRRSRGEPE EGPEAEGESQ AEASAEGQNE GGAGQQEGQL PAEDVTMQE
|
| |