Gene CNB05540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNB05540 
Symbol 
ID3255893 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006684 
Strand
Start bp1557731 
End bp1558874 
Gene Length1144 bp 
Protein Length299 aa 
Translation table 
GC content51% 
IMG OID638255196 
Productproteasome subunit alpha type 1, putative 
Protein accessionXP_569297 
Protein GI58264282 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0638] 20S proteasome, alpha and beta subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.000947044 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCAGAA ACTCGTACGA CTCGGATAAC ACTACCTTCT CCCCCCAGGG AAAGCTTTTT 
CAAGTAGAGT ACGCTCTCGA AGCCGTCAAA CAGGGTTCAG CAGCCATCGG CCTCAGGTCC
AACACCCACG CCGTCTTGCT TACTCTCAAG GTGTGTTGCT CTCTCGTGGG GTCGTGGCAG
CTACAATGAA GGGAACGCTT ACAAGGAGCA ATGATGTAGC GATCAACTGG CGAGCTTGCG
ACATATCAGA AGAAGCTCAT CAGGATTGAC GATCACGTTG GTGTCGCCAT TGCTGGTTTG
ACCAGCGATG CTCGTGTCTT GAGGTATGTC TTTAACATTT CTTCTTAGAA GACGACATGC
TCTCTTTTAA ATTATCAGGG TTGCTGACTG CCTGTAATAT AGCAATTATA TGCGACAAAG
GGCTATGCAA TCTAGGATGA CATACGGTCG CGCCACGCCT GTCGCTCGTC TCGTCCAAAG
TATCGCCGAC CGCGCTCAAA CAAACACTCA AGAGTATGGG CGAAGACCGT ACGGCGTTGG
ATTCCTTGTT ATCGGAAAGG ACGTAGGTTG AACCCTTCAT CTCTCCCTCT GATTTCCTTT
CTATTTTTTC TTTTCTTTTC GAACGCCGAG CTAATTCACA CATGTCGTGT CATAACAGGA
AACCGGCCCT CACCTCTTTG AATTCTCCCC AGCCGGCACG GCTTTTGAAT ACTATGCGCA
CTCCATCGGT GCCCGCTCCC AATCGGCAAA GACATACCTT GAAAAAAACT ATCATCTGTT
CCCCAATGCC TCACTTGAAG AGTTGATCAA CCATGGTCTT TCGGCTTTGC ATGATACCCT
TCAACAGGAC AAACATCTCT CCTCTTTGAA TACTTCTATA GCCATTATCG GTCCTGCCGA
GGGACAAGGA GTGGAGGATG TGAGCAAATC AGCAGCGGCA CAGAGAGGTG GATTTAGGGT
GTGGGAGAAT GAAGGTGTGG AAGGGATTTT GAGAGGATGG AGGAGGAGTA GGGGGGAGCC
AGAGGAGGGG CCAGAGGCTG AAGGCGAGTC TCAAGCTGAG GCTTCAGCTG AGGGCCAGAA
TGAAGGTGGG GCGGGACAGC AAGAGGGACA GCTCCCGGCG GAGGACGTGA CGATGCAAGA
GTGA
 
Protein sequence
MFRNSYDSDN TTFSPQGKLF QVEYALEAVK QGSAAIGLRS NTHAVLLTLK RSTGELATYQ 
KKLIRIDDHV GVAIAGLTSD ARVLSNYMRQ RAMQSRMTYG RATPVARLVQ SIADRAQTNT
QEYGRRPYGV GFLVIGKDET GPHLFEFSPA GTAFEYYAHS IGARSQSAKT YLEKNYHLFP
NASLEELINH GLSALHDTLQ QDKHLSSLNT SIAIIGPAEG QGVEDVSKSA AAQRGGFRVW
ENEGVEGILR GWRRSRGEPE EGPEAEGESQ AEASAEGQNE GGAGQQEGQL PAEDVTMQE