Gene CNN00870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNN00870 
Symbol 
ID3255467 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006683 
Strand
Start bp280108 
End bp282117 
Gene Length2010 bp 
Protein Length423 aa 
Translation table 
GC content44% 
IMG OID638254503 
Productendopeptidase, putative 
Protein accessionXP_568638 
Protein GI58262456 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5159] 26S proteasome regulatory complex component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TGCTTTCATC CAGTTACTAT ACCCATCGTA TCCTTCGCCA CGACATTCTG AGGCTTTAGC 
AATCGAACAC ATTCTCTTAC ATTAGATCAT CTTGTCATGT CCGTCGACAC ACCCACCTCT
GAGAAACTCG ACCAAGCGGC GAGTGTCTTT GACAAAGATC CGACCACTGC AGAGCGACTC
TATAAGGAAA TATTGCAAGA TGACAGTCAA CGTGAGTTCC TCCTCTATTG ATGCGCTTTA
GCCTCCAGGG GTCCTGATAG AAGCCTTCTT CTTCATTAGC TGGAAACGAA GACCTTTTGA
GAGACAAAGA GGTCGCTCTC ATCAAGTTGG GCACTCTCTA CAGAGATTCA AGGTATTTAT
CTATCCGTAC AATGATCTAT AGAACCTTTC TGACACTTGT CTGTGACGAC AATTTTGGTA
GCATGCTCGA CAAGTTGTCT CAGTTGATAA CGGACTCTAG AACTTTCATG TCACATATCG
CAAAAGCCAA AACGACTAAG CTAGGTGAGC TTGCTTCCCC AGCCTTTTAA GTTACGACAT
TAATCAGTCA ATTCCAAAGT GCGCACACTC CTGGACCTTT TCCCTCAAGA TTCAAAGGAT
ATGCAGATGA AGGTCATTCA AGAGAATATA GACTGGGCCC GCACAGAAAA GAGGGTTTTC
TTGCGCCAAA GCCTGGAGAT AAAACTCATT AACGTGTGAG GCCTCTTTGT TTGAGAAGAC
TATGTTAGAT CTTCTTAGCT GACTACAATA CCAATCAGCT TGTTGGATGC CGAGAAATAC
CAAGAGGCCT TGACTATCAC CCAAACTCTT CTCAAAGAAC TCAAAAAATT CGATGATAAG
ATTATCCTGA CCGAGGTGTA TTTATTGGAG TCGCGTGCTG CCCATCACAT GCACAATCAC
GCGCTGGCGA AGACGGCACT AACCTCAGCT CGCACAACTG CCAACAGTGT ATACTGTCCG
CCTACGCTTC AGGCTCAACT TGACCTGCAA TCTGGAGTTA TCATGGCGGA GGACAAGGAT
TACAAAACAG CGTACTCCTA CTTCTTTGAA GCCTTTGAAG GTTTCTGTCA ATCCGCCGAG
AGAGACAATA GAGCACTGAG CGCCTTGAAA TACATGCTAT TGTGCAAGAT TATGATCGGA
TCCGTGAGTA TGATGTTTGG CTAACTCGCC CATATGCTGA TGAAGTCTGG ATTCCCCCAG
CCTAACGACG TCTTCTCGTT GTTATCATTG AAAAGCGCCG CCCCCTACAT AGGCAAAGAT
GTGGACGCGA TGAAAGCAAT TGCGACGGCT CTTGAGGAAC GCAGTCTTGA TCTTTTCAAG
ACAGCTTTGC AAAATTATTC CGACCGTAGG TCTGCGTAGT GTGAAGACTT AACAGCTAAC
GGCTCCTCCA GAATTGCAGA AAGACGAAAT CATTCGTTCC CATCTCTCTT ATCTTTATGA
CACGCTCTTA GAACAGAATC TTATCAGAGT CATTGAACCC TATTCTGCAG TCGAACTGTC
ATGGATAGCT TCAGAAGTGG GGCAGAGCCT GCAAGTCATT GAAGACAAGT GAGTTTTTGT
TTTCGTTCTC TTTTTACAAA GACCGACAAC TATTTAACCT ATTTTTCATC CAGGTTGAGT
CAAATGATTT TGGACCAAAA GTTCTGTGGT ATTTTGAATG AACGCATGGG TACTCTCGAA
GTTCATGATG ATTATTCAAA TGAGGTTAGT CGTTTTCCTA ATATCACGAA CAAAGACAGA
AATGTGTGTT GACATAGCAT TCTTATAGGG GATATGTTCA ATGGCGTTGG GCACTCTGAA
GCATATCAGC GACGTTGTGA ATGGCCTAAA TGATAAGGTT CGTTCATCTG AACATGTTGC
GTCTAGACAG CAATAGCTGA TTTCTCTTAT CGTTGTCAGG CCGCGCAGAT GGTTTAATCA
CCACAAGCAC TAGAACGAAC AGGGTTACCA GTAAAAGCAA ATTGGGCTTG CTATCTACAT
CCACTAGACC ATTTTTATAA CATCTAGTGG
 
Protein sequence
MSVDTPTSEK LDQAASVFDK DPTTAERLYK EILQDDSQPG NEDLLRDKEV ALIKLGTLYR 
DSSMLDKLSQ LITDSRTFMS HIAKAKTTKL VRTLLDLFPQ DSKDMQMKVI QENIDWARTE
KRVFLRQSLE IKLINVLLDA EKYQEALTIT QTLLKELKKF DDKIILTEVY LLESRAAHHM
HNHALAKTAL TSARTTANSV YCPPTLQAQL DLQSGVIMAE DKDYKTAYSY FFEAFEGFCQ
SAERDNRALS ALKYMLLCKI MIGSPNDVFS LLSLKSAAPY IGKDVDAMKA IATALEERSL
DLFKTALQNY SDQLQKDEII RSHLSYLYDT LLEQNLIRVI EPYSAVELSW IASEVGQSLQ
VIEDKLSQMI LDQKFCGILN ERMGTLEVHD DYSNEGICSM ALGTLKHISD VVNGLNDKAA
QMV