Gene CNA01760 details


Gene Information

Locus tag: CNA01760
Symbol:
ID: 3253833
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006670
Strand:
Start bp: 471294
End bp: 473648
Gene Length: 2355 bp
Protein Length: 708 aa
Translation table:
GC content: 48%
IMG OID: 638252509
Product: hypothetical protein
Protein accession: XP_566542
Protein GI: 58258259
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG5533] Ubiquitin C-terminal hydrolase
TIGRFAM ID:
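
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive (which matches the stated 2355 bp gene length) and that the coding region ends in a single three-base stop codon. The function names span_length and coding_length are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 471294
END_BP = 473648
PROTEIN_LENGTH_AA = 708


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode the protein: three per residue, plus a stop codon."""
    return protein_aa * 3 + (3 if include_stop else 0)


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)
# Bases in the gene span that are not coding codons.
non_coding = gene_len - cds_len

print(gene_len, cds_len, non_coding)
```

Under these assumptions the gene span is 2355 bp, of which 2127 bp are needed to encode 708 residues plus a stop codon. The remaining 228 bp would be non-coding sequence within the span (for example introns or untranslated regions), which is consistent with the gene being flagged as spliced.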


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AGCTCATAGC TCTCGGACCC TCCCGAGACA TAGCATCTAC GCAATGAGAT GAGGAAACGA 
CGCACGACTC CCAGATCAAA CATCTCAAAG AACGCCAACA TCAGCAAACA ATTGGCCACT
ACACTGCCTG AGGGGGAAAC GCAGTCCACA AGAAAGATAC AAGCTGCAAG CGTCGAGAAG
CAAAAACCGG TAATAGAGTT ATTAGGCGCC GGCAGATCCA CTCTTGTCTC CAACGACGAG
GACTACTTAT CTCCTCAAGT ACCACCCAAA CCCATTATCT CTTTGGAACA TCCATCGTCT
ACAACTTCGA CTATCAAGGT CGCCACTACC TTTCCGCACA TCGTGTTTGA TGATCTCTGG
ATAGATATTC TATCATTTTT TGCTCAAGTC TATCTCGCAT TATACGAAAT GACTCTTGGA
ATTGGAAAGT GGTGGGCCGG TACTGAGATA GAAGATAGCC ATAGGGGCAA AGAGACAAAA
GAAAAAGAGA TCAAGGCCAA GAGGCGTAGG CGAAGGGTCG AGCCGTCGAG CAATAGTTCT
GGTAGGTTTT TTTCTTTCTC CTTTTCCCTC GTTATATTAT ATGTATGATT AGTTTACGGG
TGAGAAACAG ATTCGGCAAC AGGCCGATTC TTTCCTGGCA TGGTAAATTT AAGTGGGACG
CTGTGTTATA TGAACTCTGT TCTTCAGGTG GGTTTCATAG CATCGGCTTC TCATGCTTCA
GTCGGACCTT ACTTGATGCT CCTTTTAGTC TGTCGCCTCC ATACCCTCAC TCATCATCCA
TCTCGAAAAA GTTATCGATT TGGCGGTAGA AGTGGACATA CCGACCCATG TCACAGATGC
TCTTCTCGAT GTTATCCGGG ATCTCAACAC TCCCAATAAA CGCCCCCCAC CTGCTCTTCG
ACCACATAAC CTTCTCACTG CACTATATCC TCTCCCTGCG ATTCGACGAC TATTGAGTAC
TCATGAGCAG CAGGACGCCC ATGAACTATT TCTGGTGCTG GCTGAGGCGG TCTCTGATGA
AGTTGTCAAG GTTGCTGCAG AGGTCGCCAA AGTGCGAGGC ATAGGAGAAA TTATCTCATT
ACAGGGATAC TTGTCCGGCA AAAATGATGG AAGTGACCAA CCTAGGAGAG CGGGCGACAT
GGAGGGAGCC AAAAGGAGGC AAAGAATAAG AGGTGTGGCT CAACCATGGG AGGGCTTGTT
GGCGAGAAGA AGAGTGTGTC AACGTTGCGG TTGGAGTGAG GCTATTAGGA TGGATACTCT
AGGGGGGATT GAATTGCCTG TCCCGTTACA CGTGTGTAAA ACAATTCACT CAATGTGCTG
TAGCTGAACG CTTGTTTTTA GGGCAATACC ACCCTTGACG CCTGTATAAT GGAGTACCTC
GCGCCCGAAA TACTTTCCGA TGTCACATGC GAAATGTGCT CTCTCAAGCT CACACTGGAT
TACTACACTT CCGAAGTTGA GCGATTGTCG GGCTCCTCCT GTAAAAAGAC TTCTACGAAG
GAAGAGAATG AGCCCGGTAG TGACAATCAA GTCAGTGCTA GTCGGAAAAA GAAAGCTCGC
GAAGCCAGGC GCGCAGAGAC CAGGCTACGA GAAATGGTCA ATTCCAACAC AGTTAGCGAT
TTTGGCGAAC CTAATCTAAC GCCCCTACCG TCATCCGGCT TGACTGCGCA AATACCGGTC
AAATGGCTTA CGGCTAAAAC AGCGTCCACT CGACAGTGTG TCATAGTTCG TCCCCCCCAA
TCTCTCCAAC TACATTTTAT CCGCTCGGAA TTCACCATGT ATGGGACGGT GCAAAAGAAG
ACCGCAAAAG TCTCATTTCC ATTGTTATTA GATCTCACAA GATTTGTCTC TGACGGAGTG
TGGGAAGAAA GAGGCGGGAT AAAAAATATG CTCGCATCAG TCTCAACCGA AAAAACGATC
CCCCCAATAA CAGGACAAAG GGTAATATAC CAACTTGAGT CAGCTATTCT TCACTACGGA
TTTACCCACT CTTCTGGTCA TTTCGTCAGT ATTCGACGGA AACCCTCCCC GTCTTTGACC
AAAGAGGAAA AGTCATTCCG ACCCTTGCAA GTGGCAAAAA ATTGTCCTGA TGGGTGTAAA
TGCCAGGATT GTGTTTACTT TGGGCAGGTG AGGGATCTCG AAAATACGAA AGTACCAGGC
AGAGGATGGC TGCGGATTAG CGATGCAGAT GTAGAAGAGG TCGGCGAAGA GGCACTTCAC
GAAGCTGGTG GGGCCGTCGT CATGCTTTTC TATGAGCGTG TCATGGAGTA TGTAGCAAAA
AAAGTCGTGC ATGATGAGCG ACTTCAGGGA GACGGTGACA GCCAAGATGC TGGAGGTTTA
GAAGATAACA TTTAA
 
Protein sequence
MRKRRTTPRS NISKNANISK QLATTLPEGE TQSTRKIQAA SVEKQKPVIE LLGAGRSTLV 
SNDEDYLSPQ VPPKPIISLE HPSSTTSTIK VATTFPHIVF DDLWIDILSF FAQVYLALYE
MTLGIGKWWA GTEIEDSHRG KETKEKEIKA KRRRRRVEPS SNSSDSATGR FFPGMVNLSG
TLCYMNSVLQ SVASIPSLII HLEKVIDLAV EVDIPTHVTD ALLDVIRDLN TPNKRPPPAL
RPHNLLTALY PLPAIRRLLS THEQQDAHEL FLVLAEAVSD EVVKVAAEVA KVRGIGEIIS
LQGYLSGKND GSDQPRRAGD MEGAKRRQRI RGVAQPWEGL LARRRVCQRC GWSEAIRMDT
LGGIELPVPL HGNTTLDACI MEYLAPEILS DVTCEMCSLK LTLDYYTSEV ERLSGSSCKK
TSTKEENEPG SDNQVSASRK KKAREARRAE TRLREMVNSN TVSDFGEPNL TPLPSSGLTA
QIPVKWLTAK TASTRQCVIV RPPQSLQLHF IRSEFTMYGT VQKKTAKVSF PLLLDLTRFV
SDGVWEERGG IKNMLASVST EKTIPPITGQ RVIYQLESAI LHYGFTHSSG HFVSIRRKPS
PSLTKEEKSF RPLQVAKNCP DGCKCQDCVY FGQVRDLENT KVPGRGWLRI SDADVEEVGE
EALHEAGGAV VMLFYERVME YVAKKVVHDE RLQGDGDSQD AGGLEDNI