Gene CNK00100 details


Gene Information

Locus tag: CNK00100
Symbol:
ID: 3254426
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006680
Strand:
Start bp: 30894
End bp: 34689
Gene Length: 3796 bp
Protein Length: 1099 aa
Translation table:
GC content: 55%
IMG OID: 638253504
Product: ubiquitin-specific protease, putative
Protein accession: XP_567778
Protein GI: 58260736
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG5533] Ubiquitin C-terminal hydrolase
TIGRFAM ID:
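
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (which is consistent with the listed Gene Length) and assuming the coding sequence ends in a stop codon. The function names `gene_length` and `coding_length` are hypothetical helpers introduced here for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


span = gene_length(30894, 34689)   # values from Start bp / End bp
cds = coding_length(1099)          # value from Protein Length

print(span)        # 3796, matches the listed Gene Length
print(cds)         # 3300
print(span - cds)  # 496 bp of the span not accounted for by coding sequence
```

Under these assumptions, the 496 bp difference is what one would expect for a gene flagged as spliced: part of the genomic span is presumably intronic rather than coding.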


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0377728
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCGCCC CCTTCCCGGA CAACGGCTCG CCGGTCCCCT CTGTGCCGTT CCCGAGCATC 
CACAACCCCC TCCCCTACAC GCCCCCCCCC TACCATGCGC CCGCCGCCTA CTACCGCCAC
GCCCCGCCGT ACTATCCCCC GCCCCCCGCC CCGTACCCCT CCTATGCGCC GATACCCATG
GGCGGCGCAG GGCCTGGACC CGCGACGATG CAGAACGGCT ACGGCGGCTA CGGCGAGTAC
GGCGTGTACT ATGGCTACGC GGGCTATCCA GACTACCAGC CACCCCACGC TGGCTACCCG
CCTGCAGACG GCGACGACTA CAGCGACCCC GCAAAGACCC CGAGCTCGGC ACACGCCGTG
CCCCAGCCGC CCGCCATCGT CGCGCCAAAC CACACCCCCC ACGGCATGCC CCAGCAGTTC
CCGTCCTATC CCCCGAATCA CCCGTATGCC TTTGGCGGCG GTGTGGGCTA TCCCCAGGGG
TACACCCGGC CGCCGCAGCA TATGCACCAG CACCAGCATC ATCCGCAACC GCACTTTCAC
CCGCAACATC AGCAACATCT CCAGCAGGGG CCGTATGGAG GGTATGGCTA CCAGGAGGGG
TATCATGGGG GGAAATTAAA TCCTGCAGCG CAGGGGTTCA AGTATAATCG GTTCCAGCAG
CAGCAGCAGC AACAGCAACA GCAGCAACAG CAACAGCAAC AGCAGCATCA ACAGCAACAG
CAACAGCAGC AACAACAGCA ACAGCAGCAT CAACAGCAAC AGCAGCAACA GCAGCAGCAA
CAGCAGCAAC AACAACAGTT ACAGCAAGCA CACGCCCACG CACACGCACA AGCCCAAGCC
CAAGCCCAAT CCCAAGGACC GCCACGACAA CCTGTACATT CTCAACAACC CCCTCCTGCA
CCTGCGCCTA TCGCTCCTAG TCAAAATCAA CCATCCGAAC CATTACAACC GCCGATCCCC
AACGGTCACT CCCACCCCGA ACCTCCTACT CCCTCAACAC CCACAGCCAC CACAAAAGAG
CGAGAAGCAC AGTCTGCCGC ACAGCAAGAG CCTGTTCCTG CATCTGCAGC TGTCGAGCAG
GAAAAAGAGA CTGCAAAGGA TGAAGAAACT TGCCCCGACG CGTCCTCGGC GATCGCCGCA
CCTCGATGGA ACTTTATCCA TCCGTCATCT CTCCCATCAT CGTCATCCAC CGGTATTGCC
ACTTTACATT TGGAAATGAA CAAACATGCC CATCCGCAGA GAATAAGGCT GGTGCGGGCT
CGTCCGGGAG AGGAGAGTGA TGGGAATAGT TTTGCGATAG AGGTGAAGAC TGGTTTGCCA
GAGGAGATTG TCGTTGATGA GCAATCGCCA TCTCAGTCGC AAGAAAAGGG AACAGGACGA
GGAAAAGGTG GGAAGAAGAA GACGAAGGAA ACGGGCAGAG GAGTGTGGAA GACGGGCGAG
AAGAGGAGGG TAGAGTTGGT GTTTGGCGAA ATTGTCCCGG AGCAGGAGAA GGAAAAGGAG
GACAAGGCAG AGTGGGTGGG TGAAAAGCCA GTGGTCGATG AAAAGGAGGA GGAAAAAGTG
GAGAAGAAGG AAAGGTCTAC ACCAGCGCGC GCACCAGCAC CCGCACCCTC GCCCGCCAAA
CCTCGTTCAT GGGCCGCCCT CCTCAAAACC CCTACTCCTT CTTCCCCCTC CACCCCCGGC
GCAGTCCCCA GTTCTTCCGC CCGCGTCACT TCCACCACCG AGGCTGGGCC TTCCCGTCCA
CGACCATCCA CCTCTGCCTC CCCTTCCACC CCAAACCTTA CCGCCAACGT CAATGGTCTG
CAACTTTCAC CGCCAACCCA AGGACAACAA CAACAACAAG CCAAAACGTT CAACTATGCT
GCGGCTGCAA TGTCGTCCGT GCCTTCCCCA CATGAAGAGT TGATCAAGCT TTTGAGCGAG
GGCGTGAGTT CTCTCCGGGG ATCGGGGACG ACAGTAGGGT TGAGTGTAAA GGAGAAGGAA
GCGTTGATGG TGCCGAGGGG TTTGATCAAT ACCGGTAACA TGTGTTTTGC CAACACTGTG
AGTTGAATTT TTGCGACTGG ATAAAAGATC ATGAGAAAAA TGCTGACAAC AGTCGATGCG
ATAGATCCTT CAAGTATTGG TTTACTGTCC GCCGTTTACA GAGTTGTTTG AGGAATTTGG
AAAGAGGCTG AAAGCTGATT TGGCGAGGAA GACTCCGTTA TTAGAGGCGA TGTAAGTCTC
GTCTTCTCGT CTCCCCCAAA AAACACAAGG CGCGGGCTGA CAATGCGACG ATTATAGGAT
CATCTTTTTG CGAGAGTTTG TCTCATCACC CGAATCATCA ACATCAACAA CGTCAACACC
CAAGGGGAAA GGAAAGGACA AGGATTCGAG GAAAGAGGCG TTTATCCCGG AGAATGTGTA
TGATGCTATG AAGGAGAATA AGAGGTTTGA CTCTATGCGC GTGAGTCGAC GTCTAAATCC
AAATCTCGCC AGTGCTATGA AAAACGTTTG ACTGATAACA TTTTTATTTA CTTATTTGAT
AAACACAGAG AGGTTACCAA GAAGATGCTG AAGAGTATCT CGGATTCTTC CTCAACACTT
TACACGAAGA AATCATCTAC CTCCTCTCAC AAACATCCAC CACTTCTTCA TCTATGCCCA
ACGGTCAACC CAGCGACTCT TCTAGTCGAC AAGTCGAACG GCCCGTTTCC CCCCGCGCCG
GCGCCGGCAT CAACGGCAAC GCCGACTCAT CCTCTGGCTG GCTCGAAGTC GGCAAGAAAC
AAAAGACGCA CGTCGTGCGC GCTACCGAAT CCCGCGAGTC GGCCGTCTCA CGTTTATTCG
GCGGTAAACT CCGTTCCATC CTCCATACCC CCGGTCAAAA AGATAGCGTC ACCATCGAAC
CTTACCAACC GCTCCAACTC GACATTACCG GCCCTGCCAT CTTGTCCATC ACCGACGCCC
TCCGTGCGAT CTCCACACCT GAGATTATAC ATGGAGTATA TTCGGCGGCT AAAGGTGGGG
AAGTGGATGC GACCAAGACG GTGTATGTGG AGACTTGGCC AAAGGTTTTG ATTTGTCATT
TGAAGAGGTT TGTGTATGAT ACAGAGGAAG GTGGTGTGGT GAAGAGGAGT AAGGCGGTGG
CGTATGGTGT AGACTTGGCG ATCCCTAACG GTGAGTTGTT GTTTTTTTTT TTTTTGTTTT
TTTGTCTCGT GGTCTGACAG CGTTTGTTCG GGTGACTAGA AATAATATCC CCCGCAAGGC
GGACATCGAC CTCTATAAAA TACGCCCTCT TCGGTGTAGT CTACCACCAC GGTTCTTCAG
CTTCAGGAGG ACATTACACC GTTTCCGTTG CTCGTCCCTC TTCTTCTTCT TCCAGCTCCA
CATCGTCCAC CTCTATTCAA AGCCCCGCGG GAACATCATC GTCATTACCA CCACCACCAT
CATCAACGAC AATAGCATCC AGGTGGCTCC ACTTTGATGA CGAAAATGTA CGTGAAGTGC
GAGAAGAAGA TGTGGTTGTC TCCCTGGACC AAGCCAAGGG TGGGGAAACC GGGTCGGTCG
GTGGGAGGGA GAGGTGTGCG TACTTGCTGT TCTATAAGAG GGTGCAGTAA GAAGCATAGA
GAAAAGAAAA AAAAAAAAGC GCGTATGCGC CGGGTTTTAG AACAGGGGAC GTTTGATCTT
GTAGAGCGGG TTGGGAGGTG TCGTGAGTAA AATTGATAGA CGTTGGGTGC TAGGAAGGGC
AGGGGTAGGC TTGAGAAGCG ATCGGTACGC GGAAGGAAGG GGAGGAGCCA AAGAAGGAGG
TAGAGGCGAG ATGGGG
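
The gene sequence is listed in blocks of ten bases, so it has to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction for comparison with the GC content field. It is an illustration only: `clean_sequence` and `gc_fraction` are hypothetical helper names, and the record does not say how its GC value was calculated or rounded.

```python
def clean_sequence(text: str) -> str:
    """Join a blocked sequence into one uppercase string, dropping all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among unambiguous bases (A, C, G, T)."""
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no unambiguous bases")
    gc = sum(1 for base in counted if base in "GC")
    return gc / len(counted)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGCCGCCC CCTTCCCGGA CAACGGCTCG CCGGTCCCCT CTGTGCCGTT CCCGAGCATC"
seq = clean_sequence(first_line)

print(len(seq))                    # 60 bases on a full line
print(round(gc_fraction(seq), 3))  # GC fraction of this line only
```

Pasting the whole gene sequence into `clean_sequence` and checking that `len(seq)` equals the listed Gene Length is a quick way to confirm that no lines were lost in copying.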
 
Protein sequence
MAAPFPDNGS PVPSVPFPSI HNPLPYTPPP YHAPAAYYRH APPYYPPPPA PYPSYAPIPM 
GGAGPGPATM QNGYGGYGEY GVYYGYAGYP DYQPPHAGYP PADGDDYSDP AKTPSSAHAV
PQPPAIVAPN HTPHGMPQQF PSYPPNHPYA FGGGVGYPQG YTRPPQHMHQ HQHHPQPHFH
PQHQQHLQQG PYGGYGYQEG YHGGKLNPAA QGFKYNRFQQ QQQQQQQQQQ QQQQQHQQQQ
QQQQQQQQQH QQQQQQQQQQ QQQQQQLQQA HAHAHAQAQA QAQSQGPPRQ PVHSQQPPPA
PAPIAPSQNQ PSEPLQPPIP NGHSHPEPPT PSTPTATTKE REAQSAAQQE PVPASAAVEQ
EKETAKDEET CPDASSAIAA PRWNFIHPSS LPSSSSTGIA TLHLEMNKHA HPQRIRLVRA
RPGEESDGNS FAIEVKTGLP EEIVVDEQSP SQSQEKGTGR GKGGKKKTKE TGRGVWKTGE
KRRVELVFGE IVPEQEKEKE DKAEWVGEKP VVDEKEEEKV EKKERSTPAR APAPAPSPAK
PRSWAALLKT PTPSSPSTPG AVPSSSARVT STTEAGPSRP RPSTSASPST PNLTANVNGL
QLSPPTQGQQ QQQAKTFNYA AAAMSSVPSP HEELIKLLSE GVSSLRGSGT TVGLSVKEKE
ALMVPRGLIN TGNMCFANTI LQVLVYCPPF TELFEEFGKR LKADLARKTP LLEAMIIFLR
EFVSSPESST STTSTPKGKG KDKDSRKEAF IPENVYDAMK ENKRFDSMRR GYQEDAEEYL
GFFLNTLHEE IIYLLSQTST TSSSMPNGQP SDSSSRQVER PVSPRAGAGI NGNADSSSGW
LEVGKKQKTH VVRATESRES AVSRLFGGKL RSILHTPGQK DSVTIEPYQP LQLDITGPAI
LSITDALRAI STPEIIHGVY SAAKGGEVDA TKTVYVETWP KVLICHLKRF VYDTEEGGVV
KRSKAVAYGV DLAIPNEIIS PARRTSTSIK YALFGVVYHH GSSASGGHYT VSVARPSSSS
SSSTSSTSIQ SPAGTSSSLP PPPSSTTIAS RWLHFDDENV REVREEDVVV SLDQAKGGET
GSVGGRERCA YLLFYKRVQ