Gene Information |
Locus tag | CNK00100 |
Symbol | |
ID | 3254426 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | - |
Start bp | 30894 |
End bp | 34689 |
Gene Length | 3796 bp |
Protein Length | 1099 aa |
Translation table | |
GC content | 55% |
IMG OID | 638253504 |
Product | ubiquitin-specific protease, putative |
Protein accession | XP_567778 |
Protein GI | 58260736 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5533] Ubiquitin C-terminal hydrolase |
TIGRFAM ID | |
|
|
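The reported gene length is consistent with start and end coordinates that are both counted, i.e. length = end - start + 1. The sketch below checks that arithmetic against the values in the record. It assumes the coordinates are 1-based and inclusive, which the record does not state explicitly, and the function name `inclusive_length` is hypothetical.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both included.

    Assumption: coordinates are 1-based and inclusive.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information table above.
START_BP = 30894
END_BP = 34689
REPORTED_LENGTH = 3796

computed = inclusive_length(START_BP, END_BP)
print(computed, computed == REPORTED_LENGTH)
```

The strand ("-") does not affect this calculation, since the length depends only on the span between the two coordinates.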
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0377728 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGCCC CCTTCCCGGA CAACGGCTCG CCGGTCCCCT CTGTGCCGTT CCCGAGCATC CACAACCCCC TCCCCTACAC GCCCCCCCCC TACCATGCGC CCGCCGCCTA CTACCGCCAC GCCCCGCCGT ACTATCCCCC GCCCCCCGCC CCGTACCCCT CCTATGCGCC GATACCCATG GGCGGCGCAG GGCCTGGACC CGCGACGATG CAGAACGGCT ACGGCGGCTA CGGCGAGTAC GGCGTGTACT ATGGCTACGC GGGCTATCCA GACTACCAGC CACCCCACGC TGGCTACCCG CCTGCAGACG GCGACGACTA CAGCGACCCC GCAAAGACCC CGAGCTCGGC ACACGCCGTG CCCCAGCCGC CCGCCATCGT CGCGCCAAAC CACACCCCCC ACGGCATGCC CCAGCAGTTC CCGTCCTATC CCCCGAATCA CCCGTATGCC TTTGGCGGCG GTGTGGGCTA TCCCCAGGGG TACACCCGGC CGCCGCAGCA TATGCACCAG CACCAGCATC ATCCGCAACC GCACTTTCAC CCGCAACATC AGCAACATCT CCAGCAGGGG CCGTATGGAG GGTATGGCTA CCAGGAGGGG TATCATGGGG GGAAATTAAA TCCTGCAGCG CAGGGGTTCA AGTATAATCG GTTCCAGCAG CAGCAGCAGC AACAGCAACA GCAGCAACAG CAACAGCAAC AGCAGCATCA ACAGCAACAG CAACAGCAGC AACAACAGCA ACAGCAGCAT CAACAGCAAC AGCAGCAACA GCAGCAGCAA CAGCAGCAAC AACAACAGTT ACAGCAAGCA CACGCCCACG CACACGCACA AGCCCAAGCC CAAGCCCAAT CCCAAGGACC GCCACGACAA CCTGTACATT CTCAACAACC CCCTCCTGCA CCTGCGCCTA TCGCTCCTAG TCAAAATCAA CCATCCGAAC CATTACAACC GCCGATCCCC AACGGTCACT CCCACCCCGA ACCTCCTACT CCCTCAACAC CCACAGCCAC CACAAAAGAG CGAGAAGCAC AGTCTGCCGC ACAGCAAGAG CCTGTTCCTG CATCTGCAGC TGTCGAGCAG GAAAAAGAGA CTGCAAAGGA TGAAGAAACT TGCCCCGACG CGTCCTCGGC GATCGCCGCA CCTCGATGGA ACTTTATCCA TCCGTCATCT CTCCCATCAT CGTCATCCAC CGGTATTGCC ACTTTACATT TGGAAATGAA CAAACATGCC CATCCGCAGA GAATAAGGCT GGTGCGGGCT CGTCCGGGAG AGGAGAGTGA TGGGAATAGT TTTGCGATAG AGGTGAAGAC TGGTTTGCCA GAGGAGATTG TCGTTGATGA GCAATCGCCA TCTCAGTCGC AAGAAAAGGG AACAGGACGA GGAAAAGGTG GGAAGAAGAA GACGAAGGAA ACGGGCAGAG GAGTGTGGAA GACGGGCGAG AAGAGGAGGG TAGAGTTGGT GTTTGGCGAA ATTGTCCCGG AGCAGGAGAA GGAAAAGGAG GACAAGGCAG AGTGGGTGGG TGAAAAGCCA GTGGTCGATG AAAAGGAGGA GGAAAAAGTG GAGAAGAAGG AAAGGTCTAC ACCAGCGCGC GCACCAGCAC CCGCACCCTC GCCCGCCAAA CCTCGTTCAT GGGCCGCCCT CCTCAAAACC CCTACTCCTT CTTCCCCCTC CACCCCCGGC GCAGTCCCCA GTTCTTCCGC CCGCGTCACT TCCACCACCG AGGCTGGGCC TTCCCGTCCA CGACCATCCA CCTCTGCCTC CCCTTCCACC CCAAACCTTA CCGCCAACGT CAATGGTCTG 
CAACTTTCAC CGCCAACCCA AGGACAACAA CAACAACAAG CCAAAACGTT CAACTATGCT GCGGCTGCAA TGTCGTCCGT GCCTTCCCCA CATGAAGAGT TGATCAAGCT TTTGAGCGAG GGCGTGAGTT CTCTCCGGGG ATCGGGGACG ACAGTAGGGT TGAGTGTAAA GGAGAAGGAA GCGTTGATGG TGCCGAGGGG TTTGATCAAT ACCGGTAACA TGTGTTTTGC CAACACTGTG AGTTGAATTT TTGCGACTGG ATAAAAGATC ATGAGAAAAA TGCTGACAAC AGTCGATGCG ATAGATCCTT CAAGTATTGG TTTACTGTCC GCCGTTTACA GAGTTGTTTG AGGAATTTGG AAAGAGGCTG AAAGCTGATT TGGCGAGGAA GACTCCGTTA TTAGAGGCGA TGTAAGTCTC GTCTTCTCGT CTCCCCCAAA AAACACAAGG CGCGGGCTGA CAATGCGACG ATTATAGGAT CATCTTTTTG CGAGAGTTTG TCTCATCACC CGAATCATCA ACATCAACAA CGTCAACACC CAAGGGGAAA GGAAAGGACA AGGATTCGAG GAAAGAGGCG TTTATCCCGG AGAATGTGTA TGATGCTATG AAGGAGAATA AGAGGTTTGA CTCTATGCGC GTGAGTCGAC GTCTAAATCC AAATCTCGCC AGTGCTATGA AAAACGTTTG ACTGATAACA TTTTTATTTA CTTATTTGAT AAACACAGAG AGGTTACCAA GAAGATGCTG AAGAGTATCT CGGATTCTTC CTCAACACTT TACACGAAGA AATCATCTAC CTCCTCTCAC AAACATCCAC CACTTCTTCA TCTATGCCCA ACGGTCAACC CAGCGACTCT TCTAGTCGAC AAGTCGAACG GCCCGTTTCC CCCCGCGCCG GCGCCGGCAT CAACGGCAAC GCCGACTCAT CCTCTGGCTG GCTCGAAGTC GGCAAGAAAC AAAAGACGCA CGTCGTGCGC GCTACCGAAT CCCGCGAGTC GGCCGTCTCA CGTTTATTCG GCGGTAAACT CCGTTCCATC CTCCATACCC CCGGTCAAAA AGATAGCGTC ACCATCGAAC CTTACCAACC GCTCCAACTC GACATTACCG GCCCTGCCAT CTTGTCCATC ACCGACGCCC TCCGTGCGAT CTCCACACCT GAGATTATAC ATGGAGTATA TTCGGCGGCT AAAGGTGGGG AAGTGGATGC GACCAAGACG GTGTATGTGG AGACTTGGCC AAAGGTTTTG ATTTGTCATT TGAAGAGGTT TGTGTATGAT ACAGAGGAAG GTGGTGTGGT GAAGAGGAGT AAGGCGGTGG CGTATGGTGT AGACTTGGCG ATCCCTAACG GTGAGTTGTT GTTTTTTTTT TTTTTGTTTT TTTGTCTCGT GGTCTGACAG CGTTTGTTCG GGTGACTAGA AATAATATCC CCCGCAAGGC GGACATCGAC CTCTATAAAA TACGCCCTCT TCGGTGTAGT CTACCACCAC GGTTCTTCAG CTTCAGGAGG ACATTACACC GTTTCCGTTG CTCGTCCCTC TTCTTCTTCT TCCAGCTCCA CATCGTCCAC CTCTATTCAA AGCCCCGCGG GAACATCATC GTCATTACCA CCACCACCAT CATCAACGAC AATAGCATCC AGGTGGCTCC ACTTTGATGA CGAAAATGTA CGTGAAGTGC GAGAAGAAGA TGTGGTTGTC TCCCTGGACC AAGCCAAGGG TGGGGAAACC GGGTCGGTCG GTGGGAGGGA GAGGTGTGCG TACTTGCTGT TCTATAAGAG GGTGCAGTAA GAAGCATAGA GAAAAGAAAA 
AAAAAAAAGC GCGTATGCGC CGGGTTTTAG AACAGGGGAC GTTTGATCTT GTAGAGCGGG TTGGGAGGTG TCGTGAGTAA AATTGATAGA CGTTGGGTGC TAGGAAGGGC AGGGGTAGGC TTGAGAAGCG ATCGGTACGC GGAAGGAAGG GGAGGAGCCA AAGAAGGAGG TAGAGGCGAG ATGGGG
|
Protein sequence | MAAPFPDNGS PVPSVPFPSI HNPLPYTPPP YHAPAAYYRH APPYYPPPPA PYPSYAPIPM GGAGPGPATM QNGYGGYGEY GVYYGYAGYP DYQPPHAGYP PADGDDYSDP AKTPSSAHAV PQPPAIVAPN HTPHGMPQQF PSYPPNHPYA FGGGVGYPQG YTRPPQHMHQ HQHHPQPHFH PQHQQHLQQG PYGGYGYQEG YHGGKLNPAA QGFKYNRFQQ QQQQQQQQQQ QQQQQHQQQQ QQQQQQQQQH QQQQQQQQQQ QQQQQQLQQA HAHAHAQAQA QAQSQGPPRQ PVHSQQPPPA PAPIAPSQNQ PSEPLQPPIP NGHSHPEPPT PSTPTATTKE REAQSAAQQE PVPASAAVEQ EKETAKDEET CPDASSAIAA PRWNFIHPSS LPSSSSTGIA TLHLEMNKHA HPQRIRLVRA RPGEESDGNS FAIEVKTGLP EEIVVDEQSP SQSQEKGTGR GKGGKKKTKE TGRGVWKTGE KRRVELVFGE IVPEQEKEKE DKAEWVGEKP VVDEKEEEKV EKKERSTPAR APAPAPSPAK PRSWAALLKT PTPSSPSTPG AVPSSSARVT STTEAGPSRP RPSTSASPST PNLTANVNGL QLSPPTQGQQ QQQAKTFNYA AAAMSSVPSP HEELIKLLSE GVSSLRGSGT TVGLSVKEKE ALMVPRGLIN TGNMCFANTI LQVLVYCPPF TELFEEFGKR LKADLARKTP LLEAMIIFLR EFVSSPESST STTSTPKGKG KDKDSRKEAF IPENVYDAMK ENKRFDSMRR GYQEDAEEYL GFFLNTLHEE IIYLLSQTST TSSSMPNGQP SDSSSRQVER PVSPRAGAGI NGNADSSSGW LEVGKKQKTH VVRATESRES AVSRLFGGKL RSILHTPGQK DSVTIEPYQP LQLDITGPAI LSITDALRAI STPEIIHGVY SAAKGGEVDA TKTVYVETWP KVLICHLKRF VYDTEEGGVV KRSKAVAYGV DLAIPNEIIS PARRTSTSIK YALFGVVYHH GSSASGGHYT VSVARPSSSS SSSTSSTSIQ SPAGTSSSLP PPPSSTTIAS RWLHFDDENV REVREEDVVV SLDQAKGGET GSVGGRERCA YLLFYKRVQ
|
| |