Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA07360 |
Symbol | |
ID | 3253578 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | + |
Start bp | 2011335 |
End bp | 2016131 |
Gene Length | 4797 bp |
Protein Length | 1312 aa |
Translation table | |
GC content | 50% |
IMG OID | 638253060 |
Product | ubiquitin-specific protease, putative |
Protein accession | XP_567052 |
Protein GI | 58259279 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5533] Ubiquitin C-terminal hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCGGC CGCCGTCCCC GCCATCGTTC ACAGGGCTCC CGCTCTCGCA GCTGCTGCAG TACGCCTCGG ACGACAACGG CGCAGGCAGC CACCCGCCCA AAGTATGGTT CGAACGCGCA TGCTACAACG CCGACAAGGC AAAGTTGGCA GAGCGCAAAC AGAGCAAAGA GGACATGTTC GTCAGCTACA GCAGGGCGTG CCAGTCCTAT ATCAACGTTG CGATGCACAA CGACTGGCCA GAGACCAAGA AAAAGGATCC TCAGTTGGCC GCCAGAGTCA AGGATTTCAA GCCGGTATGT GCCTCTCCGC CGCTGTCCGT ATATGCCCAC ACTCATACCA TCTAGATGTA CGATTCATTT GTCGCAAAGG CCAAGGCTTT AAAAGAGGAA CTTCGGCATG CGGAGGCATC TTCACCCGCT CAGTCCAAAC CAGAATCATC CAAACCATCG GTATCAAGAC AACGATCTGG GCCTGAAATA ACATCAATCG GCAACATCAG AGATCGAATG CAGGCACTTG CCGGACACGG AATGGAGGTT GAGACTATTC AGTCGAAGCG CTTAAGTAGA GAGGCTCCTG CCAAAGCGCC AAAGCCTGCT GCATTGAGTA GCGTGACCAA CTCTGCGGCA AGGTCTAGGA GTGGCAGCGA GTCGCGACCT TTGTCCACAA CAACGACCAA TGGGCGGCAA CTCGTGGTCA CTGCTTCATC TAAACCACCA CCAACTGCGT CACAAGAGAA ATCACCTCCA TCTGCACCCG TCAACGTACA AGCCAATGGA TCGTCCAGCA GATCACGGCG ATCCACTTTG ACGAGTGAAG GGCAGGGCAT GAGCGTATCT GTTGGGTCTG TGGCAGCTTC ACCGGCGCAA TCACCGATGC CCACACCTGC AGCTGGCCCA TACAGATCGC CCTTACCCTC CATCCCTTCC TCTCCTCAGC CTCTTCCAGC ATCTCATCCG CCATTACCCT CTCCCGAGCC ACCACGACCA AATGTTCCCA TCAACTCGTC CGACATGGAG TCTCATCCTG AAGACGGCTT GGCAGAGTTT GAGCGTGCTT TCCCATCTCT CTCCGAATTT GGGAAACAAT GGGATGGCGA CTCGTTACAG CCTGACACGA ATAGTAATGA TATGTACCAC GCCCCGCAGT ACCCGAAACC TCCAAAGCAA CCGCCTATCT CTGAAGAAGA CGATATCCCA GACCTTCCAT CTTTACCTTC CGTTCCGACA TTCAAGCCCG GCCTGCCTCC CCCTCCTGCT CGACCGGACG CATATGCATT CTCACCACCA GCCACTTCCC CGTCTGTTCA AACTCGGGGA CAAACTCCCG TCCCCCGCGG GCCATCACCT CCTAAACCCG ATGTTGGTTC GGGTGTCGGT CTGCATCGAC CGGCTAGTAC GCCCATGCCT AATATTGCTG GTTTAGATCT GATAGATATG CCTGATGGGG ATGTACCTCA AGGAAAAGCC ATGAACGGTG GTGGAAAAAT GGAAGCTCTA AATTCCCCTG AGGCTGTATC TAGGCCTCCC CAATATTCTG ACGCAACATC CCGTCCTACT CCAACCACAC GGCATCCTCT TCCCCAACTT CCTTCTCGCC CTCCCACCTC TGATGCCGTA CAGCCTACTC TCAAACCCAA GGAGAAACCA AAGTTTCCTT TTAGTAATTC TATCACTCCG GATGAACTTC GCGAATACTT TCTCAATCCT TCAGTGGAGA TGTTGTTCTT GGACATAAGG CCGGAAGACG AGTGGAAGAA GGGATATGTA GGGAGAGAGT ATGAGAAGAG AGGTGCGAGG GTGGAGATTG TGTGGCTGGA TCCGACAGTC TTGCTTCGTG AAGGGTAATT GACTTTGTTT TCAGTGAAGA TGACAAGGCT AAAAAATGTA TTTTAGTATG ACCGCGAGCA AATTAGAAGA CGCACTTTCC CTTTCCCCCG CTGTTCAACG CGAAGCTTTT CAGAATCGAC ACAAATACGA CCTCGTCATA GTCTACGACA CTCGTTCTCC CGTATGGCCC AAGGACGGTC CTTTAAACCG AATTTGGGAT ATATTCTTTA TGGGCCACGA CGAGAAGCGT TTGCAGAGAA ATCCCGTCAT TTTGGTAGGA GGTTATGAAA AGTGGAGAGA GTTTATCAAG ATGCGAGCGG CTAGGCATGC ACATACGGCA AAGGGGAAGG ATGTGAGGGA GGGGGTGAAT GGGTATGCGG TGATGCGATC GGATGTTGTG TCCCCTGCGC CTTCCGAGAT TAGTGTTAAG AATGCGAACA GGGAAGCACC GGTCTACCAG GCATCACAGT ATGCGAAGAG TATCGCTGAA AACGTGAGCT ATTACATATA TATCCACGTG TGCCATGTGT TAATGTCTTG ACGTCTTTAC AAGTTTGGCG CTGGACCCCA ATCCATGACG GGAGACTCGT ACCGTCCCTC CACACATTCT CACTCCCAAT CGCAACTCTA CACGCCCACA TACAGGCATC ATCACTCCAG AACGGGCTCA ACTTACTCCT CCCACGGCGC TATTGCCGCT CCTCCACAAG CTTCCATTCA CCCCGGACCA GGTGCCAGAC GGCGAAGCGA CTACATTGAA CATACAGGTC AATCGTACTC TGGCTCAGCC TCGACTTCCC CCTCCATACA ATCGACTGGC ACCACACCTC AACCTCAACA GCAATACTAC GCTTCCCCAT CCCTCCCTGC TTCTAACTCC GTGACACCGT CCATATCATC CATGGCCTCT CCTCGGGCAT CGATCGATTA CCCTCAAGCG CACGCATTGG CAAAGGTACC AGTCCCGATG CCTCCCCCAG CCGTGGCGAG ACCGATGGAA CGACATGATG CGTACACCAG CACACATGTG CATGCGCAGA GTTTGGTACC GACTTCTTCC GGTTATGGTC AACTTCCATC ACAAGGGCAA GTGACGAGGA GTCAGGCGAT GAGGCGCATG GACACTGTGC CTGTTGGTGG GAAGGATAAA GTGGGGTATT GGCGGGATGT GGTGTTGGGT ATTACCGGGT TGAAGAACCT TGGAAAGTAA GTTTCACATT TCCTTGAAAA TGATAATTGT AAAAAGATTC TGACTGGGAT TAGTACTTGT TATATGAACT CGACAATACA GTGTCTCAGT GCGACATACC CATTCTCTAC GTTTTTCCTT GGTGAGTGGC CCATATGCCT AACCGCCGAG ATATCAGACT AATATGTATG GATGTCCTTT TAGATGGAAC GTTTGCACGT TCAATAAACA AAGAAAATCC CTTGGGGACG AAAGGCGAGT TGGCCAAAGC CTGGGCAGAA TTATTGAGAG TATTGTGGAG TGAGAAATAT GAGTTCTTAT CACCTATGAC TTTCCGGGTA AGTTTACCTT GTTTTTGTTC GTGGTTTTTT GGCTAAAATC AAGACCATGT ATTAGAAACA AATCACTCAC TTTGCTCCCC AATTCCTCGG TTCTGACCAA CATGACTCTC AAGAATTCTT GTCATTCGTC CTTGACGGCT TACATGAGGA CCTTAACCGA GTCAAGCGCA AACCCCCCCC TGTGGAGATG ACTCCCGAAA GAGAGGCGAT GCTGGAGAGC GCCCCACCAG AAGTGGCTTC TGAGAGAGAA TGGGCAATCT ACCGGCAGAG GAATGATAGT TTGATTGTGG ATCTGTTCCA GGGGCAGTAT AGGAATAGGT TGGAGTGTTT GACCTGTCAC AAGGTGAGTA TAACTTGCCC CAAGTCCATC ATTGAAAGCT ACTGACGATT ATTGGACCAG ACATCGACAA CTTATGATGC GTTCATGTAC ATGTCTCTGC CCGTTCCATC GGGTAAAACA AAGGTGGTTA TACAAGAACT CATTGACGAA TTCGTCAAGG CCGAAGTGAT GGAAAAGGAG AACGCCTGGT ACGCTTCATC TCTCCCTCTG ACACCCTCTC TGCACTGGGC TGACAGTATT CTAGGTATTG TCCTCGCTGC AAAACCAACC GTCGTGCTTC CAAGACACTA ACCATCGCTC GTCTTCCACC TGTACTGCTC ATCCAACTCA AACGATTCAC AACGCGAGAC GGTCTCTTCT GGGACAAGTC CGAGACACCG GTCATCTTCC CCATCAGAGG TTTAGATCTT ACACGGTACT TGCCTGGACC TGCAGGTTCG TCGATAGGAA GTGGAAGGCA GGTGGGGCCG GATGGAACGT TTGATCCAAG GGCTCAAGTG GGGCCGTTCA AGTATGATTT GTATGGTGTA AGTAATCATA TGGGGACGCT CAGTTCAGGT CATTGTGAGT TTAGCTTTTC CATATTTTTG GGGGGTCTCT GGTCTGGCAA GAAAAACGGG TGGCTGATTC GAGGTTCAGA TACGGCTTTT GTGAAGAGTA AAGAAGGATG GAAGTATTGT GAGGATAGTC AGGTGATGCC TGCACAGGAG AAGGACGTAA TTGTGAGCCT TTGTTCCCTA TAATACAAAT TTCTTTCGTC GTTGTCTTTA AATTAACCAT TGCTGTTTCT CCGACTCATA CAGTCCCGAC CAGCATATAT CTTGTAAGCA CCTATCCTTC CTTGATGCTT ACTATGGACC GTCAACTGAC CGAGTCTGTA GGTTCTATAA GCGAGTACCT GGATAGGAGA CTTTCTAATA GGTTTTGCGA TACAAAGTGC AAAGAAAGAG ATGAAAAGGT GGCGGTGGTA TTAAGATGTC AGATTAATTC GTAGACTCCC AATCCCAAGA AAAACGAGTG TTGTACTATC AATTTATCAA GTTACAAAAT TTTTCATCCT CCTTGTGCAC AAGTTGACAG AACGTATATA TATACTC
|
Protein sequence | MTRPPSPPSF TGLPLSQLLQ YASDDNGAGS HPPKVWFERA CYNADKAKLA ERKQSKEDMF VSYSRACQSY INVAMHNDWP ETKKKDPQLA ARVKDFKPMY DSFVAKAKAL KEELRHAEAS SPAQSKPESS KPSVSRQRSG PEITSIGNIR DRMQALAGHG MEVETIQSKR LSREAPAKAP KPAALSSVTN SAARSRSGSE SRPLSTTTTN GRQLVVTASS KPPPTASQEK SPPSAPVNVQ ANGSSSRSRR STLTSEGQGM SVSVGSVAAS PAQSPMPTPA AGPYRSPLPS IPSSPQPLPA SHPPLPSPEP PRPNVPINSS DMESHPEDGL AEFERAFPSL SEFGKQWDGD SLQPDTNSND MYHAPQYPKP PKQPPISEED DIPDLPSLPS VPTFKPGLPP PPARPDAYAF SPPATSPSVQ TRGQTPVPRG PSPPKPDVGS GVGLHRPAST PMPNIAGLDL IDMPDGDVPQ GKAMNGGGKM EALNSPEAVS RPPQYSDATS RPTPTTRHPL PQLPSRPPTS DAVQPTLKPK EKPKFPFSNS ITPDELREYF LNPSVEMLFL DIRPEDEWKK GYVGREYEKR GARVEIVWLD PTVLLREGMT ASKLEDALSL SPAVQREAFQ NRHKYDLVIV YDTRSPVWPK DGPLNRIWDI FFMGHDEKRL QRNPVILVGG YEKWREFIKM RAARHAHTAK GKDVREGVNG YAVMRSDVVS PAPSEISVKN ANREAPVYQA SQYAKSIAEN FGAGPQSMTG DSYRPSTHSH SQSQLYTPTY RHHHSRTGST YSSHGAIAAP PQASIHPGPG ARRRSDYIEH TGQSYSGSAS TSPSIQSTGT TPQPQQQYYA SPSLPASNSV TPSISSMASP RASIDYPQAH ALAKVPVPMP PPAVARPMER HDAYTSTHVH AQSLVPTSSG YGQLPSQGQV TRSQAMRRMD TVPVGGKDKV GYWRDVVLGI TGLKNLGNTC YMNSTIQCLS ATYPFSTFFL DGTFARSINK ENPLGTKGEL AKAWAELLRV LWSEKYEFLS PMTFRKQITH FAPQFLGSDQ HDSQEFLSFV LDGLHEDLNR VKRKPPPVEM TPEREAMLES APPEVASERE WAIYRQRNDS LIVDLFQGQY RNRLECLTCH KTSTTYDAFM YMSLPVPSGK TKVVIQELID EFVKAEVMEK ENAWYCPRCK TNRRASKTLT IARLPPVLLI QLKRFTTRDG LFWDKSETPV IFPIRGLDLT RYLPGPAGSS IGSGRQVGPD GTFDPRAQVG PFKYDLYGVS NHMGTLSSGH YTAFVKSKEG WKYCEDSQVM PAQEKDVISR PAYILFYKRV PG
|
| |