Gene Information |
Locus tag | CNF03800 |
Symbol | |
ID | 3258405 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | + |
Start bp | 1112095 |
End bp | 1114531 |
Gene Length | 2437 bp |
Protein Length | 599 aa |
Translation table | |
GC content | 49% |
IMG OID | 638257499 |
Product | ubiquitin specific protease, putative |
Protein accession | XP_571674 |
Protein GI | 58269036 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5533] Ubiquitin C-terminal hydrolase |
TIGRFAM ID | |
|
|
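The Gene Length field agrees with the Start bp and End bp fields when the coordinates are read as inclusive and 1-based on the replicon, so the length is End - Start + 1. The sketch below checks that arithmetic. The function name `gene_length` is a hypothetical helper for illustration and is not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive, 1-based coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates of CNF03800 as listed in this record.
length = gene_length(1112095, 1114531)
print(length)  # 2437, matching the Gene Length field
```

A 599 aa protein needs 1,800 bp of coding sequence once the stop codon is counted, so roughly 637 bp of the 2,437 bp span must lie outside the coding exons. That fits the "Is gene spliced | Yes" field, but the record does not say how those bases divide between introns and untranslated flanks.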
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCCTGCTGCT CATAACCTGG ATCTGTGACT ACCCTCAGTA ACAGTCGCAG CTTGGTTATA CATCGAGACG GCATATAAAT CCCACCTGGC GCGATGGCGA CCCCTCAATC CCGCATATTT GACAGGACTG ACACATCTAT ATGCCCTCAT TTGTCCGCTC TTCTCAGCAT ACCGAGCGCC TCTTCAAGAA ACCCTGGCAC ATCAGGAGCC AAAGGGAACA CTCTAGGGTT TCCACCGGGA TCCAAAGGAG CTGAAATTGA GAAAAGGTTC GTGGACGTTG TTAAATGGGG AGCTTTACCG CAAGGTGTCA AGCGACGGAA GGTACGTTTG AGCTGGATAT GTCTTTTGCC GGACATTGGC TTATAATTCC ATAGACCATG TCTCCGGGGT GTCATACTTG CAAAACTCCT CTTTCCAGGC CATGGGCGTG CCTTACTTGC CCATACGTTG GCTGTATGCC GCTTGTGGGC AAAGGAGCTA ATGAGAAGGA TTGTATGAAG AGACATTGGA AGAGCAGCGG AAGGAAATGT GCTTTTGGTA CGTCTCAACT ATCCACCTGT AGCGTATCCT GCTGATAACA TAATGGTAGC TGTTGATCCC TCTACTGGTA CCATATTTTG TGAAGCTTGT GGGGATACAA CATATCCTGA TACTTTTGAA TCACTTTTCC TTACTACTCG AATCCGCGTT GAAGAGTCAA ACGATCATTC ACGCGAACCA GGTTTGGTTG GTGGCAAGGG AAGGGGGAGA GGCGAGTGGA AATCGTGGAA CCCGAACAAT ATTGCGGCGC TCAATGAGAG AGAGGTGGTG AGGACAAGTT GTCGTGGTGA GTCAAATTTC TCATCTTTTA TGGGATCAGC TAAAAGCTTA TGCGCGTCTA GGTCTACGGC CCCTTCTTAA TTTATCTCAA ACATGTTTCC TCTCGGCCGT CCTTCAAGCA CTTGTTCATA ATCCGCTTCT CAAAGCATAC TTCCTCTCAG ACAAACACAA TCGACATGTG TGCACAAACG GTGGCAAAGG CCTTTTGGTC GGGAAGCCGT TCTTAGGTGT AGAGAACGGG CCAGGTGCAG TGGGAAGCGA TAGAGAGAGG GGATGCATGT GCTGCGAGAT GGATAAGGCT TTTGAAGAGG TAATGAGTGC TGTTCGTTTT CACACGAGGG AGACGCTGAC TTCTGGCTCA GTTCTATAAT GAGGACAAGT CGCCTTTTGG ACCTATCACA ATGCTCTACG CCATGTGGCA CGCGAGCACA GAGCTCGAGG GTTACGGTCA GCAAGGTAAA TTGTTTGGCT TTGCCAAGTG ATACTACAGC TGACTCTACA CAGATGCCCA TTCTTTCTTT CTTGCTGCGC TGGACCAAAT CCATGCTCAT GCCAAGGGTC AGCTATCCAG CTGTAACTGC ATTGCCCGTA AATATGCGTT ATTTCGTACA GTTGATGATA CTAACCCTCA CCCAAAGATC AAACCTTTGC GGGCTCCCTC CAATCTTCCG TTATCTGCTC TAAATGCTCC AAGACCTCCA ACACTGTCGA TCCAATTCTT GACATCCAGC TCGACTTCCC ACCTCCGTCT GTTCCTTCAT CAGCATCCTC ATCCTCCGAC TCATCGGCTT TTGGCCCGTC CACCAATGGG CAAGCGGATC AACTAACATT AGCGGGCATG TTACGCAAGT TTTGTGCGCC AGAGCGTGTT GGAGATCCTG GAGGGAACGG ATACGAGTGT TCTGGATGTG GGGGCGGCGT GGGCGTAGTG GCCATGAGAA AATTGGGAGT GAAAAAGCTT GCTCCAGTGT TGTCATTCCA 
ACTCAAGGTA TGATTCTCTT TCTGGGTTAC ATATACGCCA ATACTAAAAT ATGGCCAATA GCGTTTTGCC CATTCATCCG CCACTACGTC CGTCAAGATC GAATCCCATG TCCGGTTCCC ATCCACCCTC GATATGCGTC CTTACGTAGA CTCTTCCTCA TCTTCTAAAA GTGGCAATGA CAGAAAGGAG AAAGAATTAC CAGACTCGCT GTACATATAC GATCTGTTCG CAGTTGTCAC TCATGAGGGC AAGCTGGACA ATGGGCATTA TTGGGCGGAT GTGAGGGACG GCGAGGAGTG GTGGCATTGT GATGATGATA AGGGTGAGTC TAAAATGGGC ACCAAGTCGT TGATAGTCGT TGCTGATGTT TCTGCGAATG TCAGTCACTC CTACATCTCT CTCTGCTGTA TTGGCGCAGA GAGCGTACAT GCTTTTTTAC GTCAAACGAT CCATAGCCTA TGCCCAGCCA ATGTCGAGGT TGTTGGCTGG CGGCAGCACT GGTACCAACG GTGCTTAACA GCCTTGGCGT CTGCATACAC CTAGAATCTA CATAACAATC TTTTCACTCT CATTTATTAT CCCGACTGAC TCAATTTTCT GTTGGTACCA TATCTCAACT GTTGACTTTC TTCAAAA
|
Protein sequence | MATPQSRIFD RTDTSICPHL SALLSIPSAS SRNPGTSGAK GNTLGFPPGS KGAEIEKRFV DVVKWGALPQ GVKRRKTMSP GCHTCKTPLS RPWACLTCPY VGCMPLVGKG ANEKDCMKRH WKSSGRKCAF AVDPSTGTIF CEACGDTTYP DTFESLFLTT RIRVEESNDH SREPGLVGGK GRGRGEWKSW NPNNIAALNE REVVRTSCRG LRPLLNLSQT CFLSAVLQAL VHNPLLKAYF LSDKHNRHVC TNGGKGLLVG KPFLGVENGP GAVGSDRERG CMCCEMDKAF EEFYNEDKSP FGPITMLYAM WHASTELEGY GQQDAHSFFL AALDQIHAHA KGQLSSCNCI AHQTFAGSLQ SSVICSKCSK TSNTVDPILD IQLDFPPPSV PSSASSSSDS SAFGPSTNGQ ADQLTLAGML RKFCAPERVG DPGGNGYECS GCGGGVGVVA MRKLGVKKLA PVLSFQLKRF AHSSATTSVK IESHVRFPST LDMRPYVDSS SSSKSGNDRK EKELPDSLYI YDLFAVVTHE GKLDNGHYWA DVRDGEEWWH CDDDKVTPTS LSAVLAQRAY MLFYVKRSIA YAQPMSRLLA GGSTGTNGA
|
| |
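The sequences above are printed in space-separated blocks of ten residues, so the whitespace has to be removed before any length or composition check. The sketch below shows one way to do that and to compute a GC percentage for comparison with the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first three blocks of the gene sequence are used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the residues."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence from this record (sample input only).
first_blocks = "GCCTGCTGCT CATAACCTGG ATCTGTGACT"
seq = clean_sequence(first_blocks)
print(len(seq), round(gc_percent(seq), 1))
```

Running the same two functions on the full gene sequence should give a length equal to the Gene Length field and a GC percentage close to the listed 49%, assuming the record rounds to the nearest whole percent.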