Gene CNF03800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF03800 
Symbol 
ID3258405 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp1112095 
End bp1114531 
Gene Length2437 bp 
Protein Length599 aa 
Translation table 
GC content49% 
IMG OID638257499 
Productubiquitin specific protease, putative 
Protein accessionXP_571674 
Protein GI58269036 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5533] Ubiquitin C-terminal hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GCCTGCTGCT CATAACCTGG ATCTGTGACT ACCCTCAGTA ACAGTCGCAG CTTGGTTATA 
CATCGAGACG GCATATAAAT CCCACCTGGC GCGATGGCGA CCCCTCAATC CCGCATATTT
GACAGGACTG ACACATCTAT ATGCCCTCAT TTGTCCGCTC TTCTCAGCAT ACCGAGCGCC
TCTTCAAGAA ACCCTGGCAC ATCAGGAGCC AAAGGGAACA CTCTAGGGTT TCCACCGGGA
TCCAAAGGAG CTGAAATTGA GAAAAGGTTC GTGGACGTTG TTAAATGGGG AGCTTTACCG
CAAGGTGTCA AGCGACGGAA GGTACGTTTG AGCTGGATAT GTCTTTTGCC GGACATTGGC
TTATAATTCC ATAGACCATG TCTCCGGGGT GTCATACTTG CAAAACTCCT CTTTCCAGGC
CATGGGCGTG CCTTACTTGC CCATACGTTG GCTGTATGCC GCTTGTGGGC AAAGGAGCTA
ATGAGAAGGA TTGTATGAAG AGACATTGGA AGAGCAGCGG AAGGAAATGT GCTTTTGGTA
CGTCTCAACT ATCCACCTGT AGCGTATCCT GCTGATAACA TAATGGTAGC TGTTGATCCC
TCTACTGGTA CCATATTTTG TGAAGCTTGT GGGGATACAA CATATCCTGA TACTTTTGAA
TCACTTTTCC TTACTACTCG AATCCGCGTT GAAGAGTCAA ACGATCATTC ACGCGAACCA
GGTTTGGTTG GTGGCAAGGG AAGGGGGAGA GGCGAGTGGA AATCGTGGAA CCCGAACAAT
ATTGCGGCGC TCAATGAGAG AGAGGTGGTG AGGACAAGTT GTCGTGGTGA GTCAAATTTC
TCATCTTTTA TGGGATCAGC TAAAAGCTTA TGCGCGTCTA GGTCTACGGC CCCTTCTTAA
TTTATCTCAA ACATGTTTCC TCTCGGCCGT CCTTCAAGCA CTTGTTCATA ATCCGCTTCT
CAAAGCATAC TTCCTCTCAG ACAAACACAA TCGACATGTG TGCACAAACG GTGGCAAAGG
CCTTTTGGTC GGGAAGCCGT TCTTAGGTGT AGAGAACGGG CCAGGTGCAG TGGGAAGCGA
TAGAGAGAGG GGATGCATGT GCTGCGAGAT GGATAAGGCT TTTGAAGAGG TAATGAGTGC
TGTTCGTTTT CACACGAGGG AGACGCTGAC TTCTGGCTCA GTTCTATAAT GAGGACAAGT
CGCCTTTTGG ACCTATCACA ATGCTCTACG CCATGTGGCA CGCGAGCACA GAGCTCGAGG
GTTACGGTCA GCAAGGTAAA TTGTTTGGCT TTGCCAAGTG ATACTACAGC TGACTCTACA
CAGATGCCCA TTCTTTCTTT CTTGCTGCGC TGGACCAAAT CCATGCTCAT GCCAAGGGTC
AGCTATCCAG CTGTAACTGC ATTGCCCGTA AATATGCGTT ATTTCGTACA GTTGATGATA
CTAACCCTCA CCCAAAGATC AAACCTTTGC GGGCTCCCTC CAATCTTCCG TTATCTGCTC
TAAATGCTCC AAGACCTCCA ACACTGTCGA TCCAATTCTT GACATCCAGC TCGACTTCCC
ACCTCCGTCT GTTCCTTCAT CAGCATCCTC ATCCTCCGAC TCATCGGCTT TTGGCCCGTC
CACCAATGGG CAAGCGGATC AACTAACATT AGCGGGCATG TTACGCAAGT TTTGTGCGCC
AGAGCGTGTT GGAGATCCTG GAGGGAACGG ATACGAGTGT TCTGGATGTG GGGGCGGCGT
GGGCGTAGTG GCCATGAGAA AATTGGGAGT GAAAAAGCTT GCTCCAGTGT TGTCATTCCA
ACTCAAGGTA TGATTCTCTT TCTGGGTTAC ATATACGCCA ATACTAAAAT ATGGCCAATA
GCGTTTTGCC CATTCATCCG CCACTACGTC CGTCAAGATC GAATCCCATG TCCGGTTCCC
ATCCACCCTC GATATGCGTC CTTACGTAGA CTCTTCCTCA TCTTCTAAAA GTGGCAATGA
CAGAAAGGAG AAAGAATTAC CAGACTCGCT GTACATATAC GATCTGTTCG CAGTTGTCAC
TCATGAGGGC AAGCTGGACA ATGGGCATTA TTGGGCGGAT GTGAGGGACG GCGAGGAGTG
GTGGCATTGT GATGATGATA AGGGTGAGTC TAAAATGGGC ACCAAGTCGT TGATAGTCGT
TGCTGATGTT TCTGCGAATG TCAGTCACTC CTACATCTCT CTCTGCTGTA TTGGCGCAGA
GAGCGTACAT GCTTTTTTAC GTCAAACGAT CCATAGCCTA TGCCCAGCCA ATGTCGAGGT
TGTTGGCTGG CGGCAGCACT GGTACCAACG GTGCTTAACA GCCTTGGCGT CTGCATACAC
CTAGAATCTA CATAACAATC TTTTCACTCT CATTTATTAT CCCGACTGAC TCAATTTTCT
GTTGGTACCA TATCTCAACT GTTGACTTTC TTCAAAA
 
Protein sequence
MATPQSRIFD RTDTSICPHL SALLSIPSAS SRNPGTSGAK GNTLGFPPGS KGAEIEKRFV 
DVVKWGALPQ GVKRRKTMSP GCHTCKTPLS RPWACLTCPY VGCMPLVGKG ANEKDCMKRH
WKSSGRKCAF AVDPSTGTIF CEACGDTTYP DTFESLFLTT RIRVEESNDH SREPGLVGGK
GRGRGEWKSW NPNNIAALNE REVVRTSCRG LRPLLNLSQT CFLSAVLQAL VHNPLLKAYF
LSDKHNRHVC TNGGKGLLVG KPFLGVENGP GAVGSDRERG CMCCEMDKAF EEFYNEDKSP
FGPITMLYAM WHASTELEGY GQQDAHSFFL AALDQIHAHA KGQLSSCNCI AHQTFAGSLQ
SSVICSKCSK TSNTVDPILD IQLDFPPPSV PSSASSSSDS SAFGPSTNGQ ADQLTLAGML
RKFCAPERVG DPGGNGYECS GCGGGVGVVA MRKLGVKKLA PVLSFQLKRF AHSSATTSVK
IESHVRFPST LDMRPYVDSS SSSKSGNDRK EKELPDSLYI YDLFAVVTHE GKLDNGHYWA
DVRDGEEWWH CDDDKVTPTS LSAVLAQRAY MLFYVKRSIA YAQPMSRLLA GGSTGTNGA