Gene CNA04350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA04350 
Symbol 
ID3253356 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp1167772 
End bp1169904 
Gene Length2133 bp 
Protein Length576 aa 
Translation table 
GC content49% 
IMG OID638252755 
Productconserved hypothetical protein 
Protein accessionXP_566789 
Protein GI58258753 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5533] Ubiquitin C-terminal hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACCG CCTCCCCCTT CCTCTTCCAG TCGTCCCTGC ACGACACGGG GCTTATTCAC 
GAAATGCTTT CCAACCCGCT CAAATTCGGG GCTCCTGTTA ACAAAAAGAG TTTGGGCTTC
GAGGCGGGTA TGAAGGAGGT TGTTTCAGAG CCAGAGTCTC CCAATCTTGT CAAAAGACAG
CTGAAAAACG TAAATGATGA GGTGAAGTAC GACACGGAAG GAGAACTGGC GGCGAAGGAT
AAGTCGCAGA AACAGTTTAA TGGCAATGCG AAGAACCCCC AAAGGGTGAA TCCGGAGTTC
ATAAATCAAT CAGTAACTCC CCTTTCGCCA GCGAAATCCC AAATCCCTGA TACAAATGAG
GGCGACAATA CTCAAGGTCT CTTCCCGTCC ACCTTCGACC TTTCTTGGCC AGAAGCCATT
GCCACCGCTA AGCGTGCAGC TGGACTGCAT AATCCTTCGA TGGCATGCTA TGCCAATGCC
ACTTTGCAGG TCCTGCTGCA TACGCCGCCC GTCCTGAGAA TCGCTTTGAC ACACGATGAG
GGAAGCTGTG GGTGCTCAAT GGTTCATAAC TTCTCCAGTC CTTTGACTGA CTTCATCTGT
AGGCTCACAA ATTAAAAAGA AGAATTTCTG CATGTTATGT TCTCTCAAGC ACATGGCTGA
AGGATCGCAC TGGTCTGGTC GAAAGGCTTA CGCCCCAGGA ATCCACAGAA GCTTGTCGCG
TAAGTCGTCG GAGTCCAACT TGTTTTATCA CCGACTTAAC TTTAATAGAA ATCAAGAAGG
GCTTCAGCAA GAACAGGCAG GAAGACACCC ATGAGTTCTT CCGGTTTGTC ACCGACGCCC
TGCAGAACAC TGCATTGGCC AAGCTTCCTA AGTGTGTCCT TCCTTACAAT CATCGGAAGT
TTGCTCATAC AACATCCAGG GATACTCCTG AAAAGATCAA GCACACCTCT TGGGTTTACC
GAATTTGGGG TGGCCGAGTG CGCTCACGTG TTGTTTGTTC ACGATGTAAC AACCCGTCAG
ACACCTTTGA TTCCTTCTTG GATTTGAGTT TGGATGTGAA CAAGCAGGGC AAGAAAAGCG
TGCTTGGGAT GTTGGCTGGC TTCACCAAGG AAGACAGACT CGAGGGAGAC AACAAGTATC
ATTGTGAAAG GTGAGTTTCA TTTGTTGTGT TCAAAAGAGC GATAGCTTAC TTCAAAAAGG
TGCAAACGTA AAGCCAATGC CACGAAGAGC TTCAAAATTG ACCAAGCACC TCCCATCTTG
ACTCTTCACT TGAAACGGTT CAGTGTCAAC TACAATCCTT ACAGTGGCCG AGCTCGAGCA
GAAAAATTTA ATCAGCCCAT CAAATTTGAA CAAACTCTTG ATATCGCGCC CTATATGGTT
GACCCTGCGT CTCCCGGTAC CAAGTACAGA TTGTTCGGTG TCACCTGCCA TCGTGGTACT
GAGCTTCGTT TTGGTCATTA CACTTCCTAT GTCCGAGGTC CTTCCGGTCA ATGGTTCCAT
GCCGATGACG ATGAAGTGTC TCCTGTCCAG TTGGAGCAAG TCTTGAACGA CAAGACGGCT
TATCTGTTAA GTTACATCCG CGTGGACAAT GGGAACGAGG GGCTGTGTGA ATCGCCTGCA
GTTAGGGACA GAGTGAAAGG CTTGGTCAAC GGGAGTGCAA AGGGTATGAG AGATGACGAG
TCGGAGAGTC AATCAGAGGC TGAGTCAAGC TCGCACAAGT CATCGTCACC GATCAAGCGT
AAATCCACTT ATGACCCTGA AGACCCACCG CGCATGAAAA TTGGCGCCTT TGTCAACAAC
AAGGCCTACG CACCTTCAAC AAACAAATCT GAGTCGCCAT TCTCAGACGG AGAGAACAAA
ATGCCCCCCG AACTCCCCAA ATTCGGATAT AAACCTAAAC CCACCATTCG CGCCCCTGCT
CCCGTGGAAG CTTCATCTTT CTACACTTCC CCTGTCGCTC GACCATCCAA TTCATTGGCA
GGTATGAGTA AGAAGGAAAA GAAGAAGTTC AAGCATAAGG AAAAGGGAAA GCCTAGACAT
AGCGCTACGC CAATGCCCTT CGCCCAAGGA AGGGTGGGTA ATGGTAGAAA CAGGCAGCCA
GGTGTTCTTT CGAGGATGAA GGGCAGAGCG TAA
 
Protein sequence
MTTASPFLFQ SSLHDTGLIH EMLSNPLKFG APVNKKSLGF EAGLFPSTFD LSWPEAIATA 
KRAAGLHNPS MACYANATLQ VLLHTPPVLR IALTHDEGSC SQIKKKNFCM LCSLKHMAEG
SHWSGRKAYA PGIHRSLSQI KKGFSKNRQE DTHEFFRFVT DALQNTALAK LPKCVLPYNH
RKFAHTTSRD TPEKIKHTSW VYRIWGGRVR SRVVCSRCNN PSDTFDSFLD LSLDVNKQGK
KSVLGMLAGF TKEDRLEGDN KYHCERCKRK ANATKSFKID QAPPILTLHL KRFSVNYNPY
SGRARAEKFN QPIKFEQTLD IAPYMVDPAS PGTKYRLFGV TCHRGTELRF GHYTSYVRGP
SGQWFHADDD EVSPVQLEQV LNDKTAYLLS YIRVDNGNEG LCESPAVRDR VKGLVNGSAK
GMRDDESESQ SEAESSSHKS SSPIKRKSTY DPEDPPRMKI GAFVNNKAYA PSTNKSESPF
SDGENKMPPE LPKFGYKPKP TIRAPAPVEA SSFYTSPVAR PSNSLAGMSK KEKKKFKHKE
KGKPRHSATP MPFAQGRVGN GRNRQPGVLS RMKGRA