Gene CNI04230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNI04230 
Symbol 
ID3259777 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006694 
Strand
Start bp1121846 
End bp1122944 
Gene Length1099 bp 
Protein Length329 aa 
Translation table 
GC content51% 
IMG OID638258918 
Productspliceosomal zinc finger-containing protein, putative 
Protein accessionXP_572920 
Protein GI58271528 
COG category[R] General function prediction only 
COG ID[COG5152] Uncharacterized conserved protein, contains RING and CCCH-type Zn-fingers 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGAAG CGCCTGCACC AGTAGTCACA TTCAAAAAAG GCCCTTCTCG CCGTCCCGCC 
CAATCTCGCC AACGTCGTCG CTCTCCATCG CCTCTCGACC CTGTCGCTGA AGCATCCGCA
TCCGCTTCCG GCTCCAATGT CGTTCGACCG GAGAGAAAAT CTCTCGCTAA CCCTCTCGTC
CAAGGCACAA AGCGTCGGAG AACAAATGCC AATAATGAAG AGGAAGAGGA TGGTGTGGGA
GGCGGATTGG ATGAGTTTGA TTATGCTGCC GAAGGAGGAC TGACGAGGAA AGGGGATGAG
CTTGCAACGA GGGCAAATGA TTGGGATTTG GAGGATGTAG ATGGACAAGG GCAAAGGGAT
AAGAAAGTCA GGCTAGATGA GGTGAGTCAT GCTCTGAATT TAGTTTATTT ACGGTGAGAT
TCTGACAAGC CTCAAAGGAC GGCGAGATCG TGACAGATGA CGGCCTGTAT CGAGGTGCAT
CCGCCTACTT ACCGACAATA AACAAGACCC GCGAAACACT CGACAAGAAG ATGAAATCCG
GTCCTATCAA AGCTACCTCC CACGTACGCA CAATCACCCT CATGGACTAC CAGCCCGACG
TCTGCAAAGA TTATAAAGAG ACCGGTTTCT GTGGATATGG CGATTCTTGT AAATTCTTGC
ATGATAGAGG AGATTATCTG GCGGGCTGGC AGCTGGATAA GTTGCCGGAA GAAGGAGTGA
GAGAGGTAGA GGAGGAGGAT GAGGAAGAGG AAGTACCGTT TGCGTGTTTA ATCTGTAGAC
AACCGTTTAC ACAGCCGGTG GTTACCAAAT GCGGGCATTA CTTCTGCATG GGGTAAGTAT
TCATTCGCTC ACTTTCATTT TTCCCGGCTG ACAAATATGT TCTAGGTGCG CTGCGAAACG
ATTCCAAAAA TCACCCAAGT GCTACGCCTG CGGTGCCCCG ACGCAGGGTA TATTCAACAT
CGCCGATAAA GTAATTGCCA AAATCGAAGC TCGTAACAAG GCAAGGCGAG AGGCGAGAGA
GGAACGGGCA GAGCAAACGG GTGGTGGCGG GATTGAGATT GGTGGTGGGT CTGATGAAGA
GGGTAGCGAT GAGGAGTAA
 
Protein sequence
MSEAPAPVVT FKKGPSRRPA QSRQRRRSPS PLDPVAEASA SASGSNVVRP ERKSLANPLV 
QGTKRRRTNA NNEEEEDGVG GGLDEFDYAA EGGLTRKGDE LATRANDWDL EDVDGQGQRD
KKVRLDEDGE IVTDDGLYRG ASAYLPTINK TRETLDKKMK SGPIKATSHV RTITLMDYQP
DVCKDYKETG FCGYGDSCKF LHDRGDYLAG WQLDKLPEEG VREVEEEDEE EEVPFACLIC
RQPFTQPVVT KCGHYFCMGC AAKRFQKSPK CYACGAPTQG IFNIADKVIA KIEARNKARR
EAREERAEQT GGGGIEIGGG SDEEGSDEE