Gene CNI03040 details


Gene Information

Locus tag: CNI03040
Symbol:
ID: 3259541
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006694
Strand:
Start bp: 824498
End bp: 826346
Gene Length: 1849 bp
Protein Length: 471 aa
Translation table:
GC content: 48%
IMG OID: 638258796
Product: expressed protein
Protein accession: XP_573014
Protein GI: 58271716
COG category: [R] General function prediction only
COG ID: [COG5273] Uncharacterized protein containing DHHC-type Zn finger
TIGRFAM ID:
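
The Start bp, End bp, and Gene Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are inclusive at both ends (an assumption, though it is consistent with the listed values); the function name `inclusive_length` is hypothetical and not part of any database API.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive.

    Assumption: both coordinates count as part of the feature.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the Start bp and End bp fields of this record.
gene_length = inclusive_length(824498, 826346)
print(gene_length)  # 1849, matching the Gene Length field
```

The gene length (1849 bp) exceeds three times the protein length (471 aa), which fits a record marked as spliced.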


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GCACAAGCTA CACTTCCATA CATACCAGTG TGTTTTGCCT ATCATTCCAC AGTCACAAGG 
AGATAAGAGA GCCTTCCAGG TGGATATATA TACCTCTTAC GCCAGGGATA TGGCCATCCC
TCTCGTATGG CTGTCTCAGC AGCTCATACC CCCGCTTTTG CTCACCTACT TCTTGTGCGT
CTTTTCGTCC CCTTCCGGGA CTACTGTAGG TTGATTTGTT ACAAGGTGAC TGATTGATGC
TGTCATTTAG TATCTCATGG AAAGTAGCTA CATTCGAGCT GGCTCGTATG TCCTTGTTAC
CATTGCTTAC CCCTTAGCTG AGGACCCAAA CGTCCAATAT TAAAACCATC ATTTAGCATC
CCTATCTCTG GATGTACTGG CATGGAGTCA ACCAGATAGC GGGAATATCA GTACCTTTAC
AAATGGTATA AAGAGGATTA TCGCCTACAA CATTCTTTTC AGCCTCCATA TCCTACCGAC
TCTTCTAATC GGCTTTATTT CTGCTTTATA CCTTCGCCTA TACTTTTTGC CACGCTCGCA
ATCTGTCCCA CCTTATGATC CTCCATTCGA GGTACTAGAC AAGCAAGTCA TGTTTGCTTG
CCTTTTTCCT AGGTCAGCTT CAAGATCAAG GTCGCATTTA GAGCTAGAGT CAGAGTTTGG
CGATGTTTTG TTGATCACCG AGCAAGAACC GATAGTGGAA AGATGCTATA AAGGAAGGTG
TGGAGGTAGA TGGAAGCCTG CCAGGACAAG GCATTGTACG CAATGCGGCG TGTGCAGAGC
TGGCTTTGAC CACCACTGCC CATTTGTAAG TTACCCTATC CACATTTCCT AGCCTAACAA
CTTGACGTCT GATGAACGGT TTACGAAATG TTAGTTTGCA AACTGTCTTA CCGCACCATA
CATCCCCACT TTCCTCGCTG TCCTCCTATA TACGCCCCCC ACCGTCCTCA TTCTCTCCTT
CCCCCTCCTA TCCCCCCTTC TCCACCGCTT TATTGCGGCT TATTCCCAAG CATGTGACTC
TTCCGAGATA ATAGCTTATT GGTGGAACTG GAAATGGAGC TGGGTCGTCG CAGGTGGCCC
TGTAGGCAGG TTTGCAGGCG GTATCATTCT CGGGTGGAGA GAGCTTGATA GACAAGATGG
TGGAGGGCTA TATAGATTGG CTGTCGGGCT GTTAGTCGCT TTCGGCTTTA TTCTGTCTGG
GATCACCGCG GTGAGTCTCT TGAGAACGAT CTTGCCCATC TTTGGATTAT GGAAGTCTGA
ACTCGAACTG ACAAAATAGC TATCTAAAGA GTCTAGCATA TTCGACTATC CGTATCCTTC
GGGACGGTGA CTTTACCATC GACCGCGAAC GATCTAGCGC CCACAGACGT ATCCTCTCCA
CCATCAAAGG CCTCCCTCGG GAACAACCTA TCCCGGATAA ACTACGACAA AACCTTGCCC
GATTTTCAGA CCGTCCAGCA TTTTACCTGC CTCCCAGGGA TATAGATCGA ATTTGTCCAG
GTCAAGGTCA AGATCGGAAG TGGGATTGCC ATGGGAGGAA AAATCCTAAG GGGTATGTCG
TCCAGCTCAA TGACAAAGCG AGGCCGTATG ATCATGGACC TAGAATGAAT ACGCAATTGG
TCTTGGGAAC ACCATGGGGT GAGGGATGGA GCTGGTTGTT GCCTTGGAGA GCAATCCGGC
CGGGTATAGA GTATGATGGG GGGGAAAGAT GTTTATTCAA CTGGCCGGTG GCGGAAGGGG
TCAGGCAAGA GATAGAAGGG GCTATTGGAT CGGGTATTCT GAATAGACTG TCAGAGGGAG
ACCATCTAAA CTGTAGATGA GATGTTTATG CTGCTGTGAA TAGGACATG
 
Protein sequence
MAIPLVWLSQ QLIPPLLLTY FFISWKVATF ELAPSLSLDV LAWSQPDSGN ISTFTNGIKR 
IIAYNILFSL HILPTLLIGF ISALYLRLYF LPRSQSVPPY DPPFEVLDKQ VMFACLFPRS
ASRSRSHLEL ESEFGDVLLI TEQEPIVERC YKGRCGGRWK PARTRHCTQC GVCRAGFDHH
CPFFANCLTA PYIPTFLAVL LYTPPTVLIL SFPLLSPLLH RFIAAYSQAC DSSEIIAYWW
NWKWSWVVAG GPVGRFAGGI ILGWRELDRQ DGGGLYRLAV GLLVAFGFIL SGITASLAYS
TIRILRDGDF TIDRERSSAH RRILSTIKGL PREQPIPDKL RQNLARFSDR PAFYLPPRDI
DRICPGQGQD RKWDCHGRKN PKGYVVQLND KARPYDHGPR MNTQLVLGTP WGEGWSWLLP
WRAIRPGIEY DGGERCLFNW PVAEGVRQEI EGAIGSGILN RLSEGDHLNC R
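
The sequences above are printed in blocks of 10 characters, 60 per line, so the spacing has to be removed before they can be measured or compared with the Gene Length and GC content fields. A minimal sketch of that clean-up follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as a demonstration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from this record, as a short demonstration.
first_line = "GCACAAGCTA CACTTCCATA CATACCAGTG TGTTTTGCCT ATCATTCCAC AGTCACAAGG "
dna = clean_sequence(first_line)

print(len(dna))  # 60 bases: six blocks of ten
print(f"GC fraction of this line: {gc_fraction(dna):.2f}")
```

Passing the full gene sequence text through `clean_sequence` should give a string whose length can be compared with the Gene Length field, and `gc_fraction` of that string can be compared with the listed GC content.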