Gene CNI03470 details

Gene Information

Locus tag: CNI03470
Symbol:
ID: 3259734
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006694
Strand:
Start bp: 938737
End bp: 940699
Gene Length: 1963 bp
Protein Length: 460 aa
Translation table:
GC content: 48%
IMG OID: 638258841
Product: Machado-Joseph disease protein 1 (Ataxin-3), putative
Protein accession: XP_572615
Protein GI: 58270918
COG category:
COG ID:
TIGRFAM ID:
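
The Start bp, End bp, and Gene Length fields are mutually consistent if the coordinates are read as 1-based and inclusive. That reading is an assumption, not something the record states. A minimal sketch of the check, where `gene_length` is a hypothetical helper name:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(938737, 940699))  # 1963
```

Because the gene is marked as spliced, the 1963 bp span includes introns, so it is expected to be longer than three times the 460 aa protein length.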


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.396092
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCCAAAGAAT AGGCTGGTTT CACATCTTAT ACCCAGGGTT AGGCAACAGA AGCTACATTA 
TCCAGTAACT ATCGCCGTGA TAGAAATAAA ACCTTCCCTT AGGATTTGAC GAGATGGATC
TCGTTCCGTG TAAGTTTCTC GGCCTTCCAC ATCGTTAGTC TTTGAACAGA TATTAACTTC
GCCCCGTAGA CATGTATTAT GAGAAGCAAG AGGCTGGATC TCAGCTCTGT AAGCACCACT
CTAGTCCGAA AATGACCTGA CTAAGCTGAT CTCCCCTGAT AGGTGCTCAA CACTGCCTGT
ATGTCAAGTT GTCATGGCGG AATCAGCGCA TGGCCGCCCA TATAGCAGAC TGATGTATAC
TTCCGTAGGA ATAACCTTCT CCAGCAGTAT ACCTACTCTG AGTTTGACTT GGCTGATATC
GCGAAGAGGT AAACATCTGC ATGACTGTTG TATCACATTA CACTAACGCA TTCCCAGACT
TGACCAAGCT GAGAACGCTA CTCTCGATGT TAACCATCAG CTTAGAAAGT CTTACAACTA
CGATGATACT GGTTATTTCT CCATTTCAGT ATTAGAACGC GCATTGGAAG TTTGGGACCT
AACCATGGTT AGGTGGAGAG GTGAAGCCAT GAAGCCATAT CAAGACCATC CAGAGTAAGC
TGCATTTTAC GATCAACAGC ACAATCCGTT GACCAAACAT GTAACAGAGA TCAAGCCGCT
TTTATTCTTA ATCTCGCATC TCACTGGTTT ACCCTCCGTC GCTTCGCCCC CAATCCTCCT
CATGCTGCTG CCTCCAAAAG GTGGTACAAC CTCAACTCCT TCCTCGCTGA CGGCCCCGAA
TGGATCTCCC CTACCTACCT TCACATGGTG CTGACACAGG CCGAGCAAGA AGGCTACTCC
GTATTTGTCA TCAGAAAGGC AACACCGGGG ACCAAAGAAG GCGAAGAAGC AGGTGAAGCG
GAAGGATGGG GAGATGGCGG CATCGGTCAG TTGCCGGAAT CTCTGGGTGA TGTGATGGCC
GTGGAGCTAG GCGAGCCAGT GGGAAGGTCT GGTGGACTGT TAAGTGGGAC GATTGGACCC
ACAAAAGAAT CCAAAACTAC TCAAATGTCT ATACCCGCCG ACCCGAATAC TACCACTGTA
GATGCTGAAC CTTCGTCACC GTCCCGACCT CCTCGTCGAC GCCGTCAGCC GGATCTATCG
TCAGACCCAA CCGAGATCGT CGACGACCCT TACGCTCGAC CCGCTCCTTC TCGCTCACGA
CAATCATCTT CACGTTCCAA CCCTGCCCAA CATCAAGTTA TCGGAGATGA TGAAGGCGAC
ATCTTTGATC AGACTCATGG TATTCCATCA CATTATGATG AGACACCTGA CGATGATAAT
GCTGACGACG ATTTTGAAAT GAGCCGTTCT CGGGCGTACG CTGGTACGAT GGACTTCCAG
TTTCAAAGTC GGAGTTATGA TGACGAGGAT GAGGCTCTCC AAGCAGCTCT GAAAGCTAGT
ATGGCAGACC TGCCCGAAGG ATGGGAGATG CCTGATATCT TGAAACCAGA AAGCGAGAGA
CAGGCATTTA CTACTACTAC CAGCATTATG ACTACTACTA CTACTACTCC ACCGGTAGCG
CCGGAAGCGC AAAGGGAAGA ATCTCCTGTG GCGACATCAG TAGCTGAGGA AAAGGACATT
GTAGAGTCAA ACGAAGTAGA GGACGATAGC GATGACGTTC AACCAGCCGA GGAACCATCC
CCTGGTAAGT CTCTGTCATT TCATTATCTA CAGATGACAA AGCTAACATC ATTGCAGAGG
AGATTCGACG AAGGCGTCTT GCTCGCTTCG GTTAGTCTTG TCCCAGGCTT CCAGTTAATT
TTTCTTTCCA TCAGTTATTC AAGATTATGA TGGGGAATTT TCACCTTTGG AAGATGCGGA
CTAATGCGCG ATAGAGTCAA GAGGAATATG CATATTGTAC AAA
 
Protein sequence
MDLVPYMYYE KQEAGSQLCA QHCLNNLLQQ YTYSEFDLAD IAKRLDQAEN ATLDVNHQLR 
KSYNYDDTGY FSISVLERAL EVWDLTMVRW RGEAMKPYQD HPEDQAAFIL NLASHWFTLR
RFAPNPPHAA ASKRWYNLNS FLADGPEWIS PTYLHMVLTQ AEQEGYSVFV IRKATPGTKE
GEEAGEAEGW GDGGIGQLPE SLGDVMAVEL GEPVGRSGGL LSGTIGPTKE SKTTQMSIPA
DPNTTTVDAE PSSPSRPPRR RRQPDLSSDP TEIVDDPYAR PAPSRSRQSS SRSNPAQHQV
IGDDEGDIFD QTHGIPSHYD ETPDDDNADD DFEMSRSRAY AGTMDFQFQS RSYDDEDEAL
QAALKASMAD LPEGWEMPDI LKPESERQAF TTTTSIMTTT TTTPPVAPEA QREESPVATS
VAEEKDIVES NEVEDDSDDV QPAEEPSPEE IRRRRLARFG
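
The sequences above are printed in space-separated blocks of ten symbols, so they need to be joined before their length or GC content can be compared with the reported fields. The following is a small sketch under that assumption. `normalize` and `gc_percent` are hypothetical helper names, and the short `example` string is only the first three blocks of the gene sequence.

```python
def normalize(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already normalized DNA sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three blocks of the gene sequence, used only as a short illustration.
example = "CCCAAAGAAT AGGCTGGTTT\nCACATCTTAT"
seq = normalize(example)
print(len(seq), gc_percent(seq))
```

Passing the full gene sequence to `normalize` gives a length that can be checked against the Gene Length field (1963 bp), and `gc_percent` gives a value to compare with the reported GC content of 48%. The same `normalize` call applied to the protein listing can be checked against the Protein Length field (460 aa).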