Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNI03470 |
Symbol | |
ID | 3259734 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006694 |
Strand | - |
Start bp | 938737 |
End bp | 940699 |
Gene Length | 1963 bp |
Protein Length | 460 aa |
Translation table | |
GC content | 48% |
IMG OID | 638258841 |
Product | Machado-Joseph disease protein 1 (Ataxin-3), putative |
Protein accession | XP_572615 |
Protein GI | 58270918 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.396092 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCCAAAGAAT AGGCTGGTTT CACATCTTAT ACCCAGGGTT AGGCAACAGA AGCTACATTA TCCAGTAACT ATCGCCGTGA TAGAAATAAA ACCTTCCCTT AGGATTTGAC GAGATGGATC TCGTTCCGTG TAAGTTTCTC GGCCTTCCAC ATCGTTAGTC TTTGAACAGA TATTAACTTC GCCCCGTAGA CATGTATTAT GAGAAGCAAG AGGCTGGATC TCAGCTCTGT AAGCACCACT CTAGTCCGAA AATGACCTGA CTAAGCTGAT CTCCCCTGAT AGGTGCTCAA CACTGCCTGT ATGTCAAGTT GTCATGGCGG AATCAGCGCA TGGCCGCCCA TATAGCAGAC TGATGTATAC TTCCGTAGGA ATAACCTTCT CCAGCAGTAT ACCTACTCTG AGTTTGACTT GGCTGATATC GCGAAGAGGT AAACATCTGC ATGACTGTTG TATCACATTA CACTAACGCA TTCCCAGACT TGACCAAGCT GAGAACGCTA CTCTCGATGT TAACCATCAG CTTAGAAAGT CTTACAACTA CGATGATACT GGTTATTTCT CCATTTCAGT ATTAGAACGC GCATTGGAAG TTTGGGACCT AACCATGGTT AGGTGGAGAG GTGAAGCCAT GAAGCCATAT CAAGACCATC CAGAGTAAGC TGCATTTTAC GATCAACAGC ACAATCCGTT GACCAAACAT GTAACAGAGA TCAAGCCGCT TTTATTCTTA ATCTCGCATC TCACTGGTTT ACCCTCCGTC GCTTCGCCCC CAATCCTCCT CATGCTGCTG CCTCCAAAAG GTGGTACAAC CTCAACTCCT TCCTCGCTGA CGGCCCCGAA TGGATCTCCC CTACCTACCT TCACATGGTG CTGACACAGG CCGAGCAAGA AGGCTACTCC GTATTTGTCA TCAGAAAGGC AACACCGGGG ACCAAAGAAG GCGAAGAAGC AGGTGAAGCG GAAGGATGGG GAGATGGCGG CATCGGTCAG TTGCCGGAAT CTCTGGGTGA TGTGATGGCC GTGGAGCTAG GCGAGCCAGT GGGAAGGTCT GGTGGACTGT TAAGTGGGAC GATTGGACCC ACAAAAGAAT CCAAAACTAC TCAAATGTCT ATACCCGCCG ACCCGAATAC TACCACTGTA GATGCTGAAC CTTCGTCACC GTCCCGACCT CCTCGTCGAC GCCGTCAGCC GGATCTATCG TCAGACCCAA CCGAGATCGT CGACGACCCT TACGCTCGAC CCGCTCCTTC TCGCTCACGA CAATCATCTT CACGTTCCAA CCCTGCCCAA CATCAAGTTA TCGGAGATGA TGAAGGCGAC ATCTTTGATC AGACTCATGG TATTCCATCA CATTATGATG AGACACCTGA CGATGATAAT GCTGACGACG ATTTTGAAAT GAGCCGTTCT CGGGCGTACG CTGGTACGAT GGACTTCCAG TTTCAAAGTC GGAGTTATGA TGACGAGGAT GAGGCTCTCC AAGCAGCTCT GAAAGCTAGT ATGGCAGACC TGCCCGAAGG ATGGGAGATG CCTGATATCT TGAAACCAGA AAGCGAGAGA CAGGCATTTA CTACTACTAC CAGCATTATG ACTACTACTA CTACTACTCC ACCGGTAGCG CCGGAAGCGC AAAGGGAAGA ATCTCCTGTG GCGACATCAG TAGCTGAGGA AAAGGACATT GTAGAGTCAA ACGAAGTAGA GGACGATAGC GATGACGTTC AACCAGCCGA GGAACCATCC CCTGGTAAGT CTCTGTCATT TCATTATCTA CAGATGACAA AGCTAACATC ATTGCAGAGG AGATTCGACG AAGGCGTCTT GCTCGCTTCG GTTAGTCTTG TCCCAGGCTT CCAGTTAATT TTTCTTTCCA TCAGTTATTC AAGATTATGA TGGGGAATTT TCACCTTTGG AAGATGCGGA CTAATGCGCG ATAGAGTCAA GAGGAATATG CATATTGTAC AAA
|
Protein sequence | MDLVPYMYYE KQEAGSQLCA QHCLNNLLQQ YTYSEFDLAD IAKRLDQAEN ATLDVNHQLR KSYNYDDTGY FSISVLERAL EVWDLTMVRW RGEAMKPYQD HPEDQAAFIL NLASHWFTLR RFAPNPPHAA ASKRWYNLNS FLADGPEWIS PTYLHMVLTQ AEQEGYSVFV IRKATPGTKE GEEAGEAEGW GDGGIGQLPE SLGDVMAVEL GEPVGRSGGL LSGTIGPTKE SKTTQMSIPA DPNTTTVDAE PSSPSRPPRR RRQPDLSSDP TEIVDDPYAR PAPSRSRQSS SRSNPAQHQV IGDDEGDIFD QTHGIPSHYD ETPDDDNADD DFEMSRSRAY AGTMDFQFQS RSYDDEDEAL QAALKASMAD LPEGWEMPDI LKPESERQAF TTTTSIMTTT TTTPPVAPEA QREESPVATS VAEEKDIVES NEVEDDSDDV QPAEEPSPEE IRRRRLARFG
|
| |