Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNN00200 |
Symbol | |
ID | 3255349 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006683 |
Strand | - |
Start bp | 84012 |
End bp | 86149 |
Gene Length | 2138 bp |
Protein Length | 607 aa |
Translation table | |
GC content | 49% |
IMG OID | 638254435 |
Product | nucleolus protein, putative |
Protein accession | XP_568528 |
Protein GI | 58262236 |
COG category | [A] RNA processing and modification [D] Cell cycle control, cell division, chromosome partitioning [K] Transcription |
COG ID | [COG5147] Myb superfamily proteins, including transcription factors and mRNA splicing factors |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGGCA TCGAGCTCGG CATCCAGGAC CTCGACCACA GCTATCAGAT TGACACCCAA GCCCTGTCAT CGGACGATGA AGAAGATCGT CAACAAGAGC CCGAACGACC AACCAAGCGA AAACGCACTA AAGAAGAAAA GGAGAGGAGA AGGTCTGAGA AGAGAGCGAG AAAGGAGAAA AGAAGGTCGG GAGCCGCCCA GACTGCACAG GTTCAGGCTC AAGCACAAGC TGGAATTGAA GCAGAAGTGG CCGAAGCACC CACCGACTAT GATGAAGTTG AACAAGTAGT GGAGGATGCT CCTGTGAGCG AGGGAGACAA GAAGCGAAAG GAGAAGAAGC ACAAGAAAGA CAAAAGCAAG GGCAAAGGTA AAGGAAAGAA GAAGGAAGGG AGTGAAGAGC AAGAAGAAGA AGACAGGGAG GCTATAGCTG CATCAGCTGT AGCTACCCTC GCCCAGGCAT TGGTGAGCTC GGAAAGTGGC AAGGTTGCCC CGTCTGTGGA TAAAGTTCAA ACTCCAAGTC GCCCGACTGC TGCCACTTCT ACCATCGCTA CCTCACGAAT AGGTCCTACC CAGTACGGCA GCATCAAAAT CAAGAAAGGC CCCTTGGAAT CATCACCAGC CCCACCCGCA TCGTCACTCA CGCCCGCCCT TCCCGCGACC CAGCTCATCC CTGTGACTCC ATTCACGGCC GCCTCGACCG ATGCGAGTTC ATTTGCTGTT AGGGACAAGA TCAATTCTCT CAAGCACCCA AAGTCTTCTA GTAATGCCTC CAAAGCTATA AGAAGCACGA GGAAGGGCGA GGATACCAAG GAAAGTGATG CGCAGCTTCG ATTGAGGTTC CAGGACCCCA AGGCACAGGA AGAGTGGTTG GCTAGTACAT CAATTGGGAA GACTGAGCTT CTGAGATTGG AAAAGGAGGG TAGTAAGTGT AGTTTCTAGT TTAAAATATA CAGATGGCTA ATTCCACGTT CTAGTTCTGT CGTACAAGAA GGGAAAGTTC ACTGAGGACG AGAAGGTTTC AATCAAAAAG GCTTTGGAGA ATTATCAAAA GATACATCGA ATAAGCTCTT TCGATCTTGT TGAGCTCGTC ATGACCAAAA CACTTCAAGC CACGGATAAA GAAACTGTCC GTGAATTTTG GAAAGATATC GGTATGACCA TTTATTTCAT TCTGTAAAAC ACTATTGATT GTGAAACAGC CGCTTCTGTC CCCGGTCGCC CGATCCTCAA CGTCCAACCA TTCGTGCGAC GAATGCTCGA CCCTAAAGCT CATAAAGGCC GCTGGACCCC GGAAGAAGAC GAACTCCTCC TTCGCGCATA CGCACAACAC CCTCGCGAAT GGACCAAAAT CTCCTCCATC GTTGACCGTA CCGAGGTGGA TTGTAGGGAT CGTTATTTGA AGGAACTCGT GAATCGTGAT ACCCGAACAG CGGGTAGGTG GACAAAAGAT GAGGAGGACA AGTTGGAAGA GGTGGTGAAC AGGGTTGCGA AGGGATTGCG TGCGGAACAG GTGCATGGGG AGAAGAGGAA GGGTCTGGAA GAAGGAGCAG AGCTGGTGGA ACCATCGGAC GTCCCTTGGG ATATTGTTTC GAAAGAGATG GGCAACACAC GATCAATGAC ACAGTGTCGT ATCAAGTATC GCGATGCCAT CTGGCCCAGA AAACTGGGTT TGGGTAAAGA TGATCATGTC GGAAGGACGT TGAAGGTCCT CACAAGGTAT TTTTTTCTCT CGTTCTTTTT TTCAAGCTTG TTCATTCCTG ATGTGACCAC TTAGACTCAA AAACTTGAAC TATGAGTCCG AGAAGCACAT CTCTTGGTCA CAAGTCCGTG AAACCCTCGA GAAATACTCC CTCAAGGAAA TCAGAAATTC GTATACCAAT CTCAAAAAGA GTGTAATGAG CGATCCCCAT GTTGCCAGTC TCAATTACCC CGGTTTGTCA AACGTCCCAT ACTTTCCCAG ACGCGAAGAC GACTTGAACT GATGATGCTT GCTGGATCCA GAATTGATCA ATGTCATGTA CGATAAAGCG GTCATGCAAA GGGGGAGGAA AGTGAGGGCG GATCAGAGGG ATTATCCGAG TAAGGAGACG GTGGAGTCGG GGGATGAAGC GTATTAACGA GGGTGCGCCA AGGAAGAT
|
Protein sequence | MEGIELGIQD LDHSYQIDTQ ALSSDDEEDR QQEPERPTKR KRTKEEKERR RSEKRARKEK RRSGAAQTAQ VQAQAQAGIE AEVAEAPTDY DEVEQVVEDA PVSEGDKKRK EKKHKKDKSK GKGKGKKKEG SEEQEEEDRE AIAASAVATL AQALVSSESG KVAPSVDKVQ TPSRPTAATS TIATSRIGPT QYGSIKIKKG PLESSPAPPA SSLTPALPAT QLIPVTPFTA ASTDASSFAV RDKINSLKHP KSSSNASKAI RSTRKGEDTK ESDAQLRLRF QDPKAQEEWL ASTSIGKTEL LRLEKEGILS YKKGKFTEDE KVSIKKALEN YQKIHRISSF DLVELVMTKT LQATDKETVR EFWKDIAASV PGRPILNVQP FVRRMLDPKA HKGRWTPEED ELLLRAYAQH PREWTKISSI VDRTEVDCRD RYLKELVNRD TRTAGRWTKD EEDKLEEVVN RVAKGLRAEQ VHGEKRKGLE EGAELVEPSD VPWDIVSKEM GNTRSMTQCR IKYRDAIWPR KLGLGKDDHV GRTLKVLTRL KNLNYESEKH ISWSQVRETL EKYSLKEIRN SYTNLKKSVM SDPHVASLNY PGLSNVPYFP RREDDLN
|
| |