Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNK01590 |
Symbol | |
ID | 3254565 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | + |
Start bp | 468080 |
End bp | 471120 |
Gene Length | 3041 bp |
Protein Length | 838 aa |
Translation table | |
GC content | 50% |
IMG OID | 638253648 |
Product | conserved hypothetical protein |
Protein accession | XP_567833 |
Protein GI | 58260846 |
COG category | [A] RNA processing and modification [D] Cell cycle control, cell division, chromosome partitioning [K] Transcription |
COG ID | [COG5147] Myb superfamily proteins, including transcription factors and mRNA splicing factors |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.196806 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCTCTTTCTT CCATCCGGAA AGGATTGTTC AGACCGAAGC ACACTGCGTA CTGCACCACA AGTCTCAAGG AAATAGCACT GCCAAAATCT TAGATTAGGA CGGGGGATTG GAGAAGGTAT ACTCTCACAT TACAAGACCT TAAATTATTG TCATTTCATA ATGGTCAGTG TACATTGTGT TTCGTTGTAT AGATGAAAGC TAAATAGTTC ACAACAGCGA GTTATTGTCA AGGGCGGTGT TTGGAGAAAC ACCGAGGACG AAATTCTCAA GGCCGCCATT TCAAAATACG GTAAGAATGT GAGTATAACC CGCACATATA TGGGAATGAA GCTGACGAGC TTGGTTTCAC AGCAATGGGC TCGTATCTCA TCACTTTTGG TTCGAAAAAC CCCCAAGCAA TGTAAAGCGA GGTGGTACGA ATGGTTAGAC CCTTCAATCA AGAAGGTGGA ATGGTCAAAG GCAAGCGCAT TCGGTGTTAT GCAATGGCTA ACGATGGTTT CAGACTGAAG ACGAAAAGCT TCTTCATCTT GCCAAGCTCA TGCCTACTCA ATGGCGAACC ATTGCTCCTA TCGTCGGTCG TACAGCTACG CAGTGTCTTG AGCGATACCA GAAGCTCTTG GATGACGCGG AGGCGAGGGA CAACGAAGAG CTGGGATTAG GAGCAGGAGA GGATGAATCG AGCAAACCAG CTACCGATGC TAGGGGTCTC AGGCCGGGAG AAATTGACAC AGACCCTGAG ACAAGACCCG CGAGGCCTGA CCCCATCGAT ATGGATGATG ACGGTACGTG TATCATAATG TTTGGTTGCT CTGTGCGATT CATTGATATC CTATTAGAGA AGGAAATGTT GTCGGAAGCG CGAGCGAGAT TGGCCAACAC ACAAGGCAAA AAAGCCAAAC GCAAGGCCCG AGAAAGGCAA TTGGAAGAGG CCAGGCGATT AGCTTTTCTA CAAAAGAAGC GTGAATTAAA AGCCGCTGGT ATTAACCTTC GTGCCAAGCC GAAGAAGAAG GGTATGGATT ACAACGCCGA CATACCCTTT GAAAAGCAGC CTGCTCCTGG TTTCTACGAC GTCACGGAGG AGCAGGCCAA GGTTCACGCT GCTCCTGTCG GTTCAACACT TCGGGCTCTC GAGGGAAAAC GCAAGCAGGA GTTGGACGAA ATCGAGGAAA GGAAGAAGCG ACAGAAGAAG GGAGATGGCA AGTCTAACCA AACGCAGCAA TTTGTGGCGG CCCGAGAAGC GCAGATCAAG AAACTTAAGG AGCAAGAACA GATTATTAGG AGGAGGAAGC TAAATTTGCC GATACCTCAG GTTGGAGAGC GAGAACTGGA AGATATTGTC AAGATCGGGC AAGCAGGAGA ATTAGCAAGG GAGCTGGTTG GTGACGGCAA CAAGGCGACA GAGGGTTTGC TAGGCGAGTA TGAGGCTTTG GGTCAGGCTA AGATGGCGAG GACACCAAGA ACAGCTCCTC AACGTAAGAT CATTATCTGT TTTTCAGTCG TTGTGCGGTG CTAATGTTGA GTCAGAGGAC AATGTTATGG CCGAGGCCCG AAATCTCCGA AATATGATGG CAGCACAAAC TCCTCTTTTG GGAGAAGAGA ACACTCCTCT ACACGGCCCT TCTGTAGGCA CTGGATTCGA AGGGGCCACA CCTCGACACG ATGTTGCCGC GACCCCCAAT CCTCTTGCAA CTTCGGCTCG AGGTGGTGTA CTCACTTCAA CTCGAACAGT CCCTGGTGTT GGTACCACTC CCCTGCGGAC CCCTTTCAGA GATGATTTGA ACATCAACGA CGATGCGTCC GTGTACGGCG AAACTCCCAT GAACGACAGG CGCCGCCTTG CCGAGTCTCG CCGAGCTTTG AAGGCTGGCT TTGCGGCCTT GCCCAAACCT GAAAATAATT TTGAGCTTGC TGAGACAGAA GAGGATGAAG AGGAGGCGGA AGAAGCGGAG CCTCTAACAG AGGAAGATGC TGCCGAGAGG GATGCGAGAT TAAAGGCTGC TAGAGAGGAA GAGGAACGAC GCGAGCTTGA GAGGAGAAGT ACTGTTATAA AGAAGGGTTT GCCTCGACCC GTCAACGTTA ACACATACAA GCTTCTCGAC GATCTCAACT CTGCTATAGT TGAGCAGACC GACGAGGAGA TGGCCGCAGC GTTCAAGCTC GTCAATCTTG AAGTCGCCAT GCTCATGAAG CACGACTCCA TCGCTCACCC TCTGCCTGGA ACTTCTACCC CTGGTGGCCT GGCTTCTGAA TATGATATGC CAGAGGATGA CTTTGTTGCT GAGGCCAAGA ATGCTATCCA CACAGAATTG GCTAACGCAT TGGGCTTGCC GGGTGCGAGC GATGAACATT TACGCTTGGC AATTGGCGCA GCCGCCGAGG AAAACGAAGC TGCCTTTGCA GAAGCGTGGG CCGAGGAACG CGAAGGTCTT GTCTACTCCC CTTCAACTCG AACTTGGGTT GATAAAACCT CTCTTTCCCC AGAGGAGCTA TCCGCATGCT ACGCTGCGAT GATCAACGCT TCTCGAGATC GCGTTATTGC CGAGGCTACC AAAGCCGCCA AAGCAGAGAA GAAGCTCGGT AAGCAGCTGG GTGGTTACCA GACGCTCAAT GAGAAGGCAA AGAAAGCCAT TGTGGACGTC ATGGAGGAGA TTCACCAGAC CAAACGGGAT ATGGAGACAT TCCTTATGCT TAAGGGCATA GAAGAGGCTG CAGCCCCGGC CAGGTTGGAG AAGATTAGGG AAGAGGTTGC TGTTTTGAAG AAGAGAGAGA GAGATCTGCA GGCTAGATAT GCAGAGTTGA ACGACAGGAG GAGGGAGAAC CTCGCAGCTA TTGAACAGGT ACGTCAATGT CACTTCTTGT CGAAATTACT TAGACCAATG ATATTTCACA GCTCGAGGAA GACAAGATCG TTCTCGCTGC TCAAGTGGCA TTGGAAGCTC AAGAAGGAGA GGTTGCAGAT GGTGATGTTG ATATGAACGG GGCTTAGAAG TGCAAACATA TTATTGGTAT ATCAATGCAT TGTTGTCATA TTGGTTGTGT G
|
Protein sequence | MRVIVKGGVW RNTEDEILKA AISKYGKNQW ARISSLLVRK TPKQCKARWY EWLDPSIKKV EWSKTEDEKL LHLAKLMPTQ WRTIAPIVGR TATQCLERYQ KLLDDAEARD NEELGLGAGE DESSKPATDA RGLRPGEIDT DPETRPARPD PIDMDDDEKE MLSEARARLA NTQGKKAKRK ARERQLEEAR RLAFLQKKRE LKAAGINLRA KPKKKGMDYN ADIPFEKQPA PGFYDVTEEQ AKVHAAPVGS TLRALEGKRK QELDEIEERK KRQKKGDGKS NQTQQFVAAR EAQIKKLKEQ EQIIRRRKLN LPIPQVGERE LEDIVKIGQA GELARELVGD GNKATEGLLG EYEALGQAKM ARTPRTAPQQ DNVMAEARNL RNMMAAQTPL LGEENTPLHG PSVGTGFEGA TPRHDVAATP NPLATSARGG VLTSTRTVPG VGTTPLRTPF RDDLNINDDA SVYGETPMND RRRLAESRRA LKAGFAALPK PENNFELAET EEDEEEAEEA EPLTEEDAAE RDARLKAARE EEERRELERR STVIKKGLPR PVNVNTYKLL DDLNSAIVEQ TDEEMAAAFK LVNLEVAMLM KHDSIAHPLP GTSTPGGLAS EYDMPEDDFV AEAKNAIHTE LANALGLPGA SDEHLRLAIG AAAEENEAAF AEAWAEEREG LVYSPSTRTW VDKTSLSPEE LSACYAAMIN ASRDRVIAEA TKAAKAEKKL GKQLGGYQTL NEKAKKAIVD VMEEIHQTKR DMETFLMLKG IEEAAAPARL EKIREEVAVL KKRERDLQAR YAELNDRRRE NLAAIEQLEE DKIVLAAQVA LEAQEGEVAD GDVDMNGA
|
| |