Gene Information |
Locus tag | CNK02380 |
Symbol | |
ID | 3254670 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | - |
Start bp | 686793 |
End bp | 691592 |
Gene Length | 4800 bp |
Protein Length | 1275 aa |
Translation table | |
GC content | 50% |
IMG OID | 638253730 |
Product | nucleus protein, putative |
Protein accession | XP_567711 |
Protein GI | 58260602 |
COG category | [B] Chromatin structure and dynamics [K] Transcription [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat [COG5076] Transcription factor involved in chromatin remodeling, contains bromodomain |
TIGRFAM ID | |
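The coordinates and lengths in the record above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it reproduces the stated 4800 bp) and that the coding region ends with a single stop codon. The names `span_length`, `coding_bases`, and `non_coding_bases` are illustrative and do not come from the database.

```python
# Values copied from the Gene Information record above.
start_bp = 686793
end_bp = 691592
protein_length_aa = 1275


def span_length(start: int, end: int) -> int:
    """Length of a feature, ASSUMING 1-based, inclusive coordinates."""
    return end - start + 1


gene_length = span_length(start_bp, end_bp)

# Coding bases implied by the protein: one codon per residue,
# plus one stop codon (assumed, not stated in the record).
coding_bases = (protein_length_aa + 1) * 3

# Bases inside the gene span that are not accounted for by the protein.
non_coding_bases = gene_length - coding_bases

print(gene_length, coding_bases, non_coding_bases)
```

Under these assumptions, the bases left over after translation are consistent with the record marking the gene as spliced, since introns and any untranslated sequence inside the span would account for them.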
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TAATCACAAA GACTATAGAC AGCAACTTAC ATCAGTTAGA CACAATGGGT AAACACGGTA AAACAAAGAC TACCAGCAAG GGCGCCCAGA GACCTGCTCC TTATCAAAAG AAGGCCAAGG TTGATGTGAC CTTTGACACT GGCGTGAAAC CTTCCAAGGC TGAGAATAAG CCCAAAGCTA CGAACGGAAA AGCGAAGAAG GACAAGGAGA CCTCCAAACC TTCCAAGGCC AAGCAGCCTA AGGAAAAGAA GGAGGAGACC GCCAAGGTCT CGTCACCAAA GCCTACCAAG TCCGACAAGG GCAAGGGCAA GTCCATTCTC CCTCCAACTG TTGGACCTTC AACATTCATT GTTATTGCGG GTTCCTACGA GAAACTCTTA TATGGATTGG AAGGTTCCTA CCCAACTGGA TCTACCGCTC CCGTCCTCGA GCCTATCTTC ATCTTTCCAG CTCATTTGGC ATGTGTCAAA GCCGTTGCGG CGAGTCCTGG CGGTAAATGG CTGGCTACTG GAAGTGAAGA CGAGTTTGTG AAAGTTTGGG ACTTGAGAAG AAGGAAGGAG GTTGGTAGTC TGAGTCAGCA CACAGGTAAG TTCCTCATAT CCAGATTCAA TATACGATTA GAAGACAGCT GATATGTTTG TAGGATCAAT AACATCGCTT CATTTCCCTA CCCCTTCTCA CCTTTTGACC ACATCTGTTG ATTCTACTCT ATCGCTCTTC CGTACTTCTG ATTGGTCGCT TCTCAAGTCA CTCAAGGGTC ACTCCGGAAG GGTCAATCAT GTTGATGTGC ATCCAACGGG AAGGGTTGCT CTAAGTGTTG GAAAGGACCA AACTTTGAAG ATGTGGGATT TGATGAGAGG TCGAGGAGCG GCGAGTCTGC CTCTGGGAAG TGGTAAGTCC AGATGCCTTG AAGGAAATAG AGATATTAAT GGAATGTGTT TAGAGGCGGA GCTCGTCAAG TTCTCTCAAC AAGGCACCCA TTTTGCTGTT CTCTTCCCCA GAAAGATCCA AATCTACTCG CTCGTAAGTC ACTGACTATC CTTTGGCGCC CCTTGTCCTG TAACTTACAC CACGTAGACC TTGAAACTCC TGTACACCCT CGAGACAAAG TCTCGTTTCA ACCACCTTCT TTTCCATCTT CTCCCTGCCT CTACGGAGGA TGGCGAAGAG ACTGAGGTGC TTTGTATCGG TACTGAGAAA GGTGTCGTCG AGATCTATCG AATAACATTG GGTGAAGGGG AGGAGTCTGA AGACGAAGAG GACGATGACG AGGAGATTGA AAAGGAAACC AAGGGCAAGG GGGCCGAATT GGAGAGAATC GCCACGCTTG TTGGACACAC GAACCGGTAC GCTTACGTTT TGTATTGTTG CAAGGTTTTG CTAAAAAGTC CTCTAGAGTG AAGGCCATTT CATCACTTCC TTTCCTCGCA CCTACCTCTA CAGGGGATGT GCGCAAGACA ATCCTCCTCA CAACTGTCTC GTCTGACGGT CTCATCAACG TCTACGACCT CTGTGCGGCT ACCACCGATA TCGAAAAGGG AGAAGAGAAC AAGGTTGAAC CCGTTGCATC TTACGATACC AAGGGTACTC GATTGACCTG TGTTTACCTT GCTGATGGAC AAGATGGAAC CAAGGAGATC AAGGTCAAAG AGGAAGAGAT CAGTGAGAGC GAGGACGAGG TTGAGGACAT CTATGGAAGC GAACAAGCGA GTGACGATGA TGAAGACGAC GACGGGATGG AGGTCGAATT CGAGGATGAA GAAGAGGAAA TTGAGGGTGA AGAAGAGTAG GCAATTTGCA 
TGCGTACCAG ACAGACCGCG CTACATAGCA GTGCATATAC ACAACTACAT ACATATAGAG AGAGAGTCGG GAAGGAAAAG CAGTAAGCAA CGGGGTAGTC ATCTGCCACA TGTATGAATA AGTCGGCGTT TGCGGCGATA TGGAGTAGCG TGGTCATGGT CCTTTGAGTG GCGTGTCCCC TTGTGCCTTT TTTCGGGTGG GTTATAAATA CCAAACACGC TCAAACGCTC ATCATTTTTC GCCCCCCTCC ACTTTGTTTT TCAAAAATTC ACAACCAGCG GACGTCGCCG CTTCTTCTTG CCGTCGCGTC CCGTCCACGT TCTCCACTGC GTCACTCCAC TCGTTTAAGC TGTGAAGGTG AGCGAGTGAG GATGACCATG CATTGCTGAT ATATTGCTGA CACTGTTGCC CGTGCTGCTG ACAGTATGTC TGAGGTTACT GAGACGCCCG TTCCTCCTCT TTCCAACGTT ACCGAGCCGC CCAGGAGTCC TAACCCGCTC AACCCCGCTA CAGACGCGCA GGGCGAGAAC ATCCAACCGC TAGCCGAGCG TGAGCCGCAG AATGACGTGC CAGGAGTGAC CGCCGCTCCC GATCAGCCAC CTTTGCCAGT GGATGCACCT CTCCCTCCCT CTCCTGCCGC GAGCGACGAA CCTCCCCCTT CCTTACCTTC CAGCGCTATT CCCACAGCTG TGCCTACCCC TACCGTGATT ACAGTAGGCG AGGATAAGCC CGAGGTTCCC GCCGGAGCCC CGTTGTCTAC CGTGACGCAA GGATCAGCGT CAATGGATCT TGATGAGGCC ACACCTCAAC CAGTCAAACG CGCCGGGGAG GAGCTCGATG GTGAAAGAGA GGAGAAGAGA TTAAAGGAGG ATGGGCCTGC CGTGGAGGCT GCTGCAGAGG CTGGAGAGGC TACTGGGGAG GCTATCGCTG TAGCAGTGCA GTCTGACGTT CCTCTTGTTG GACCGGATGG TCAACCATTG CCTCCTCCGG CCTGGCTTTC CTATGTTCCT CCTGCCCCCA AGTTCGCCGG TCCCAACACA CCGCTTACGC TCACTCAACA CAAGTACATG CTCAATGCTG TGCGCTCACT GAAGAAGCGC CTTCCTGATG CGTACAACTT TCTTGTTCCT GTCGACACTG TTCGATTTAA CATTCCTCAT TACCACACCG TTATCGATAC ACCTATGGAT TTGGGTACGG TTGAGACCAA GTTGATTGTC AGTGATCCCA GGGGTCCTCC CAAAGACAAG AGTAAGATGA GCAAGTGGGA TACTAGCAAA GGCAAGTACA ACAATGTTGC CGAGGTGACT GAGGATGTGA GGAGGATCTG GGAGAATTCA CGCAAGTTTA ATGGGAAAGA GCATCCTGTC AGTCAGATGG CTACTCGATT GGAGGAGGCT TTTGAGAGGT CTTTAAGCAA CCTGCCAGCT GAAGTAAGTT GCAAAAAGCC TTGTCTTCCA AGTGAAAATA CATGTTAAAT TTCAATCCAG CCTGTTATCG CATCTCCTGC CTCTGCTGGT CCCTCTCATG TCCGCCGCTC TTCCATTAGT CAACCTCCTG TCGTCCGACG AAGTTCCGAT GACACTCGTC CTAAGCGCGA GATTCATCCT CCTCCCCCCA AAGATCTTGC TTACGAAGAG TCTCCTGGCT CTGCGCGGAA ACCTAAACGT AGGAACGACC CCCAACTCCA GTGGGCTTCA AGGGCCATCA AGAGTTTGGA GATTTCCAAC AAGTACTATG TTGCCGTTTC ACCTTTTCTT TACCCCGTCG ACAAAATCAT TGAGGAAGTT CCGGATTATG CTACTGTCAT 
TAAACGTCCC ATTGACTTCA ACATTATCAA GAACAAACTC GCCGAGAATA CATACGAGGA TGTGAATCAG GTGGATGACG ACATTCGTTT GATGGTTGCC AATGCGCAAA AATTCAACCC GCCTGGCCAC GAAGTGCATA CATCTGCTAC GCAATTACTG CAAATCTGGG AAGAAAAGTG GAGGACGGTG CCGGCGAAAG TGGAAACTAG AGATTCTAGC GAAGACCCAA TGGCCGAAGC CTTTGACGAT TACTCGAGCG ATGAAGATAG TAAGCGCTTT CAGATTGATT GCTGATACGG TGGCTGACTA CTGGGTAGAC GCACAACTGA GATCACTGGA GAGCCAAGTC ATTGCCTTGA ACCAGCAAAT CTCCGCTCTC CGTTCCAAGA TGACCAAACG TCGTGCTGCT CGAGGCTCTA AATCCAAGTC AAAACCCAAG ACCGCTCCTC GGAAATCTTC CGTCTCCAAA CCTTCCCCCA ACATCAACGG CAATAGTCAA CCGAAGAAGT CCAAGAAGGC TCCCAAGGAG GCCAATCTTA TGTACAGAGA AGATGACGAT GAAAGCGAAG AAGAAGAGGA CATCAGTCAT CTTTCTCATG CGCAGAAGCA AGAGCTGGCG GAGAAGATTG GGCAGACTGA TGGGGAGACT TTGAGCAAGG TGATTTCTAT AATTCAGCAG TCCACCAACA TTGGAGGGGT AAGTCAATCC ATACAGTATC TTTATATTAT CTCAATACTG ACGTCCCCGC AGAGCAACCA AGAGATTGAG CTGGATATCG ATTCTCTTCC ACCGGCCACT GTTATCAGGT TGTATAACCT TGTTTGTCGA GGAGGGCGAG GATCAGGTAG CAAGCGCGGA CGTGGAGGTG TCGTGAAGAA GGCCGGTGGT ACTGGCCGAA AGGCCATGGG TGGTGTGTCT CGACGCAGTG TGAACGAACG GGAAGAAGCG GAGCGTATTA GGCGCATGGA GCAGCAATTG CAGGCGTTTG AATCTTCTCG CCCTGTACCG CAGGCGGTTG GCTACGAAGA AGAAGAGTCT AGCTCGGAAG AGGAGAGCAG CGAGGAAGAG TAAAAGAATA CATGTTACAG CCGGTTGTTT TTTGTTGTTA GAAGTCACGA GATATATAGT ATATTTGGGA ATGCCGCGTG CGCGATAGGC GATAATAATC GCCAGTTGGG CGGTTGATGT
|
Protein sequence | MGKHGKTKTT SKGAQRPAPY QKKAKVDVTF DTGVKPSKAE NKPKATNGKA KKDKETSKPS KAKQPKEKKE ETAKVSSPKP TKSDKGKGKS ILPPTVGPST FIVIAGSYEK LLYGLEGSYP TGSTAPVLEP IFIFPAHLAC VKAVAASPGG KWLATGSEDE FVKVWDLRRR KEVGSLSQHT GSITSLHFPT PSHLLTTSVD STLSLFRTSD WSLLKSLKGH SGRVNHVDVH PTGRVALSVG KDQTLKMWDL MRGRGAASLP LGSEAELVKF SQQGTHFAVL FPRKIQIYSL TLKLLYTLET KSRFNHLLFH LLPASTEDGE ETEVLCIGTE KGVVEIYRIT LGEGEESEDE EDDDEEIEKE TKGKGAELER IATLVGHTNR VKAISSLPFL APTSTGDVRK TILLTTVSSD GLINVYDLCA ATTDIEKGEE NKVEPVASYD TKGTRLTCVY LADGQDGTKE IKVKEEEISE SEDEVEDIYG SEQASDDDED DDGMEVEFED EEEEIEGEED MSEVTETPVP PLSNVTEPPR SPNPLNPATD AQGENIQPLA EREPQNDVPG VTAAPDQPPL PVDAPLPPSP AASDEPPPSL PSSAIPTAVP TPTVITVGED KPEVPAGAPL STVTQGSASM DLDEATPQPV KRAGEELDGE REEKRLKEDG PAVEAAAEAG EATGEAIAVA VQSDVPLVGP DGQPLPPPAW LSYVPPAPKF AGPNTPLTLT QHKYMLNAVR SLKKRLPDAY NFLVPVDTVR FNIPHYHTVI DTPMDLGTVE TKLIVSDPRG PPKDKSKMSK WDTSKGKYNN VAEVTEDVRR IWENSRKFNG KEHPVSQMAT RLEEAFERSL SNLPAEPVIA SPASAGPSHV RRSSISQPPV VRRSSDDTRP KREIHPPPPK DLAYEESPGS ARKPKRRNDP QLQWASRAIK SLEISNKYYV AVSPFLYPVD KIIEEVPDYA TVIKRPIDFN IIKNKLAENT YEDVNQVDDD IRLMVANAQK FNPPGHEVHT SATQLLQIWE EKWRTVPAKV ETRDSSEDPM AEAFDDYSSD EDNAQLRSLE SQVIALNQQI SALRSKMTKR RAARGSKSKS KPKTAPRKSS VSKPSPNING NSQPKKSKKA PKEANLMYRE DDDESEEEED ISHLSHAQKQ ELAEKIGQTD GETLSKVISI IQQSTNIGGS NQEIELDIDS LPPATVIRLY NLVCRGGRGS GSKRGRGGVV KKAGGTGRKA MGGVSRRSVN EREEAERIRR MEQQLQAFES SRPVPQAVGY EEEESSSEEE SSEEE