Gene CNK02380 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNK02380 
Symbol 
ID3254670 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006680 
Strand
Start bp686793 
End bp691592 
Gene Length4800 bp 
Protein Length1275 aa 
Translation table 
GC content50% 
IMG OID638253730 
Productnucleus protein, putative 
Protein accessionXP_567711 
Protein GI58260602 
COG category[B] Chromatin structure and dynamics
[K] Transcription
[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat
[COG5076] Transcription factor involved in chromatin remodeling, contains bromodomain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TAATCACAAA GACTATAGAC AGCAACTTAC ATCAGTTAGA CACAATGGGT AAACACGGTA 
AAACAAAGAC TACCAGCAAG GGCGCCCAGA GACCTGCTCC TTATCAAAAG AAGGCCAAGG
TTGATGTGAC CTTTGACACT GGCGTGAAAC CTTCCAAGGC TGAGAATAAG CCCAAAGCTA
CGAACGGAAA AGCGAAGAAG GACAAGGAGA CCTCCAAACC TTCCAAGGCC AAGCAGCCTA
AGGAAAAGAA GGAGGAGACC GCCAAGGTCT CGTCACCAAA GCCTACCAAG TCCGACAAGG
GCAAGGGCAA GTCCATTCTC CCTCCAACTG TTGGACCTTC AACATTCATT GTTATTGCGG
GTTCCTACGA GAAACTCTTA TATGGATTGG AAGGTTCCTA CCCAACTGGA TCTACCGCTC
CCGTCCTCGA GCCTATCTTC ATCTTTCCAG CTCATTTGGC ATGTGTCAAA GCCGTTGCGG
CGAGTCCTGG CGGTAAATGG CTGGCTACTG GAAGTGAAGA CGAGTTTGTG AAAGTTTGGG
ACTTGAGAAG AAGGAAGGAG GTTGGTAGTC TGAGTCAGCA CACAGGTAAG TTCCTCATAT
CCAGATTCAA TATACGATTA GAAGACAGCT GATATGTTTG TAGGATCAAT AACATCGCTT
CATTTCCCTA CCCCTTCTCA CCTTTTGACC ACATCTGTTG ATTCTACTCT ATCGCTCTTC
CGTACTTCTG ATTGGTCGCT TCTCAAGTCA CTCAAGGGTC ACTCCGGAAG GGTCAATCAT
GTTGATGTGC ATCCAACGGG AAGGGTTGCT CTAAGTGTTG GAAAGGACCA AACTTTGAAG
ATGTGGGATT TGATGAGAGG TCGAGGAGCG GCGAGTCTGC CTCTGGGAAG TGGTAAGTCC
AGATGCCTTG AAGGAAATAG AGATATTAAT GGAATGTGTT TAGAGGCGGA GCTCGTCAAG
TTCTCTCAAC AAGGCACCCA TTTTGCTGTT CTCTTCCCCA GAAAGATCCA AATCTACTCG
CTCGTAAGTC ACTGACTATC CTTTGGCGCC CCTTGTCCTG TAACTTACAC CACGTAGACC
TTGAAACTCC TGTACACCCT CGAGACAAAG TCTCGTTTCA ACCACCTTCT TTTCCATCTT
CTCCCTGCCT CTACGGAGGA TGGCGAAGAG ACTGAGGTGC TTTGTATCGG TACTGAGAAA
GGTGTCGTCG AGATCTATCG AATAACATTG GGTGAAGGGG AGGAGTCTGA AGACGAAGAG
GACGATGACG AGGAGATTGA AAAGGAAACC AAGGGCAAGG GGGCCGAATT GGAGAGAATC
GCCACGCTTG TTGGACACAC GAACCGGTAC GCTTACGTTT TGTATTGTTG CAAGGTTTTG
CTAAAAAGTC CTCTAGAGTG AAGGCCATTT CATCACTTCC TTTCCTCGCA CCTACCTCTA
CAGGGGATGT GCGCAAGACA ATCCTCCTCA CAACTGTCTC GTCTGACGGT CTCATCAACG
TCTACGACCT CTGTGCGGCT ACCACCGATA TCGAAAAGGG AGAAGAGAAC AAGGTTGAAC
CCGTTGCATC TTACGATACC AAGGGTACTC GATTGACCTG TGTTTACCTT GCTGATGGAC
AAGATGGAAC CAAGGAGATC AAGGTCAAAG AGGAAGAGAT CAGTGAGAGC GAGGACGAGG
TTGAGGACAT CTATGGAAGC GAACAAGCGA GTGACGATGA TGAAGACGAC GACGGGATGG
AGGTCGAATT CGAGGATGAA GAAGAGGAAA TTGAGGGTGA AGAAGAGTAG GCAATTTGCA
TGCGTACCAG ACAGACCGCG CTACATAGCA GTGCATATAC ACAACTACAT ACATATAGAG
AGAGAGTCGG GAAGGAAAAG CAGTAAGCAA CGGGGTAGTC ATCTGCCACA TGTATGAATA
AGTCGGCGTT TGCGGCGATA TGGAGTAGCG TGGTCATGGT CCTTTGAGTG GCGTGTCCCC
TTGTGCCTTT TTTCGGGTGG GTTATAAATA CCAAACACGC TCAAACGCTC ATCATTTTTC
GCCCCCCTCC ACTTTGTTTT TCAAAAATTC ACAACCAGCG GACGTCGCCG CTTCTTCTTG
CCGTCGCGTC CCGTCCACGT TCTCCACTGC GTCACTCCAC TCGTTTAAGC TGTGAAGGTG
AGCGAGTGAG GATGACCATG CATTGCTGAT ATATTGCTGA CACTGTTGCC CGTGCTGCTG
ACAGTATGTC TGAGGTTACT GAGACGCCCG TTCCTCCTCT TTCCAACGTT ACCGAGCCGC
CCAGGAGTCC TAACCCGCTC AACCCCGCTA CAGACGCGCA GGGCGAGAAC ATCCAACCGC
TAGCCGAGCG TGAGCCGCAG AATGACGTGC CAGGAGTGAC CGCCGCTCCC GATCAGCCAC
CTTTGCCAGT GGATGCACCT CTCCCTCCCT CTCCTGCCGC GAGCGACGAA CCTCCCCCTT
CCTTACCTTC CAGCGCTATT CCCACAGCTG TGCCTACCCC TACCGTGATT ACAGTAGGCG
AGGATAAGCC CGAGGTTCCC GCCGGAGCCC CGTTGTCTAC CGTGACGCAA GGATCAGCGT
CAATGGATCT TGATGAGGCC ACACCTCAAC CAGTCAAACG CGCCGGGGAG GAGCTCGATG
GTGAAAGAGA GGAGAAGAGA TTAAAGGAGG ATGGGCCTGC CGTGGAGGCT GCTGCAGAGG
CTGGAGAGGC TACTGGGGAG GCTATCGCTG TAGCAGTGCA GTCTGACGTT CCTCTTGTTG
GACCGGATGG TCAACCATTG CCTCCTCCGG CCTGGCTTTC CTATGTTCCT CCTGCCCCCA
AGTTCGCCGG TCCCAACACA CCGCTTACGC TCACTCAACA CAAGTACATG CTCAATGCTG
TGCGCTCACT GAAGAAGCGC CTTCCTGATG CGTACAACTT TCTTGTTCCT GTCGACACTG
TTCGATTTAA CATTCCTCAT TACCACACCG TTATCGATAC ACCTATGGAT TTGGGTACGG
TTGAGACCAA GTTGATTGTC AGTGATCCCA GGGGTCCTCC CAAAGACAAG AGTAAGATGA
GCAAGTGGGA TACTAGCAAA GGCAAGTACA ACAATGTTGC CGAGGTGACT GAGGATGTGA
GGAGGATCTG GGAGAATTCA CGCAAGTTTA ATGGGAAAGA GCATCCTGTC AGTCAGATGG
CTACTCGATT GGAGGAGGCT TTTGAGAGGT CTTTAAGCAA CCTGCCAGCT GAAGTAAGTT
GCAAAAAGCC TTGTCTTCCA AGTGAAAATA CATGTTAAAT TTCAATCCAG CCTGTTATCG
CATCTCCTGC CTCTGCTGGT CCCTCTCATG TCCGCCGCTC TTCCATTAGT CAACCTCCTG
TCGTCCGACG AAGTTCCGAT GACACTCGTC CTAAGCGCGA GATTCATCCT CCTCCCCCCA
AAGATCTTGC TTACGAAGAG TCTCCTGGCT CTGCGCGGAA ACCTAAACGT AGGAACGACC
CCCAACTCCA GTGGGCTTCA AGGGCCATCA AGAGTTTGGA GATTTCCAAC AAGTACTATG
TTGCCGTTTC ACCTTTTCTT TACCCCGTCG ACAAAATCAT TGAGGAAGTT CCGGATTATG
CTACTGTCAT TAAACGTCCC ATTGACTTCA ACATTATCAA GAACAAACTC GCCGAGAATA
CATACGAGGA TGTGAATCAG GTGGATGACG ACATTCGTTT GATGGTTGCC AATGCGCAAA
AATTCAACCC GCCTGGCCAC GAAGTGCATA CATCTGCTAC GCAATTACTG CAAATCTGGG
AAGAAAAGTG GAGGACGGTG CCGGCGAAAG TGGAAACTAG AGATTCTAGC GAAGACCCAA
TGGCCGAAGC CTTTGACGAT TACTCGAGCG ATGAAGATAG TAAGCGCTTT CAGATTGATT
GCTGATACGG TGGCTGACTA CTGGGTAGAC GCACAACTGA GATCACTGGA GAGCCAAGTC
ATTGCCTTGA ACCAGCAAAT CTCCGCTCTC CGTTCCAAGA TGACCAAACG TCGTGCTGCT
CGAGGCTCTA AATCCAAGTC AAAACCCAAG ACCGCTCCTC GGAAATCTTC CGTCTCCAAA
CCTTCCCCCA ACATCAACGG CAATAGTCAA CCGAAGAAGT CCAAGAAGGC TCCCAAGGAG
GCCAATCTTA TGTACAGAGA AGATGACGAT GAAAGCGAAG AAGAAGAGGA CATCAGTCAT
CTTTCTCATG CGCAGAAGCA AGAGCTGGCG GAGAAGATTG GGCAGACTGA TGGGGAGACT
TTGAGCAAGG TGATTTCTAT AATTCAGCAG TCCACCAACA TTGGAGGGGT AAGTCAATCC
ATACAGTATC TTTATATTAT CTCAATACTG ACGTCCCCGC AGAGCAACCA AGAGATTGAG
CTGGATATCG ATTCTCTTCC ACCGGCCACT GTTATCAGGT TGTATAACCT TGTTTGTCGA
GGAGGGCGAG GATCAGGTAG CAAGCGCGGA CGTGGAGGTG TCGTGAAGAA GGCCGGTGGT
ACTGGCCGAA AGGCCATGGG TGGTGTGTCT CGACGCAGTG TGAACGAACG GGAAGAAGCG
GAGCGTATTA GGCGCATGGA GCAGCAATTG CAGGCGTTTG AATCTTCTCG CCCTGTACCG
CAGGCGGTTG GCTACGAAGA AGAAGAGTCT AGCTCGGAAG AGGAGAGCAG CGAGGAAGAG
TAAAAGAATA CATGTTACAG CCGGTTGTTT TTTGTTGTTA GAAGTCACGA GATATATAGT
ATATTTGGGA ATGCCGCGTG CGCGATAGGC GATAATAATC GCCAGTTGGG CGGTTGATGT
 
Protein sequence
MGKHGKTKTT SKGAQRPAPY QKKAKVDVTF DTGVKPSKAE NKPKATNGKA KKDKETSKPS 
KAKQPKEKKE ETAKVSSPKP TKSDKGKGKS ILPPTVGPST FIVIAGSYEK LLYGLEGSYP
TGSTAPVLEP IFIFPAHLAC VKAVAASPGG KWLATGSEDE FVKVWDLRRR KEVGSLSQHT
GSITSLHFPT PSHLLTTSVD STLSLFRTSD WSLLKSLKGH SGRVNHVDVH PTGRVALSVG
KDQTLKMWDL MRGRGAASLP LGSEAELVKF SQQGTHFAVL FPRKIQIYSL TLKLLYTLET
KSRFNHLLFH LLPASTEDGE ETEVLCIGTE KGVVEIYRIT LGEGEESEDE EDDDEEIEKE
TKGKGAELER IATLVGHTNR VKAISSLPFL APTSTGDVRK TILLTTVSSD GLINVYDLCA
ATTDIEKGEE NKVEPVASYD TKGTRLTCVY LADGQDGTKE IKVKEEEISE SEDEVEDIYG
SEQASDDDED DDGMEVEFED EEEEIEGEED MSEVTETPVP PLSNVTEPPR SPNPLNPATD
AQGENIQPLA EREPQNDVPG VTAAPDQPPL PVDAPLPPSP AASDEPPPSL PSSAIPTAVP
TPTVITVGED KPEVPAGAPL STVTQGSASM DLDEATPQPV KRAGEELDGE REEKRLKEDG
PAVEAAAEAG EATGEAIAVA VQSDVPLVGP DGQPLPPPAW LSYVPPAPKF AGPNTPLTLT
QHKYMLNAVR SLKKRLPDAY NFLVPVDTVR FNIPHYHTVI DTPMDLGTVE TKLIVSDPRG
PPKDKSKMSK WDTSKGKYNN VAEVTEDVRR IWENSRKFNG KEHPVSQMAT RLEEAFERSL
SNLPAEPVIA SPASAGPSHV RRSSISQPPV VRRSSDDTRP KREIHPPPPK DLAYEESPGS
ARKPKRRNDP QLQWASRAIK SLEISNKYYV AVSPFLYPVD KIIEEVPDYA TVIKRPIDFN
IIKNKLAENT YEDVNQVDDD IRLMVANAQK FNPPGHEVHT SATQLLQIWE EKWRTVPAKV
ETRDSSEDPM AEAFDDYSSD EDNAQLRSLE SQVIALNQQI SALRSKMTKR RAARGSKSKS
KPKTAPRKSS VSKPSPNING NSQPKKSKKA PKEANLMYRE DDDESEEEED ISHLSHAQKQ
ELAEKIGQTD GETLSKVISI IQQSTNIGGS NQEIELDIDS LPPATVIRLY NLVCRGGRGS
GSKRGRGGVV KKAGGTGRKA MGGVSRRSVN EREEAERIRR MEQQLQAFES SRPVPQAVGY
EEEESSSEEE SSEEE