Gene CND05520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCND05520 
Symbol 
ID3257096 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006686 
Strand
Start bp1493971 
End bp1497223 
Gene Length3253 bp 
Protein Length925 aa 
Translation table 
GC content53% 
IMG OID638256490 
Producttranscription factor, putative 
Protein accessionXP_570545 
Protein GI58266778 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.21086 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTCTTGCCAG CTTCCATCTT GCCAGCTCGC CAAAGACGCG TAAGCTTCCC CGCAAGACAC 
CGCCATGGAG CCGCCCAGGT AAGCCGTCTT CCGTCGCCGC CCTCCCGCAT CTCATCCCAT
TGTGCAATAG CAACCCCATC CAGCCCCCCG TCACCCCGTC CCACCACTCT CTCCTTTCAG
CCATCTCCCC GGCTCTGTCA GAACAGACAC CAGCTCCTAT CCACACCCTC CCTCCCCATC
TTCGGCCCTC CATTCCGCAA CCCCATATCG CTCCTCCACG CCCCAGCTCT GTACAGCCAA
CAATGGAGGA GCAACAGAGA ATGCATCACA TCCAACAGCA TCAGCAGCAA CAACATTTCC
AACAGCAACA GAATGATGAG AATGTTTTTG GCTCAGTGAT GGGGGCACCA GGCCATGTTC
CGGGACATGA GGCTCCGATG AGTACCCAGC CTAAAGTCTA TGCAAGTGTC TATTCTGGAG
TACGTCTTTC CATCCAACCT CTAAACAAAA GAAAGTTACT CATGGTGACC GCAGGTTCCC
GTATTTGAAG CCATGATTCG AGGTATCTCT GTGATGAGGC GTGCTTCCGA CTCGCAAGTC
TTCCTCTTCC ACACATCGCG TCATTGGTCC CGTCTAACGT GATGTCTTGC AGATGGGTCA
ACGCGACACA AATTCTCAAA GTTGCCGGCG TGCACAAGTC CGCTCGAACC AAGATACTGG
AAAAGGAAGT GCTCAACGGC ATTCATGAAA AAATTCAAGG CGGATGTGCG TGCAATTGTA
TCCGCTCTCG TTAATCTACG CTAAAACGTT CACTGTAGAC GGAAAATACC AAGGAACCTG
GGTTCCACTT GACCGTGGGC GGGATCTTGC AGAACAATAC GGTGTCGGAA GCTACCTGTC
TTCTGTCTTT GACTTTGTTC CTTCCGCGTC CGTCATTGCT GCCCTCCCCG TGATTCGCAC
AGGTACTCCT GACCGTTCTG GACAACAAAC TCCTTCCGGA TTGCCAGGTC ACCCTAATCA
GCGAGTCATC TCTCCCTTTG CTAATCACGG CCAAACGACT CCCCATATGC CTCCTCCTCA
ATTCATACAT CAAGGTAACG AGCAAATGAT GAACCTTCCT CCCCACCCCT CCTCCTTGGC
TTACCCTACA CAGCCTAAAC CTTACTTCTC CATGCCTCTT CAGCATACTG TCGGTCCACA
GTATGATGAA AGACATGAAG GTATGACCAT GACACCTACC ATGAGCATGG ACGGCTTGGC
CCCTCCGGCT GATATTGCCC GCATGGGTTT CCCATACAAC CCATCCGACA TTTATATTGA
CCAATACGGC CAGCCACATG CCACCTACCA AGCTTCGCCT TATGGGAAGG AAAGTGGCCA
TCCATCTAAG CGTCAGAGAT CAGATGCCGA GGGCAGCTAT ATCGAGAGCG GTGCCGCTGT
CCAACAACAT GTTGAACAAG ATGAAGAAGC CGACGATGGT TTGGACAATG ACTCTACCGC
GTCGGACGAC GCCCGCGACC CTCCCCCGCT CCCAAGTTCG ATGCTTCTTC CCCATAAACC
GATCCGACCC AAGGCTACTC CAGCCAACGG CCGTATCAAG AGCAGGCTCG TCCAGATATT
TAACGTGGAA GGTCAAGTTA ATCTCCGAAG CGTCTTTGGA TTGGCACCAG ATCAGCTACC
CAATTTTGAC ATTGACATGG TAATCGACGA CCAAGGTCAC TCTGCCTTGC ATTGGGCTTG
TGCCCTCGCC AGACTGTCCA TCGTGCAACA GCTCATCGAA CTTGGTGCCG ATATCCATCG
AGGCAACTAC GCCGGAGAGA CCCCCCTTAT TCGCGCTGTC CTTACTTCCA ACCACGCCGA
AGCTGGCTCC TTTACTGATC TTTTGCACCT CCTTTCCCCG TCGATTCGCA CGCTTGACCA
TGCCTACCGC ACGGTTCTGC ACCATATTGC GCTGGTCGCT GGTGTCAAGG GCCGAGTACC
TGCTGCGAGG ACTTATATGG CCAGTGTTCT CGAGTGGGTC GCCAGGGAAC AACAGGCCAA
TAACACGCAT AGTATCACAA ACCCTCCCAA CCCTGCTGAT CGCAATGAGC TGGCACCGAT
CAACCTTCGT ACTCTTGTGG ACGTTCAAGA CGTACATGGT GATACTGCTT TGAATGTCGC
CGCACGAGTG GGTAACAAGG GACTGGTGGG TTTGCTATTG GATGCTGGTG CGGACAAGAC
ACGGGCCAAC AAACTGGGAC TCAGGCCGGA AAACTTTGGC TTGGAGATTG AGGCTCTCAA
GATCTCGAAT GGCGAGGCTG TCATGGCAAA CCTCAAATCA GAAGTGTCCA AGCCCGAGAG
GAAGAGCCGC GACGTGCAGA AAAGTGAGCA TTATTCATTT ACTCAGTCTT ATGAGACGTA
CTAATCATGT CCCGCACAGA CATTGCGACC ATCTTTGAAT CCATATCCTC CACCTTTTCG
AGTGAAATGC TCGCCAAACA AACGAAATTG AATGCCACCG AAGCTTCTGT CCGCCATGCC
ACTCGCGCGC TTGCGGACAA ACGGCAACAC CTTCACCGCG CTCAAGAGAA ACTCGCTACG
ATGCAACTGT TTGAGCAACG TTCTGAAAAC GTGCGGCGTA TCATGGACGC CATCGCCGCC
GGCACGCTGT TGACGCCTGC AGAGTTTACT GGCCGAACGC AGACGATGCA CGAAAAATCC
ACGGGCCAAC TGCCTCCTCT TGCATTCCGG CATGTTCCAG GCTTGGCACT CGACGCGTCC
TCGCAATCCC AGCTGAACGG CGCGCCCCCA TCCACACCGC TTTCCGTCGA GGACCAAGAG
GACATTGCTT TGCCTGAGCG AGACGATCCA GAATGTCTGG TAAAACTCAG ACGTATGGCT
CTGTGGGAAG ATCGGATTGC AGAAGTGTTG GAAGACAAGA TTAGGGCAAT GGAGGGGGAA
GGTGTGGATA GGGCGGTCAA GTATCGCAAG TTGGTTAGTG TGTGCGCCAA GGTTCCTGTG
GATAAAGTAG ACTCTGTAAG TTTCTGTTTC CTTCGCCGCT GTATATGTGA GATGGCTAAA
ACGGATGGGA ACAGATGTTG GACGGGCTAG TCGCTGCTGT GGAGAGTGAA GGGCAAGGGC
TGGATTTCTC TAGAGCCAGC AATTTTGTGA ACCGGATAAA AGCGACGAAA TCATAAGACT
TGTTGTCAAG AACGACTACA TGTTTTGTTT TTGTTTTTTC TTGTCGTTTT GAATTTCTTT
GATCTTCTAA TGT
 
Protein sequence
MEPPSNPIQP PVTPSHHSLL SAISPALSEQ TPAPIHTLPP HLRPSIPQPH IAPPRPSSVQ 
PTMEEQQRMH HIQQHQQQQH FQQQQNDENV FGSVMGAPGH VPGHEAPMST QPKVYASVYS
GVPVFEAMIR GISVMRRASD SWVNATQILK VAGVHKSART KILEKEVLNG IHEKIQGGYG
KYQGTWVPLD RGRDLAEQYG VGSYLSSVFD FVPSASVIAA LPVIRTGTPD RSGQQTPSGL
PGHPNQRVIS PFANHGQTTP HMPPPQFIHQ GNEQMMNLPP HPSSLAYPTQ PKPYFSMPLQ
HTVGPQYDER HEGMTMTPTM SMDGLAPPAD IARMGFPYNP SDIYIDQYGQ PHATYQASPY
GKESGHPSKR QRSDAEGSYI ESGAAVQQHV EQDEEADDGL DNDSTASDDA RDPPPLPSSM
LLPHKPIRPK ATPANGRIKS RLVQIFNVEG QVNLRSVFGL APDQLPNFDI DMVIDDQGHS
ALHWACALAR LSIVQQLIEL GADIHRGNYA GETPLIRAVL TSNHAEAGSF TDLLHLLSPS
IRTLDHAYRT VLHHIALVAG VKGRVPAART YMASVLEWVA REQQANNTHS ITNPPNPADR
NELAPINLRT LVDVQDVHGD TALNVAARVG NKGLVGLLLD AGADKTRANK LGLRPENFGL
EIEALKISNG EAVMANLKSE VSKPERKSRD VQKNIATIFE SISSTFSSEM LAKQTKLNAT
EASVRHATRA LADKRQHLHR AQEKLATMQL FEQRSENVRR IMDAIAAGTL LTPAEFTGRT
QTMHEKSTGQ LPPLAFRHVP GLALDASSQS QLNGAPPSTP LSVEDQEDIA LPERDDPECL
VKLRRMALWE DRIAEVLEDK IRAMEGEGVD RAVKYRKLVS VCAKVPVDKV DSMLDGLVAA
VESEGQGLDF SRASNFVNRI KATKS