Gene CNI02950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNI02950 
Symbol 
ID3259556 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006694 
Strand
Start bp800098 
End bp803626 
Gene Length3529 bp 
Protein Length881 aa 
Translation table 
GC content48% 
IMG OID638258786 
Producttranscription corepressor, putative 
Protein accessionXP_572650 
Protein GI58270988 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.414493 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATTACAGCGA AGTTCGTGAA TACCGTGTAT TTCACTTCTT CCTCTGTAAC TTTCCAAGGA 
AGAGGATAAC TGTGAGAAGG GTATTATGAA GGTTACCAAG CCCAACTGGG TGGAACATAC
CGGTAAGCCG TTGTTTTCAA TGGAGAGTGC TCGCTAATCT TTTTTGGTTG CAGTGGGGGA
GAAGAAAGCC AAAACAGCCA TCTACAGCAT TTCTGTACAT CCAGATGGTA CCCGATTGGC
AACAGGTGGC CTAGGTGCGT GGATTTATTC AATAGTTTGT CTATTCTAAT GTGAGGCCTG
GTTTCAATTG GTAGACCATA AAGTTAAGAT ATGGTCGACG CTGCCTATTC TCGACGTGGA
AGCAGAGAAG GAAGAAGAAA ACCCTAAGCT GTTATGCACG ATGTCTTCCC ATACCGGTTA
ATATCGTTTT TAAACTAGTA CGACAAGGCA ACTTTGCTGA TGGAGACGCC AGGATCTGTG
CTGTCCGTGA GGTGGGCTCA CCACGGAAGA TTCCTAGCGA CAGGATCAGA TGACCAGGTT
ATTATGATAT GGGGTCTTGA TCCGTACGTA GAGCCTTTCT TTCACCGATA GAAAGGAAAA
AAAAAAAAAC GCTGATCAAT GCTTCGGGGC CAGTGATGGA GGTGGTCGAC TTTGGGGTTC
TGACGAAATT AATGTCGAAA ATTGGAAGGC TTTAACGCGA TTGGTTGGGC ATGTAGCAGG
TATGAGACTG TCACATCCTC CTTCTCATTT GGCGTTTGAC ATTAATGTCC TATTTCAGAT
GTTGTGGATC TAGCTTGGTC AAGGGACGAC ACCATGTTGG CATCCGTCGG CCTTGACAGT
ACTGTCTGGA TTTGGGATGG CTTGACATTT GGTAGGACTC TTCTACACTT TTTATTTTGA
TCAAGCTGAA GGATCATTTC TTACAGAACG ACTGAGGAAA CTTGATTTAC ATCAGGGATT
TGTTAAAGGA GTTTGTTGGG ATCCTGTTGG GAATTACCTT GCTACCCAGG CGAGTCTAAA
TTGCTGCGCA GTTTTATGAC AATTCTAACA CGTTCATCAC TCCTAGTCCG ACGACAAGAC
GGTGAAGATC TGGAACACTG AGGACTGGTC ATTAGCTGAA ACCATTTCCA AACCGTTCGA
GACATCACCG CAAAGCACAT TCTTCCGAAG ATTGAGCTGG TCGCCTGACG GTGCTTTCAT
CGCAGCGTCC AATGCTATGA ATGGACCAGT GTTTGTTGCT GCGGTGATTG ATCGAGAGGG
TTGGGCGTCG GATATTTCCT TTGTTGGACA CGAAAATACA ATCCAAGTGG CTGTGAGTAT
CCATAACTCC GTAGTGATCT GTGTTGTTCA CTCGATTGTC AGGCTTTCAA CCCTCGCCTC
TTCTTTCCTG AGGGTGAACC TAAAGGAAGA GCGACGGCTT CCAGCATGCT TGCGCTTGGT
GCAAACGATT TTAGCATTTC CATCTGGCGA AATACACTTT ACAAGCCTTT GGTGGTGTTG
AAAGATATTT TCGGAGCCGA CTTGATGGAC CTCTGCTGGT AAGTCTGAGC ACTGCTTGCA
ATTGAAATGT GACCTAAATT TTATTCCTGT AGGTCAAACG ACGGATATGT CTTGTACGGT
TCCTCCGTTG ATGGCTCAGT GTGTGCTATC CAGTTTGAAC CCTCTGAGTT CACCGACCTT
GCCGACTTTT CTGCGACCGA ACTCGTCCTT CGAGAATACG ATTACAAACC CAAACGGGCT
CACCAACCTC TTGCGGTTCA TTCCTCCGCT GCCTCTATCA CCAACGGCTT TGGCCCCTCC
ACCACTACCT CCACTCATGT CAACGTCTTA CAACCTAAAA AGGGCAAAGC CAAACGCCGT
GTGGATCTCT CTAATGGTAA CATTAACGCT GGCCCTAGTG CAGGTCCAAG CCGCCAAGCT
CTTCGACCCC CACCACCAGT TGATCCATTC AGTGGACCTA TACAAGGTTT TGCCAGTCCC
TCGACGGCCC AAGCGTCAAC AGCGAGGATG TTTGAAGATG CGCACCGAGC TTTTGGACCT
GGTAGTGGAA GTATATCAGG CACCTCACCT AGAGCTGGGG ACAAGAGAAA GGCCAGCGGG
TCATATGAGG ATCCTACGAG AGGTGTACGA GGGAGGGGCA TGCCCGTACA GCAACCAATT
CAGCAGTTTG AAGTCCAAAT TATCCGAGCA CCTATGGTGG CACCTTCTCC CTCGGATGCT
GGTCCATCAA AAGCCTATCT CCCGTACCCT CAAGTACAAT CAATTTTGCG GGCCCAAGCT
ATTGGGAATG AAAGTCGTAG TATCTATCTC GAAGCCCGAA ATACTTCAGA TCCGAAAGGG
GAAAATGTGC TTTGCTACTT TGCAGACGGT GAACAGAGGT GGATGGATTA TTTGCCGAAA
GCGGCTTTGG CGGTTACTGT CACGAAGAAT TTTTGTGCGG CGGCTTGCGA GGATGGTAGT
CTGAGGGTGT ATTCCCCTGC AGGACGATTG TGAGTCACAA CCTGTGTCTG ATGTGGATCA
CTCAGTTGAT CACGGAACAG GATACTAAAC ATGAAATTAT CGGGCTTGGT GTATGACTTA
CAAGGGGAAG ACAAGATGCT GTTGATTATA ACAATGGATT GTCAAGTGCG CGTCATGTAA
GCTTTTCTAA GATTCCTTTT GACACATGTA GAAGCTGATA GCCTTGCTAG AAACGTCCGC
AACGGCAAAG CATTTTCCCC ACCGTCGAGC ATCCACCATC TTCTTTTCCC GGGATCATCT
TCATTCCATT CTTTTGACAT CATTTCATGC ACTGTCCGAC CTAATGGTGT TCCTGTGATC
ATCACTTCCG AACCTCAAGC CTTTGCTTAC GACGCATCCT TGCATGAATG GAGCACCATT
GCCTCCCCCC CTATTGCTGG CATCCAGCCT TTGCCGAGTG GCCCTTCAGG TCCTCTCTCT
GTTGTTGATC AGATTGTCGC CAAGTCGGCA CCAGTGACGA CGACTGAAAA GAGTAATGCA
CCTTGGATAG AAGAATCATA TGTCATGTCA CAGTTTGAAA TGAAACTTCG AGGGACGGTG
CTGTTGGATT CAAAGGAGGA ACACAAACAC TGGTTACTGG GGTATATGAA ATATTTGGGA
GATGAGAACT TTGCGGAAAG GGCTGGAGAG GTGATGAAGG ACCTCATTGG CCCTGTATAC
CAGTAAATGA CCCCCTCAGC ACTATCTATG CGGGCTGACG GCTGCATTTT AGTCAATCGA
AACCCACCGG ATGGGAACCC AAACTCTTGG GTGTTGATAA GCGTGAAATA GCGGCGGAAG
TGTTGGATGT GCTCTCGAAG ACATTGCAGG GCAAAAATGT AGCATCGGTG TGGTACGATG
TGCTAGATAA GATGAAGGCA GACGAGGGAT CCTGGTAGTT GTAGGTCGCT TCCCCCAAAG
ACTTATCGGC GTGCTGACCA TTTGTTAGGT AAAACAATAT ATGGTATACT CAATGAACAC
GTTTACATGC ATCTTTCTCC TGGCCTTTGT CTTCTTGAAG CTTCTCAAT
 
Protein sequence
MKVTKPNWVE HTVGEKKAKT AIYSISVHPD GTRLATGGLD HKVKIWSTLP ILDVEAEKEE 
ENPKLLCTMS SHTGSVLSVR WAHHGRFLAT GSDDQVIMIW GLDPDGGGRL WGSDEINVEN
WKALTRLVGH VADVVDLAWS RDDTMLASVG LDSTVWIWDG LTFERLRKLD LHQGFVKGVC
WDPVGNYLAT QSDDKTVKIW NTEDWSLAET ISKPFETSPQ STFFRRLSWS PDGAFIAASN
AMNGPVFVAA VIDREGWASD ISFVGHENTI QVAAFNPRLF FPEGEPKGRA TASSMLALGA
NDFSISIWRN TLYKPLVVLK DIFGADLMDL CWSNDGYVLY GSSVDGSVCA IQFEPSEFTD
LADFSATELV LREYDYKPKR AHQPLAVHSS AASITNGFGP STTTSTHVNV LQPKKGKAKR
RVDLSNGNIN AGPSAGPSRQ ALRPPPPVDP FSGPIQGFAS PSTAQASTAR MFEDAHRAFG
PGSGSISGTS PRAGDKRKAS GSYEDPTRGV RGRGMPVQQP IQQFEVQIIR APMVAPSPSD
AGPSKAYLPY PQVQSILRAQ AIGNESRSIY LEARNTSDPK GENVLCYFAD GEQRWMDYLP
KAALAVTVTK NFCAAACEDG SLRVYSPAGR LILNMKLSGL VYDLQGEDKM LLIITMDCQV
RVINVRNGKA FSPPSSIHHL LFPGSSSFHS FDIISCTVRP NGVPVIITSE PQAFAYDASL
HEWSTIASPP IAGIQPLPSG PSGPLSVVDQ IVAKSAPVTT TEKSNAPWIE ESYVMSQFEM
KLRGTVLLDS KEEHKHWLLG YMKYLGDENF AERAGEVMKD LIGPVYHQSK PTGWEPKLLG
VDKREIAAEV LDVLSKTLQG KNVASVWYDV LDKMKADEGS W