Gene CNI01500 details


Gene Information

Locus tag: CNI01500
Symbol:
ID: 3259419
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006694
Strand:
Start bp: 432699
End bp: 436224
Gene Length: 3526 bp
Protein Length: 950 aa
Translation table:
GC content: 51%
IMG OID: 638258633
Product: specific RNA polymerase II transcription factor, putative
Protein accession: XP_572987
Protein GI: 58271662
COG category:
COG ID:
TIGRFAM ID:
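
The listed gene length follows from the start and end coordinates if they are treated as 1-based and inclusive. The sketch below checks that arithmetic and estimates how much of the gene is not protein-coding codons. The function name gene_length is a hypothetical helper, not part of any database API, and the inclusive-coordinate convention is an assumption that happens to reproduce the listed value.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above
start_bp, end_bp = 432699, 436224
protein_length_aa = 950

length = gene_length(start_bp, end_bp)
print(length)  # 3526

# 950 amino acids need 950 codons; the remainder covers the stop codon,
# introns (the gene is marked as spliced) and any untranslated sequence.
noncoding = length - 3 * protein_length_aa
print(noncoding)  # 676
```

The 676 bp remainder is consistent with the "Is gene spliced: Yes" entry, since an unspliced coding sequence of 950 codons plus a stop codon would span only 2853 bp.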


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAGCGACCCA AATGAATGAT CTATCAGCAG CCTCGAGGGA AAAGATCAGG AGAGGATACC 
GCGCTTGGTA AGCACCCACT CACAGAGCCA GTCCCAGCGG TCCGCCCACC GCCATCCTCC
TCTCGGTACT GACACCAGCA CGCACCTCAC AGCCTCCACT GCAGGTCTCG CAAAGCAAAG
TGCGATCTCG CAAGTATTCC GGGTCTCGCT ATCTACGCCA TACGCATGCT GATCCACAAT
CTTCATAGGG AGACATTGAC GCGCCCTCTT CTCCGCCATG CAGCCGATGC AAGCGCGAAA
GCCGAGAATG TGTGTTTGCT CCCTCTCGCC GGGGAGGAAA CAACAAGAAG AGGGCACGTA
CGGATTCAGA AGACATTTCC AGAGAGGACG AAGATCCGCC GCGCCCACTG GGTCGTTCCA
CAGCCGAAGT TACGTCGGGC GAAGAACATT CCTCCCAGAC TTTTCCGTAC CCTCAACCGC
CTCCACAGCC CCGACATCCA TCTGTGCATA ATCTTCTAGG TCATTCTCCG CCACCCGCTC
GATACCGTCA GCATTTCGGC CATACATCTT CCAGTTCACA ATCGTCTCCT CAAACTAGCT
TTTCCGCCGA CCATGCAACT CCACAGACCC GACCCACACT GTCTAATGTT CAGTCTCACA
CAGCCACAGG ATACCCAGAC TCTCCTCCTT CACCCAGACG AAGACGCACG GCCAATCCAC
CGTTGCACGC TGCCGATCCA AGTAGTATTG TTGTAGCAGA CATGCGTAAT GAGAGTGATG
CGTTGCAGAT CCTTGCGCTT GCTAGTGGTC AGGCAGCAAA TAGGGATGGT GAAGAAGAGA
GATCTGATAG GCATGACGGC CATAGCGTGC CAGGCACTGT CGATTCTACA GGTATGGGAG
GTCATCAGCA GCAAATGAGG GGAGCGCCGT CGTCTCCGGA AAAAGAAATG CAGCTGGCAA
AGCTGGCGCA GTTCCCTCTG GTTAAATTGG GAATCCTCAG CGTTGAGCAG ACGACAAGAC
TTGTGGACAT GTTTTTCAGG TGTCACCATC ACTTTTTCGT TAGTGTCTCC GCTTCTTGCA
TTGCTCAACT CAAAATTCAG AAAATGCTAA CTCTAAACTT TAGCCAATTA TTCCCTCGGA
TGGCATCCCA AGGACGCTTG AACAATTATC CGTCTTTGCC CAAAACGAAA AATACCTTTT
GGCGACCATC ATCATCATTT CGAGCCGGGT GGAAAACACG CCAGAGATGA GAGAGATCCA
TGAACGCTCA TGGGCAGTCA TGCGCAGGTG GATCTCCAAA GTCCAATGCC TCGGTGAACC
ACCCACCATA GGCCTCGTCG AATCCCTCCT TTTACTCGCC GAGAACCTGC CACGCACTTC
ACGTGAGATG ATGTCTGATG ATTCTACCGA GACGGATGCG ATAGAAGAGC CGCACGGTGT
GGAGAACAGG CAGGCTTGGC AGATGATAGG TTTGGCGGTG AGGAGTGCGT ATGAGATGGG
GTTGGATAAG TTGGGTCTGC AGTTGATACC GGAGACAGAG AGAACGCTGG AGTTGGAGAG
GGCAAAGTTG GTTTGGGTCT GTGAGTGCGA CCATGTCCCT ACACGAATTG ACGTGATGCT
GATTGTTTGG CAAGACTGCT ACCTTTTTGA CAGACAGTGA GTTGATTAGT TTTAGCTTTG
GATTTTATTC CAAACGCTAA TTTTTTTTCC TAGTGTCTCT ATGCGACTCG GTAAAGGTTT
CTGGACACGA GGTGGCGCAG TCTGTTTCCA AGGCTACTCT TCCTCTGCTC AAACTGGCCC
TGCCGCTGCT CTCGTCAATT TCCCCTTTTT ACGCGAAATC AGGCCTGGCG ATCCTCATAG
CGATCATCCA CAAGATGACT TGGGTAGTTT GGTACAGGCG TATCTGGAAC TGACGATGAT
GATGAGTAAC GCGCATGATG TACTTTATCC TAATGCGGCG AGAACAAGGT CACTTGTTGT
GTAGGTATTT TTTTAAATTT TTTTTTGTCT TCTTTATGGG ACAAGTACCC GAGAAATGCT
GATGGTTGTT AGTTACGGAG AGTACTTCAA ATATATTGAC GAAATGGCAC GATCGCTGGA
TGGGTTCAAG ATTTTATGGC GACGTAAAAA ATGGACACTG TTCCCTCTCA CTGACACTGT
CTGGGTCATG TTCTACTACA TACAGCTCTA TATATGTGCC TTCAGTTTCG TCAGTACCTA
CTTGGCCTAC TTACATAATG TTATCAACTG ACGTCTCTTG CAGCAAGCGC ACGTCGAACG
GGCAACTATC CGAGGCGAAG AAGAGTACAA ACTCTTGGAA CAACGGCACA AGGAACAAGG
AGGTACGACA AAACTCGCCA AACCGTCTCT CAGTCTTTTC CCGCGGGGTG CTGCTCAAAG
TCCCGATGCG CGATACATTT TCCAAATGTG TGACGCCGCG AGAGAACTTA TACACATTTG
CGTGGATAAT CTGTACCCTG GTGGAGCGCT GCCTTATCTG CCTTCAAGGT TCTTATTGTG
GTTCACGTAT GGAGCGATTG TGCTGTTGAA AGCTATTTAC TCAGGGGCTA TGCTCAGAGC
GGACCACAAA AGGTGAGTAC ATTCTCCATT TTTCTTTTTT TTTGGGTACT GAACAAGATT
GTAGAACCCT TGATTTGATT GACCGGCTTT GCACTTGCTT TGCTCAGTGC TCGACAGACG
AAGAGTACCC CGCGGTACGA TACGGCAAGC AGCTCGAAGC ACTCCGCAAT AAGCTTGCCG
GGTTATCAGA CGTCAAAAGC ACTCAAAGCC CCAATGGCTC TCAGACGGTC AGACTTCCTC
GTACGCAAAA CCGGGCCCCA GAGGTCCGCG TCAGTCCATC CCAGACTTCT ACCTCTCCCA
ATACAGTGCC TCCGCCCCAA CAGTTTGACC CACGGCCGTC GACATTTCAA CCTGCCCGGC
CTCATGCTAT GCCAAACGAA CACGCCGTCA ATACGCCGAC GTGGCAGTTA CCGGTCTTGC
AGCAGTACAC TCAGCCAGTC ACATTCCCAT ACCCTACAAC CCCAGTCTCC TTTACTACGG
GTCCTTCTAC AGAAATGTCT GCGCCATACG TCGCGCCGCC GCACCAACAG CCATTCATGG
AATTTGGTGT GGCGCAGTCA TCAGACGGTT TGAATTTGGG TATGAACGAC CATCATAACT
TGGGGTTCAC CACGCTTGAT GATTGGTTTG GGTTCGGAAC AGCCGGTACG GCTGGTGGAG
CTGGCCAAGG TGGGAATGGA GTGGATGATC CGATGGGATT GGCGAATGTC GGGTTGGATC
TGCAGGATTT CTGGATGAAC GTCGGGCCGG GTGAGGTGAG TGTGTGGTGG TGGTTTCTTA
CATTTGCTGG TATTGGAAAA TATAGCAGAT TTGCTGACTC CCCTTTTTTC TTCCCTGATC
AGGCTCAAGG AGGGTTCCCG TTCCGATAAA CAGCGTTTCT TTCCCTGTAC TTTTGTTACG
ACTAATACAG AGCGATTAAT GTTAGGATAA TGTATTACTA GTTAAT
 
Protein sequence
MNDLSAASRE KIRRGYRACL HCRSRKAKCD LGDIDAPSSP PCSRCKRESR ECVFAPSRRG 
GNNKKRARTD SEDISREDED PPRPLGRSTA EVTSGEEHSS QTFPYPQPPP QPRHPSVHNL
LGHSPPPARY RQHFGHTSSS SQSSPQTSFS ADHATPQTRP TLSNVQSHTA TGYPDSPPSP
RRRRTANPPL HAADPSSIVV ADMRNESDAL QILALASGQA ANRDGEEERS DRHDGHSVPG
TVDSTGMGGH QQQMRGAPSS PEKEMQLAKL AQFPLVKLGI LSVEQTTRLV DMFFRCHHHF
FPIIPSDGIP RTLEQLSVFA QNEKYLLATI IIISSRVENT PEMREIHERS WAVMRRWISK
VQCLGEPPTI GLVESLLLLA ENLPRTSREM MSDDSTETDA IEEPHGVENR QAWQMIGLAV
RSAYEMGLDK LGLQLIPETE RTLELERAKL VWVYCYLFDR HVSMRLGKGF WTRGGAVCFQ
GYSSSAQTGP AAALVNFPFL REIRPGDPHS DHPQDDLGSL VQAYLELTMM MSNAHDVLYP
NAARTRSLVV YGEYFKYIDE MARSLDGFKI LWRRKKWTLF PLTDTVWVMF YYIQLYICAF
SFQAHVERAT IRGEEEYKLL EQRHKEQGGT TKLAKPSLSL FPRGAAQSPD ARYIFQMCDA
ARELIHICVD NLYPGGALPY LPSRFLLWFT YGAIVLLKAI YSGAMLRADH KRTLDLIDRL
CTCFAQCSTD EEYPAVRYGK QLEALRNKLA GLSDVKSTQS PNGSQTVRLP RTQNRAPEVR
VSPSQTSTSP NTVPPPQQFD PRPSTFQPAR PHAMPNEHAV NTPTWQLPVL QQYTQPVTFP
YPTTPVSFTT GPSTEMSAPY VAPPHQQPFM EFGVAQSSDG LNLGMNDHHN LGFTTLDDWF
GFGTAGTAGG AGQGGNGVDD PMGLANVGLD LQDFWMNVGP GEAQGGFPFR
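
Both sequences are printed in blocks of ten residues, sixty per line, so they have to be joined before any length or composition check. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. The helper names clean_sequence and gc_fraction are hypothetical, and only the first two lines of the gene sequence are used as input, so the printed percentage describes that excerpt rather than the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Join a space- and newline-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two lines of the gene sequence, copied from the record
excerpt = """
CAGCGACCCA AATGAATGAT CTATCAGCAG CCTCGAGGGA AAAGATCAGG AGAGGATACC
GCGCTTGGTA AGCACCCACT CACAGAGCCA GTCCCAGCGG TCCGCCCACC GCCATCCTCC
"""

seq = clean_sequence(excerpt)
print(len(seq))  # 120
print(f"GC content of excerpt: {gc_fraction(seq) * 100:.0f}%")
```

Running the same two functions on the full gene sequence block should give a length equal to the listed Gene Length and a percentage comparable to the listed GC content, assuming the record computes GC content over the same span.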