Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNI01500 |
Symbol | |
ID | 3259419 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006694 |
Strand | - |
Start bp | 432699 |
End bp | 436224 |
Gene Length | 3526 bp |
Protein Length | 950 aa |
Translation table | |
GC content | 51% |
IMG OID | 638258633 |
Product | specific RNA polymerase II transcription factor, putative |
Protein accession | XP_572987 |
Protein GI | 58271662 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAGCGACCCA AATGAATGAT CTATCAGCAG CCTCGAGGGA AAAGATCAGG AGAGGATACC GCGCTTGGTA AGCACCCACT CACAGAGCCA GTCCCAGCGG TCCGCCCACC GCCATCCTCC TCTCGGTACT GACACCAGCA CGCACCTCAC AGCCTCCACT GCAGGTCTCG CAAAGCAAAG TGCGATCTCG CAAGTATTCC GGGTCTCGCT ATCTACGCCA TACGCATGCT GATCCACAAT CTTCATAGGG AGACATTGAC GCGCCCTCTT CTCCGCCATG CAGCCGATGC AAGCGCGAAA GCCGAGAATG TGTGTTTGCT CCCTCTCGCC GGGGAGGAAA CAACAAGAAG AGGGCACGTA CGGATTCAGA AGACATTTCC AGAGAGGACG AAGATCCGCC GCGCCCACTG GGTCGTTCCA CAGCCGAAGT TACGTCGGGC GAAGAACATT CCTCCCAGAC TTTTCCGTAC CCTCAACCGC CTCCACAGCC CCGACATCCA TCTGTGCATA ATCTTCTAGG TCATTCTCCG CCACCCGCTC GATACCGTCA GCATTTCGGC CATACATCTT CCAGTTCACA ATCGTCTCCT CAAACTAGCT TTTCCGCCGA CCATGCAACT CCACAGACCC GACCCACACT GTCTAATGTT CAGTCTCACA CAGCCACAGG ATACCCAGAC TCTCCTCCTT CACCCAGACG AAGACGCACG GCCAATCCAC CGTTGCACGC TGCCGATCCA AGTAGTATTG TTGTAGCAGA CATGCGTAAT GAGAGTGATG CGTTGCAGAT CCTTGCGCTT GCTAGTGGTC AGGCAGCAAA TAGGGATGGT GAAGAAGAGA GATCTGATAG GCATGACGGC CATAGCGTGC CAGGCACTGT CGATTCTACA GGTATGGGAG GTCATCAGCA GCAAATGAGG GGAGCGCCGT CGTCTCCGGA AAAAGAAATG CAGCTGGCAA AGCTGGCGCA GTTCCCTCTG GTTAAATTGG GAATCCTCAG CGTTGAGCAG ACGACAAGAC TTGTGGACAT GTTTTTCAGG TGTCACCATC ACTTTTTCGT TAGTGTCTCC GCTTCTTGCA TTGCTCAACT CAAAATTCAG AAAATGCTAA CTCTAAACTT TAGCCAATTA TTCCCTCGGA TGGCATCCCA AGGACGCTTG AACAATTATC CGTCTTTGCC CAAAACGAAA AATACCTTTT GGCGACCATC ATCATCATTT CGAGCCGGGT GGAAAACACG CCAGAGATGA GAGAGATCCA TGAACGCTCA TGGGCAGTCA TGCGCAGGTG GATCTCCAAA GTCCAATGCC TCGGTGAACC ACCCACCATA GGCCTCGTCG AATCCCTCCT TTTACTCGCC GAGAACCTGC CACGCACTTC ACGTGAGATG ATGTCTGATG ATTCTACCGA GACGGATGCG ATAGAAGAGC CGCACGGTGT GGAGAACAGG CAGGCTTGGC AGATGATAGG TTTGGCGGTG AGGAGTGCGT ATGAGATGGG GTTGGATAAG TTGGGTCTGC AGTTGATACC GGAGACAGAG AGAACGCTGG AGTTGGAGAG GGCAAAGTTG GTTTGGGTCT GTGAGTGCGA CCATGTCCCT ACACGAATTG ACGTGATGCT GATTGTTTGG CAAGACTGCT ACCTTTTTGA CAGACAGTGA GTTGATTAGT TTTAGCTTTG GATTTTATTC CAAACGCTAA TTTTTTTTCC TAGTGTCTCT ATGCGACTCG GTAAAGGTTT CTGGACACGA GGTGGCGCAG TCTGTTTCCA AGGCTACTCT TCCTCTGCTC AAACTGGCCC TGCCGCTGCT CTCGTCAATT TCCCCTTTTT ACGCGAAATC AGGCCTGGCG ATCCTCATAG CGATCATCCA CAAGATGACT TGGGTAGTTT GGTACAGGCG TATCTGGAAC TGACGATGAT GATGAGTAAC GCGCATGATG TACTTTATCC TAATGCGGCG AGAACAAGGT CACTTGTTGT GTAGGTATTT TTTTAAATTT TTTTTTGTCT TCTTTATGGG ACAAGTACCC GAGAAATGCT GATGGTTGTT AGTTACGGAG AGTACTTCAA ATATATTGAC GAAATGGCAC GATCGCTGGA TGGGTTCAAG ATTTTATGGC GACGTAAAAA ATGGACACTG TTCCCTCTCA CTGACACTGT CTGGGTCATG TTCTACTACA TACAGCTCTA TATATGTGCC TTCAGTTTCG TCAGTACCTA CTTGGCCTAC TTACATAATG TTATCAACTG ACGTCTCTTG CAGCAAGCGC ACGTCGAACG GGCAACTATC CGAGGCGAAG AAGAGTACAA ACTCTTGGAA CAACGGCACA AGGAACAAGG AGGTACGACA AAACTCGCCA AACCGTCTCT CAGTCTTTTC CCGCGGGGTG CTGCTCAAAG TCCCGATGCG CGATACATTT TCCAAATGTG TGACGCCGCG AGAGAACTTA TACACATTTG CGTGGATAAT CTGTACCCTG GTGGAGCGCT GCCTTATCTG CCTTCAAGGT TCTTATTGTG GTTCACGTAT GGAGCGATTG TGCTGTTGAA AGCTATTTAC TCAGGGGCTA TGCTCAGAGC GGACCACAAA AGGTGAGTAC ATTCTCCATT TTTCTTTTTT TTTGGGTACT GAACAAGATT GTAGAACCCT TGATTTGATT GACCGGCTTT GCACTTGCTT TGCTCAGTGC TCGACAGACG AAGAGTACCC CGCGGTACGA TACGGCAAGC AGCTCGAAGC ACTCCGCAAT AAGCTTGCCG GGTTATCAGA CGTCAAAAGC ACTCAAAGCC CCAATGGCTC TCAGACGGTC AGACTTCCTC GTACGCAAAA CCGGGCCCCA GAGGTCCGCG TCAGTCCATC CCAGACTTCT ACCTCTCCCA ATACAGTGCC TCCGCCCCAA CAGTTTGACC CACGGCCGTC GACATTTCAA CCTGCCCGGC CTCATGCTAT GCCAAACGAA CACGCCGTCA ATACGCCGAC GTGGCAGTTA CCGGTCTTGC AGCAGTACAC TCAGCCAGTC ACATTCCCAT ACCCTACAAC CCCAGTCTCC TTTACTACGG GTCCTTCTAC AGAAATGTCT GCGCCATACG TCGCGCCGCC GCACCAACAG CCATTCATGG AATTTGGTGT GGCGCAGTCA TCAGACGGTT TGAATTTGGG TATGAACGAC CATCATAACT TGGGGTTCAC CACGCTTGAT GATTGGTTTG GGTTCGGAAC AGCCGGTACG GCTGGTGGAG CTGGCCAAGG TGGGAATGGA GTGGATGATC CGATGGGATT GGCGAATGTC GGGTTGGATC TGCAGGATTT CTGGATGAAC GTCGGGCCGG GTGAGGTGAG TGTGTGGTGG TGGTTTCTTA CATTTGCTGG TATTGGAAAA TATAGCAGAT TTGCTGACTC CCCTTTTTTC TTCCCTGATC AGGCTCAAGG AGGGTTCCCG TTCCGATAAA CAGCGTTTCT TTCCCTGTAC TTTTGTTACG ACTAATACAG AGCGATTAAT GTTAGGATAA TGTATTACTA GTTAAT
|
Protein sequence | MNDLSAASRE KIRRGYRACL HCRSRKAKCD LGDIDAPSSP PCSRCKRESR ECVFAPSRRG GNNKKRARTD SEDISREDED PPRPLGRSTA EVTSGEEHSS QTFPYPQPPP QPRHPSVHNL LGHSPPPARY RQHFGHTSSS SQSSPQTSFS ADHATPQTRP TLSNVQSHTA TGYPDSPPSP RRRRTANPPL HAADPSSIVV ADMRNESDAL QILALASGQA ANRDGEEERS DRHDGHSVPG TVDSTGMGGH QQQMRGAPSS PEKEMQLAKL AQFPLVKLGI LSVEQTTRLV DMFFRCHHHF FPIIPSDGIP RTLEQLSVFA QNEKYLLATI IIISSRVENT PEMREIHERS WAVMRRWISK VQCLGEPPTI GLVESLLLLA ENLPRTSREM MSDDSTETDA IEEPHGVENR QAWQMIGLAV RSAYEMGLDK LGLQLIPETE RTLELERAKL VWVYCYLFDR HVSMRLGKGF WTRGGAVCFQ GYSSSAQTGP AAALVNFPFL REIRPGDPHS DHPQDDLGSL VQAYLELTMM MSNAHDVLYP NAARTRSLVV YGEYFKYIDE MARSLDGFKI LWRRKKWTLF PLTDTVWVMF YYIQLYICAF SFQAHVERAT IRGEEEYKLL EQRHKEQGGT TKLAKPSLSL FPRGAAQSPD ARYIFQMCDA ARELIHICVD NLYPGGALPY LPSRFLLWFT YGAIVLLKAI YSGAMLRADH KRTLDLIDRL CTCFAQCSTD EEYPAVRYGK QLEALRNKLA GLSDVKSTQS PNGSQTVRLP RTQNRAPEVR VSPSQTSTSP NTVPPPQQFD PRPSTFQPAR PHAMPNEHAV NTPTWQLPVL QQYTQPVTFP YPTTPVSFTT GPSTEMSAPY VAPPHQQPFM EFGVAQSSDG LNLGMNDHHN LGFTTLDDWF GFGTAGTAGG AGQGGNGVDD PMGLANVGLD LQDFWMNVGP GEAQGGFPFR
|
| |