Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNH00870 |
Symbol | |
ID | 3259048 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | + |
Start bp | 940024 |
End bp | 943554 |
Gene Length | 3531 bp |
Protein Length | 650 aa |
Translation table | |
GC content | 52% |
IMG OID | 638258395 |
Product | RNA polymerase II transcription factor, putative |
Protein accession | XP_572280 |
Protein GI | 58270248 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCCCCCGTT TCACCCACAC GTGGCCTTCC TAGCGCCCTA CTCCCGTGAA CAAATAAAAC TTCCACAAGG TGAGCCACGC ATCCTGCCTC TCTGTAGTCC GCCATGCTGC TCGCAGTGGT CTACCTAAAA AGATACCACT TTTCGTCATC TTTGACGCGT CGCGAGGCAT CTTAATGACC TTGTTCGCCA CCCTTCTTCT CAGTGTGGAC TTGAGCGAAA ATACTTGCCC TTGACTTGTC GAAAGTAATC CATTCGGGCA ATTCATTCAT GGGCCTTGCT TTGTATGTCG CCCCTGAGGT ATTCCCTCGT CTGCCATCTT ACTTTTATCT CCAGGGCAAT CAGATAGCCC TGGGACCTGC AAATATGACA AACGTGTGGG ATGCTGATTC TGCTTGTTTT TGCAGGCCGT CCACGACAGC CACTACTCGG GGAAGAGTCA AGTATATTGG AATTTGTCGG CCCAACCAAG TGAGTACATA CGTGTCTATG ATCGTGCATC GGACTTCTTT GCGACTCGGG TGACGTTCTC CGTCGGGGAT GGAAGAGGGT CTCTGCTTTA CATATGTTCG CGAAGACACC TTGCTGTGCT CGAGAAGGAT CTCGGTATTT CCGCGCCCAG TTTGCACTTC TTTGCCCCAT CCTCTTCCCT TCTTCCATCT GGCGCTCGGG TTATAGGTGT CTTACGAACA CTTATCTTTC GCCGGAGTTA CGAAGACAGC TGTGGTGAGC ATTGCAAAGG TACCATGAAA GGATGGCTGG CGTGAGCCCG AGCATCTGGC ACCGCAGCCG GCGCAGAATA TTCTACCATG GGGTCGATGT CGGGAGTTGT GGGTGGATTA GTAAATATGA TCTCGGCATT TTCTTTCCGC TTCATCTGCC AGACCACAAT GTAACGGTTG CTGACGCCAT TCAACAGGCT TCGTCAGCGC GACTCACAGT ACAGGTCACG AAGCGTAGAT CAGGTCCGTC GTTATCGCTT TGCGGATAGA TGAAAGACGA CGAAACATCC TTCTTTGTCT TCCTTTGTCC TTCAGTGATT TCCCACACCA CACCGTAGTT CCTGGAACGC CCGCGCCACC TCCTTACATT CACATCTCAC GTACAACATG AGCTTCGCAG CTCCCGACGA CCGAGCATAT TATAACTACT CCCGTGCGCC AGGCTCCGCT CACGGTCCCG GTCACGAATC TACTTCGCCT GATCAGAGAC TTCCTTCATC TCATGGTTAC AACAAGGGTG CATCCGCACC TAACGAATCA CCTAATCAAG TATACCATCC CGAAATGGCA GGTCCTCCTG GTTCTTCCCA TAGTTATCCT CCTCAACCTA TGACGCCACT CACTGGCTCT AGCGCTTATC CACCCCAACA TCCTCCTTCT GGTAGTTACT ATCTGCCCCC GCAGCAACAG CAACAACAAC AATCGGAACA ACCTATTCCT TCCCCTCCTA GCAGCTCTAA CCGACCTCCT TCAGCTACTG GTTATACCCC CGATGGGCAG CCTATCATCC CCGTAGGTGT GTCTGGTGGC AAAATGTTTA GGTGCAGAGG TTACGGTGAT TGTGACAAGG TCTTTACAAG AAGTGAGCAT TTGGCAAGGC ATGTGAGGTG AGTGGCTTTT AGTGTTTGCC GATAACGATT GTACCTTGAC TGATAAGTGG TTTCGCGTCC TTTGTTACCC CTGGTACCTT AATTACAAAT CCAGAAAACA TACCGGTGAA CGTCCTTTCC CTTGTCACTG TGGCAAAGCA TTCTCCAGGC TCGACAACCT ACGTCAGCAC GCTGCTACCG TCCACGCTGA ACAGGCTCAG CTGAATGAAA CCATGCTTGC TTCTCTCGCG CCTATCCACG CTGCCCTTTC TCAGCGTGCT AGCAGAGAAC AACGACGAAG AGGTGAAGTT GTCGAGGTGC CTAAGGGTGC AGTCGAACGA CGTCGCGAGA CCCGCAAAGC CCAAGCCGCT CAAGCCGCCG CCGCTGCCCA AGCTGCTGCT ACAAATGGCC ACAGCCAGCA AAACTCTCCT TACGCTCAGT ACCACGAATC TCAGTGGAAT GCCCCCCCTC ATCCTCGACC CAGGACTAAT GGCGGGTATG ATTACCCTTA TGTGCCTGAG CATAGTCTTA ATGACGATGC TGGACCATCA AGGAGACCCA GTTCGTCTGC GGGCTATGGC TACCAACAGG GCTACTATGA CTCTGCTCGA CCTCCTACTG CCCCTGGGTC TGGTTCTTCT GGCGAGTCGA TGTCTGGGTT GCCTTACCCC TACCGACCCA TGTCTGCTTC AGGCCGTGAA CTCCCCGTTC CTGCGCATTA TTCCGAGTCT GAGCCTCCTG CTCCTGCCCA CGGCCCTCCT CCCCAGTCAC CAATGTACGG CAACGTCCCT CCTGCTCAAC AACAGCCTCC CAACTGGTCT TCTCCTCCGG GACACGGTAC CTACCCTCCT CACGACACCG CTGCATACCC TCCTCCTCCC GAAGGCTATT ACCCTCCTGC ACATGCTGCT CACGGCTCAT ATCCTCCTAG GGAAGACGTG TACGAGTACC ATCCTCCAGG TTGGCAGGGC CAGTACCCTC CTGCGGCGGC TAATGGCGGA CCCTACGCTC AGGGATACGG CACCGGTGCA CCCCCAACTG CTCCTCCACC TCACGAGTCT CCCTTCCAAT ACAACGTTCC CAATCCAGGC GCTGAGGGTT ACCCATATCA AAACTACGAC TCCCGCAAGC GTCGCGCGGA AGATGACTTT GGCAAGGATG ACCGTAAACA CCCCCGGCCA TCATCATCTT CTAACTCTCA ACTTCCCGAT AGTTCCACCC CTGCTCACGC GAACGGCGCG CACGGTGCTC CTCATCCTCA CCCTCATCCC GGTGCGGCAC CAGGTAATGG AGCCATGGAC CCTCCACGCC CGCATGATCC CAACTGGTTG CCTGCCACTA CGGAGAGAAG GGGTAGTTTG GCGATCTCCG CGTTGTTAGG TAGTCCTCCC AAGGCGGTGA GGAGTAGACC CTCGACAGCT GATGGCGGTA CGGCGGGTGC GCACGGGTAC GAAAGTTATC ATTATGACCC CCATGGTCAG GTATCCGATG ATCAACAGAC AGATGGGGAC AAGGAGAGGG AAGAGAAGGG GGAAGTGAAG AAACATGGTG ATAAGAAGAA GCTGTAAGGC TGAAAGGATG TGAGATTCAG GCAGTGTTGT GGATGTTTTT CTTTTTTTTT CCATATATTT CTCTTCGAGT GATACCCACT TGCGGCTTTT CGAAATGAGT GATTGATGAC CCCTAATTTT ACGAAGGAAA GGATTTATTG GCTAATTGTG CGAAGTAATT TAGTTATCTA GCTATGCTAT ACGATTTACG GGCTTGGAGG CAGATTAGAC TAAGAGCTCT CTCTCTTTTA TTTTACCTTC CTAACCCCTT CATTAATCTA TTTTTCGTAC TTCAATTTAT AGCTTTCTGG TGTTTAGATA CACGATGTGA TATGAATTCT CTGATTTTTA TTACTGTTTT TGATACGATT GATGCATTTT GCCTCTTATT T
|
Protein sequence | MSFAAPDDRA YYNYSRAPGS AHGPGHESTS PDQRLPSSHG YNKGASAPNE SPNQVYHPEM AGPPGSSHSY PPQPMTPLTG SSAYPPQHPP SGSYYLPPQQ QQQQQSEQPI PSPPSSSNRP PSATGYTPDG QPIIPVGVSG GKMFRCRGYG DCDKVFTRSE HLARHVRKHT GERPFPCHCG KAFSRLDNLR QHAATVHAEQ AQLNETMLAS LAPIHAALSQ RASREQRRRG EVVEVPKGAV ERRRETRKAQ AAQAAAAAQA AATNGHSQQN SPYAQYHESQ WNAPPHPRPR TNGGYDYPYV PEHSLNDDAG PSRRPSSSAG YGYQQGYYDS ARPPTAPGSG SSGESMSGLP YPYRPMSASG RELPVPAHYS ESEPPAPAHG PPPQSPMYGN VPPAQQQPPN WSSPPGHGTY PPHDTAAYPP PPEGYYPPAH AAHGSYPPRE DVYEYHPPGW QGQYPPAAAN GGPYAQGYGT GAPPTAPPPH ESPFQYNVPN PGAEGYPYQN YDSRKRRAED DFGKDDRKHP RPSSSSNSQL PDSSTPAHAN GAHGAPHPHP HPGAAPGNGA MDPPRPHDPN WLPATTERRG SLAISALLGS PPKAVRSRPS TADGGTAGAH GYESYHYDPH GQVSDDQQTD GDKEREEKGE VKKHGDKKKL
|
| |