Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CND05520 |
Symbol | |
ID | 3257096 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006686 |
Strand | + |
Start bp | 1493971 |
End bp | 1497223 |
Gene Length | 3253 bp |
Protein Length | 925 aa |
Translation table | |
GC content | 53% |
IMG OID | 638256490 |
Product | transcription factor, putative |
Protein accession | XP_570545 |
Protein GI | 58266778 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.21086 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCTTGCCAG CTTCCATCTT GCCAGCTCGC CAAAGACGCG TAAGCTTCCC CGCAAGACAC CGCCATGGAG CCGCCCAGGT AAGCCGTCTT CCGTCGCCGC CCTCCCGCAT CTCATCCCAT TGTGCAATAG CAACCCCATC CAGCCCCCCG TCACCCCGTC CCACCACTCT CTCCTTTCAG CCATCTCCCC GGCTCTGTCA GAACAGACAC CAGCTCCTAT CCACACCCTC CCTCCCCATC TTCGGCCCTC CATTCCGCAA CCCCATATCG CTCCTCCACG CCCCAGCTCT GTACAGCCAA CAATGGAGGA GCAACAGAGA ATGCATCACA TCCAACAGCA TCAGCAGCAA CAACATTTCC AACAGCAACA GAATGATGAG AATGTTTTTG GCTCAGTGAT GGGGGCACCA GGCCATGTTC CGGGACATGA GGCTCCGATG AGTACCCAGC CTAAAGTCTA TGCAAGTGTC TATTCTGGAG TACGTCTTTC CATCCAACCT CTAAACAAAA GAAAGTTACT CATGGTGACC GCAGGTTCCC GTATTTGAAG CCATGATTCG AGGTATCTCT GTGATGAGGC GTGCTTCCGA CTCGCAAGTC TTCCTCTTCC ACACATCGCG TCATTGGTCC CGTCTAACGT GATGTCTTGC AGATGGGTCA ACGCGACACA AATTCTCAAA GTTGCCGGCG TGCACAAGTC CGCTCGAACC AAGATACTGG AAAAGGAAGT GCTCAACGGC ATTCATGAAA AAATTCAAGG CGGATGTGCG TGCAATTGTA TCCGCTCTCG TTAATCTACG CTAAAACGTT CACTGTAGAC GGAAAATACC AAGGAACCTG GGTTCCACTT GACCGTGGGC GGGATCTTGC AGAACAATAC GGTGTCGGAA GCTACCTGTC TTCTGTCTTT GACTTTGTTC CTTCCGCGTC CGTCATTGCT GCCCTCCCCG TGATTCGCAC AGGTACTCCT GACCGTTCTG GACAACAAAC TCCTTCCGGA TTGCCAGGTC ACCCTAATCA GCGAGTCATC TCTCCCTTTG CTAATCACGG CCAAACGACT CCCCATATGC CTCCTCCTCA ATTCATACAT CAAGGTAACG AGCAAATGAT GAACCTTCCT CCCCACCCCT CCTCCTTGGC TTACCCTACA CAGCCTAAAC CTTACTTCTC CATGCCTCTT CAGCATACTG TCGGTCCACA GTATGATGAA AGACATGAAG GTATGACCAT GACACCTACC ATGAGCATGG ACGGCTTGGC CCCTCCGGCT GATATTGCCC GCATGGGTTT CCCATACAAC CCATCCGACA TTTATATTGA CCAATACGGC CAGCCACATG CCACCTACCA AGCTTCGCCT TATGGGAAGG AAAGTGGCCA TCCATCTAAG CGTCAGAGAT CAGATGCCGA GGGCAGCTAT ATCGAGAGCG GTGCCGCTGT CCAACAACAT GTTGAACAAG ATGAAGAAGC CGACGATGGT TTGGACAATG ACTCTACCGC GTCGGACGAC GCCCGCGACC CTCCCCCGCT CCCAAGTTCG ATGCTTCTTC CCCATAAACC GATCCGACCC AAGGCTACTC CAGCCAACGG CCGTATCAAG AGCAGGCTCG TCCAGATATT TAACGTGGAA GGTCAAGTTA ATCTCCGAAG CGTCTTTGGA TTGGCACCAG ATCAGCTACC CAATTTTGAC ATTGACATGG TAATCGACGA CCAAGGTCAC TCTGCCTTGC ATTGGGCTTG TGCCCTCGCC AGACTGTCCA TCGTGCAACA GCTCATCGAA CTTGGTGCCG ATATCCATCG AGGCAACTAC GCCGGAGAGA CCCCCCTTAT TCGCGCTGTC CTTACTTCCA ACCACGCCGA AGCTGGCTCC TTTACTGATC TTTTGCACCT CCTTTCCCCG TCGATTCGCA CGCTTGACCA TGCCTACCGC ACGGTTCTGC ACCATATTGC GCTGGTCGCT GGTGTCAAGG GCCGAGTACC TGCTGCGAGG ACTTATATGG CCAGTGTTCT CGAGTGGGTC GCCAGGGAAC AACAGGCCAA TAACACGCAT AGTATCACAA ACCCTCCCAA CCCTGCTGAT CGCAATGAGC TGGCACCGAT CAACCTTCGT ACTCTTGTGG ACGTTCAAGA CGTACATGGT GATACTGCTT TGAATGTCGC CGCACGAGTG GGTAACAAGG GACTGGTGGG TTTGCTATTG GATGCTGGTG CGGACAAGAC ACGGGCCAAC AAACTGGGAC TCAGGCCGGA AAACTTTGGC TTGGAGATTG AGGCTCTCAA GATCTCGAAT GGCGAGGCTG TCATGGCAAA CCTCAAATCA GAAGTGTCCA AGCCCGAGAG GAAGAGCCGC GACGTGCAGA AAAGTGAGCA TTATTCATTT ACTCAGTCTT ATGAGACGTA CTAATCATGT CCCGCACAGA CATTGCGACC ATCTTTGAAT CCATATCCTC CACCTTTTCG AGTGAAATGC TCGCCAAACA AACGAAATTG AATGCCACCG AAGCTTCTGT CCGCCATGCC ACTCGCGCGC TTGCGGACAA ACGGCAACAC CTTCACCGCG CTCAAGAGAA ACTCGCTACG ATGCAACTGT TTGAGCAACG TTCTGAAAAC GTGCGGCGTA TCATGGACGC CATCGCCGCC GGCACGCTGT TGACGCCTGC AGAGTTTACT GGCCGAACGC AGACGATGCA CGAAAAATCC ACGGGCCAAC TGCCTCCTCT TGCATTCCGG CATGTTCCAG GCTTGGCACT CGACGCGTCC TCGCAATCCC AGCTGAACGG CGCGCCCCCA TCCACACCGC TTTCCGTCGA GGACCAAGAG GACATTGCTT TGCCTGAGCG AGACGATCCA GAATGTCTGG TAAAACTCAG ACGTATGGCT CTGTGGGAAG ATCGGATTGC AGAAGTGTTG GAAGACAAGA TTAGGGCAAT GGAGGGGGAA GGTGTGGATA GGGCGGTCAA GTATCGCAAG TTGGTTAGTG TGTGCGCCAA GGTTCCTGTG GATAAAGTAG ACTCTGTAAG TTTCTGTTTC CTTCGCCGCT GTATATGTGA GATGGCTAAA ACGGATGGGA ACAGATGTTG GACGGGCTAG TCGCTGCTGT GGAGAGTGAA GGGCAAGGGC TGGATTTCTC TAGAGCCAGC AATTTTGTGA ACCGGATAAA AGCGACGAAA TCATAAGACT TGTTGTCAAG AACGACTACA TGTTTTGTTT TTGTTTTTTC TTGTCGTTTT GAATTTCTTT GATCTTCTAA TGT
|
Protein sequence | MEPPSNPIQP PVTPSHHSLL SAISPALSEQ TPAPIHTLPP HLRPSIPQPH IAPPRPSSVQ PTMEEQQRMH HIQQHQQQQH FQQQQNDENV FGSVMGAPGH VPGHEAPMST QPKVYASVYS GVPVFEAMIR GISVMRRASD SWVNATQILK VAGVHKSART KILEKEVLNG IHEKIQGGYG KYQGTWVPLD RGRDLAEQYG VGSYLSSVFD FVPSASVIAA LPVIRTGTPD RSGQQTPSGL PGHPNQRVIS PFANHGQTTP HMPPPQFIHQ GNEQMMNLPP HPSSLAYPTQ PKPYFSMPLQ HTVGPQYDER HEGMTMTPTM SMDGLAPPAD IARMGFPYNP SDIYIDQYGQ PHATYQASPY GKESGHPSKR QRSDAEGSYI ESGAAVQQHV EQDEEADDGL DNDSTASDDA RDPPPLPSSM LLPHKPIRPK ATPANGRIKS RLVQIFNVEG QVNLRSVFGL APDQLPNFDI DMVIDDQGHS ALHWACALAR LSIVQQLIEL GADIHRGNYA GETPLIRAVL TSNHAEAGSF TDLLHLLSPS IRTLDHAYRT VLHHIALVAG VKGRVPAART YMASVLEWVA REQQANNTHS ITNPPNPADR NELAPINLRT LVDVQDVHGD TALNVAARVG NKGLVGLLLD AGADKTRANK LGLRPENFGL EIEALKISNG EAVMANLKSE VSKPERKSRD VQKNIATIFE SISSTFSSEM LAKQTKLNAT EASVRHATRA LADKRQHLHR AQEKLATMQL FEQRSENVRR IMDAIAAGTL LTPAEFTGRT QTMHEKSTGQ LPPLAFRHVP GLALDASSQS QLNGAPPSTP LSVEDQEDIA LPERDDPECL VKLRRMALWE DRIAEVLEDK IRAMEGEGVD RAVKYRKLVS VCAKVPVDKV DSMLDGLVAA VESEGQGLDF SRASNFVNRI KATKS
|
| |