Gene Information |
Locus tag | CNK02410 |
Symbol | |
ID | 3254666 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | + |
Start bp | 699170 |
End bp | 702982 |
Gene Length | 3813 bp |
Protein Length | 1101 aa |
Translation table | |
GC content | 53% |
IMG OID | 638253733 |
Product | general transcriptional repressor, putative |
Protein accession | XP_567714 |
Protein GI | 58260608 |
COG category | [R] General function prediction only |
COG ID | [COG0457] FOG: TPR repeat |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCCCGAAT CACGCATGGT TCATCCCCAC TCTCACCATC TTCACTCCGC CCACCACCAC TCTTCTCATC CACACTCTTA TCAACATCCA CCCCCGCCCA ACTACTCTTC GTCTCACCCG TCGTATCCCC ATCACCACCA TACCCCACGA CATGTCTCAA ACGGTCACTC TTCTCAAGTA CCCCTATCCG GGCCGCCCCC TCCTCAAATG CCCATTGCTG GCCCTCCGGC TGATCCCGTT GGGCCACCTG CTATAGCGAC CTCAAATGCC GGTAGCACTC GTTCTAGAAG CGTGCCGCCG ATGAGTAGTG AGGCAAGAGC AGCAAAGGAA AAGATGGACA ATATTCTGGC GCAATTGGCT GCCGCTAATG AGAATACATG GATGTTGATA GGTCGGTCGC CTTCTTCGCG AAAGGACCAT ATGAGCAAAG CCACTGACGG TTCGTAGGCG CTGTCGCTGA GGGGATGAAT GATCAAGACC GAGCGCTTTC CGCCTTTGAG AATGCACTTA GGCATAACCC TTCATCTGTT CTCGGCCTGA ATGCCGTCGC GTCCATTGCA CGAGGCAGAG ACGATTTTGA CAAGGCAATT GAGTACTTCC AACGCATCCT CAACGCGAAC CCTGAGAACG GTGAAGTATG GGGATCAATG GGTAAGTATT TGAAACTTCT GCATCAATCG ATTGTTGCGA TGCTGATAAA CGTCTCAGGG CACTGTCTTT TGATGAAAGA TGATCTTCCA AAGGCTTACA CATCGTATCA ACAAGCGCTG TACCATCTTG CTAATCCCAA AGTCAGTACG CAACTTCTGT ACAAGTGTCT AGTATTGACA ATCTCTGTAC AGGAACCCAA GCTTTGGTAT GGTATTGGTA TATTGTACGA TCGATACGGC TCTTTCGAAC ATGCCGAGGA GGCTTTCTCG AGTGTCCTTA AAGTTGATCC TAGTAAGTGA ATTCGATTAT CGTTCAAGAC GTTGTGATTG ACTCTTTGAA GACTTTGAAA AAGCCAACGA GATCTACTTC CGTTTGGGTA TCATCTACAA ACATCAACGC AAATACAAGT CCTCTCTTGA TGTGAGTGTT CTGTAGTGCA ATCATCGTAC TTCTTTGACG TCGAGCTATA GTGCTTCCGA TACATTCTTA ACAATCCTCC TCGACCTCTG ACATCTTGGG ATATATGGTT CCAGCTTGGA CACGTATACG AGCAAGATCG CGATTTCGAA GCTGCGAGGG ATGCTTACAT GAGAGTGCTC AGCCATCAAC CAGATCATGC CAAAGTTCTC CAGCAGCTTG GCTGGCTTTA TCACCAGCCT GGGGCCCACT TTGCTGATCA AGAAAAGGCC GTATCATATT TGACCAAGAG TCTTGAAACC GACCCGTCGG ATGCACAAAG CTGGTACCTT CTTGGACGTG CATACATGGC CGCCCAACGT TACAATAAGG CCTATGAAGC ATATCAACAG GCAGTGTATC GAGATGGTCG CAATCCCACC TTCTGGTGCT CAATCGGTGT CCTGTACTAT CAAATTGCGC AGTATCGTGA CGCTCTTGAC GCTTATTCGC GTGCCATCCG ACTCAACCCC TACATCAGCG AGGTCTGGTA TAATTTGGGA AGTCTTTACG AGTCGTGCAA TAATCAAATG GCTGATGCGA TGGATGCTTA TTCTCGCGCC CTCGAACTTG ATCCCAATAA CACCGTTATC AAGCAGCGTA TGGCTTTACT CCAAAATCCC AACGGCGGTC CTCTGCCTCC CGTACCTCCT CCAATTGACG TTCACCCATC TCAATACACT GCTACTCCTA 
CCGCTCAGCC TCCAGCTGGA TCACCAAATG CTTCGCCAAG CCACCAGCTT CCATCAGATG CTCATGCAGG CGGACGAGAC CTTCCACCTC CACCTCCGGG AGGCGACATT GCTCGTGGCC ACTCCCCTGG TCCATTCCGA AACGGTGCTG CCCCTCCACC ACTTAATCAC GTTGATGAGC CTCGAGGCCC TGTGCCCGGT ATGACTCAGC TAGCTAGAAT GGAGACTGAG CCGCGCTCCG CTGAAATTCG GGATGAACGA TACAACGGAC GTTACGATCC TGCCGATATC CGACGCCACA ATGGTTCACC ATTGTCCCCT CGTGCTTCAC GACGCGAAAG TCAGGCTTTC CCAAGCCAGC CGAGCTACTT CCCGTCTAAC GGTGCGCACG CGCGTGACAG GGAGAGGGAA GAGTGGGACA GGTCTCGAGG TGGATCTAGG GCTCCTCAGG GGCACTCACC TAGATTGGGC GATCGTACAC CTGGCGCCGA CCCCAGGGTT CCTCAAGAAT ACCGCGATTA TCCTGGGTAT TACGATCCTC GTACGGCCTA TCCCGCTCCT TTGGGCGGTC CACCCACGCC ATCAGCTATG CCCGGAAGGT TTGATCCCAG ACGAGAAGCT GAGGAGTTCC GCCGTCGTGA AGAAGATCGA GAAGTCAACG GTGCCAAGCT GGCCAGTTCA GCGAGACAAG AGAACCGAAT GCCCAGTCCG GCGCCATCGA TTGCATCAAG TAAGAACGGG AGGAAGCGTG GAGAAGGCAC GAGGAAAGCA AAGGATAAGG AAGAAAAGGC TCTGAACAAG AAAGAAGGTG GAAGGAAGGG AAAGGCCGCG GGGTTGAAGG CCGGTGAAGA AATTACCGGA CAGTCACCAA GAGGAAACGT CAAAAGTCCT TCTGTCAGTG TGCACAACAC GCCGCAGTTG AACACCGCCC GGCCAATCCC CCCTTCTGCT CCAGCGCCAG TACCTTTGAT GTCTCGCACT GTGGATGAAG GTAAATTCAA TGTTTTGTTT ATGCGAAATG GCCATGACTG ATCATGTCTT TAGACTACGA CGAGGGCGCA GCCGATGCGC TTATGGTTCT TGCGGGCGAT CGCAGCACTA CCACACTCCC TCTGCCTGTT CGTCATATAA CCCCCACTCC TTCAGGCCCC ATGTCGCCTC CGCCTCCCGC TCCCAATACT GGCCCTGCCA CCGGAGCCAA ACGTCCTGGT CCAGAAGCCA ACTCACCGGA ACAAGCCAAC AAACGGGTCA AAGCAGAGAA GTCTCAGTCA CCAGTTGATT CCGCGTCAGG TTCTGCTTCC AGTCGAGTGC CGAAGCGAAT GGTGATAGAA GTGCTCAACA CACCCAGCAT AGCCAGTCCC TTACCTCGGA CGTCTTCAGC GGTAAACGAA GAAAAACAGT CCCGAGCGTC TCAAGAGCGA AGGGAAGAGA GGAGGGAAGG GACGATTTCT GGAACGGAGA TTGACGAGCG TACTGCTCAG ACATCAACTC CTTTACCACC CCCAGCCCCA CTGTCTCCCG CAAACCGTCT TGCACCCTCT TCATCCGCTC ACACTCCTCA ATCGCCTCCT CGTCAGGCTT CATCTCCACC TCCCGAAGCC GAGAAAGAAG CGTCCCGACC TCCTACACCT CCTCTACCAG ATGAGCCACC CTCTCCAGCA GCTCCGGCTG CTGCTGTGCG AGTCGAGTCT TCGAGAGACT CGGACCCTCG CACTCCTCCT TTGCCGCCTT CCCAGACTTT CGAGAGTTCA TCCATGAAGA CTCCAACAAG TGCTGGGGAT CGTGGAGATG TCGATATGGC CGATGCCGGC 
ACGGAGAGAT GAGGCGAGGA AAAGAACGAC TTTGCCGCTG AGGATGGACG AATGTGAGGT GTTTGAAAAG CAACTAAGAG GTTTGAGGGG AGGATTAATT GTGACAGATG AGCCGAGAAG ATCATGAAAC AAGTAGTGTT TTTTCTTTGT CGACTAGGGT CTACTGTAGT GAGGCTTATA GTTTAGAGGC ATGGATGAAG CAG
Protein sequence | MPESRMVHPH SHHLHSAHHH SSHPHSYQHP PPPNYSSSHP SYPHHHHTPR HVSNGHSSQV PLSGPPPPQM PIAGPPADPV GPPAIATSNA GSTRSRSVPP MSSEARAAKE KMDNILAQLA AANENTWMLI GAVAEGMNDQ DRALSAFENA LRHNPSSVLG LNAVASIARG RDDFDKAIEY FQRILNANPE NGEVWGSMGH CLLMKDDLPK AYTSYQQALY HLANPKEPKL WYGIGILYDR YGSFEHAEEA FSSVLKVDPN FEKANEIYFR LGIIYKHQRK YKSSLDCFRY ILNNPPRPLT SWDIWFQLGH VYEQDRDFEA ARDAYMRVLS HQPDHAKVLQ QLGWLYHQPG AHFADQEKAV SYLTKSLETD PSDAQSWYLL GRAYMAAQRY NKAYEAYQQA VYRDGRNPTF WCSIGVLYYQ IAQYRDALDA YSRAIRLNPY ISEVWYNLGS LYESCNNQMA DAMDAYSRAL ELDPNNTVIK QRMALLQNPN GGPLPPVPPP IDVHPSQYTA TPTAQPPAGS PNASPSHQLP SDAHAGGRDL PPPPPGGDIA RGHSPGPFRN GAAPPPLNHV DEPRGPVPGM TQLARMETEP RSAEIRDERY NGRYDPADIR RHNGSPLSPR ASRRESQAFP SQPSYFPSNG AHARDREREE WDRSRGGSRA PQGHSPRLGD RTPGADPRVP QEYRDYPGYY DPRTAYPAPL GGPPTPSAMP GRFDPRREAE EFRRREEDRE VNGAKLASSA RQENRMPSPA PSIASSKNGR KRGEGTRKAK DKEEKALNKK EGGRKGKAAG LKAGEEITGQ SPRGNVKSPS VSVHNTPQLN TARPIPPSAP APVPLMSRTV DEDYDEGAAD ALMVLAGDRS TTTLPLPVRH ITPTPSGPMS PPPPAPNTGP ATGAKRPGPE ANSPEQANKR VKAEKSQSPV DSASGSASSR VPKRMVIEVL NTPSIASPLP RTSSAVNEEK QSRASQERRE ERREGTISGT EIDERTAQTS TPLPPPAPLS PANRLAPSSS AHTPQSPPRQ ASSPPPEAEK EASRPPTPPL PDEPPSPAAP AAAVRVESSR DSDPRTPPLP PSQTFESSSM KTPTSAGDRG DVDMADAGTE R