Gene Information |
Locus tag | CNB00820 |
Symbol | |
ID | 3255888 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | - |
Start bp | 234559 |
End bp | 237545 |
Gene Length | 2987 bp |
Protein Length | 754 aa |
Translation table | |
GC content | 49% |
IMG OID | 638254734 |
Product | transcription factor, putative |
Protein accession | XP_569090 |
Protein GI | 58263360 |
COG category | [R] General function prediction only |
COG ID | [COG0666] FOG: Ankyrin repeat |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.140572 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACGAAACAT CCTCCATCTT CTCTTCCCAT CTATCTTTTG CCCATATACA CCACAAAGCG GGGAACAAGC ATCTCACAGC CACATTCGGC GACACAGATT TACGTAGTCG CCTCTTTTCC CACCCCCTTC TACCTCCAAG TCTCAACCCG AGACACGTTC CGCCATGGGC AAGAAGGTCA TCGCCTCTGG TGGGGATAAT GGGCCCAATA CCATCTACAA GGTGCGCATC TACACCAATC CTATCTGGAT GGCATCCTGG ATTGCTTTCA AAGCATTGCT AACTCGATGT CCCGTTCTAC AGGCCACATA TAGGTGCGCA GCATTCCTTT AATTCGAGCG GCGCGCCGCC CGATTGTATG TGTCAAGTTT GCTAACCTTG AACTCAATCT CTTTGATAGT GGAGTGTACG TACCATATCC TTTGTTTACC TGCTTCTAAA CTGACAGTGT TTCCAGGCCC GTATACGAGA TGGTCTGTCG CGATGTCGCG GTTATGCGCA GACGTTCAGA CGCGTACCTG AACGCGACTC AAATTCTGAA AGTAGCCGGT TTCGACAAGC CTCAGCGAAC ACGAGTTTTG GAGAGAGAAG TGCAAAAGGG AGAGCATGAA AAGGTTCAGG GTGGCTACGG AAAATATCAA GGTTAGTCTA CTTTTTTTCC TTCAAATGCT CTGTTTTGTC TTGTCACCCA ATCGCGTAAG CTGACTCGAC GTAGGTACCT GGATTCCTAT CGAGCGCGGT CTTGCTCTTG CCAAGCAATA TGGTGTTGAA GACATTCTCC GACCCATCAT TGACTACGTT CCTACCTCTG TATCTCCTCC CCCTGCCCCT AAACACTCTG TCGCGCCTCC TTCGAAAGCC CGCAGGGACA AGGAAAAAGA AACCGGTCGA ACCAAGGCTA CTCCTTCACG AACCGGGCCA ACATCAGCAG CTGCTCTTCA AGCTCAAGCA CAACTTAATC GTGCCAAGAT GCATGATTCC ACTCCCGACG CTGATGCTAG CTTCCGCTCT TTCGAGGAGA GAGTCAGCTT AACGCCTGAA GATGATTCGA GCAGTGATAC ACCGAGCCCA GTCGCGAGTG TTATGACTGA CCAGGACATG GAAGTCGATA AGATGGGGAT GCACATGAGC ATGCCCAACG TGACACTGTC CCAAAATATG GAGGAACTGG GAGCTGGCTC AAGAAAACGT AGCGCCGCAA TGATGATGGA AGATGAAGAC CAATTTGGCC AGCTCCGGTC CATCAGGGGT AATAGCGCTG TACACACTCC TCACGGTACT CCTCGACATC TTGGTATCGG TATGCCCCCG GAACCAATCG GCCCGGAGCA ATACACCGAT ATTATCCTTA ACTACTTCGT CTCTGAAACC TCGCAAATAC CGTCTATCCT CGTCAGCCCT CCTCACGACT TTGATCCTAA TGCTCCCATT GACGATGACG GCCACACCGC GCTTCACTGG GCTTGTGCCA TGGGTCGAGT ACGCGTTGTC AAGCTGCTTC TCACTGCAGG CGCGTCAATC TTTGCTGGTA ATAATGCCGA ACAAACTCCT CTTATGCGCA GCGTCATGTT TTCAAATAAC TATGACATGC GTAAATTCCC CGAGCTTTAC GAACTTCTTC ACCGATCTAC TCTTAATATT GACAAGCAAA ATCGAACCGT TTTCCACCAC ATCGCCAATC TTGCCCTAAC AAAAGGCAAA ACTCATGCCG CCAAGTACTA CATGGAGACT ATCCTCGCGC GTTTGGCCGA CTACCCTCAA GAACTTGCCG ACGTGATCAA CTTTCAAGAT GAAGAAGGTG 
AAACTGCTTT AACTATTGCT GCGCGTGCCA GAAGCCGTCG ACTGGTGAAG GCTCTGCTCG ACCACGGTGC CAATCCCAAG ATCAAGAACC GTGACTCCCG CTCAGCTGAA GATTATATCC TCGAGGATGA GCGATTCCGT TCATCTCCCG TTCCAGCTCC CAACGGTGGC ATCGGTAAAG CTAGCACCTC TGCTGCCGCC GAAAAACCTC TCTTTGCTCC TCAGTTGTAC TTCTCCGAAG CGGCCAGGTT ATGTGGCGGC CAAGCATTAA CCGACATCAC TTCCCACATG CAGTCACTCG CACGATCTTT CGACGCTGAA TTGCAAGGCA AAGAACGAGA CATTCTCCAA GCCAAGGCTC TTCTTACCAA CATCCATACT GAGGTTACCG AAAATGGTCG ATCAATCACT GCTATCACCA ATCAAGCGGC TCCCCTTGAA GAAAAACGAC GTGAGCTTGA GGCTCTACAA GCATCTCTGA AGACAAGAGT AAAGGACGCT TTGAAGAAGG GTTATATCGG GTGGCTTGAG GGCGAACTGG TAAGGGAACA ACGATGGGAG AACGGTGAGC TCGAGGGAAA TGAAGAGGAG AAGGCGGCTG TTCAGGCATT AAGGGATGTT CCTACCGGTG GTCAGGAGGT TGTTCAGGCC GAGGAGGAAA AGTTAAGATG GGAGATTGAG GAGAAGAGGA AGCGAAGGGC TATGTTTGTG GAAAAATTTG TCAGAGCACA GACCGAAGTA AGTTTCTGGG CAATGTTGAA GTAGAGCAAT GCTCATAATT TTGCAGGCTG GTACAAGTGA ACAGATTGCC AAGTACAGGA AACTGGTATC CGCTGGGCTC GGAGGTGTTT CAACAAATGA AGTAGATGAG TTGATGAACC AGTTATTAGA AGTAGGTTCT GCGATCTACT AGCCTAATGA GAGTTGAACT GATCTTCCTT GCAGGGTCTC GAAGAGGAGA ATGATAATCA AGTGTACAAC ACAACCGCTG GAGAATCAGG TCCTTCATCA TGGGTGCAGT AATATGGTCA TTGGGGATGA AGGGAAGGAA GGAATCATGT GGTCAATAAT TGGAAGTTCT CAGATCTCTG TTCTGTATTA CCAAAAGGTT TCTGCACATG ATGTGACTTG GTCTTGGGTC TCTTAAGTGG TCTTTTACTT TCTAGTAACT ATGCGAATGC AAAATGC
|
Protein sequence | MGKKVIASGG DNGPNTIYKA TYSGVPVYEM VCRDVAVMRR RSDAYLNATQ ILKVAGFDKP QRTRVLEREV QKGEHEKVQG GYGKYQGTWI PIERGLALAK QYGVEDILRP IIDYVPTSVS PPPAPKHSVA PPSKARRDKE KETGRTKATP SRTGPTSAAA LQAQAQLNRA KMHDSTPDAD ASFRSFEERV SLTPEDDSSS DTPSPVASVM TDQDMEVDKM GMHMSMPNVT LSQNMEELGA GSRKRSAAMM MEDEDQFGQL RSIRGNSAVH TPHGTPRHLG IGMPPEPIGP EQYTDIILNY FVSETSQIPS ILVSPPHDFD PNAPIDDDGH TALHWACAMG RVRVVKLLLT AGASIFAGNN AEQTPLMRSV MFSNNYDMRK FPELYELLHR STLNIDKQNR TVFHHIANLA LTKGKTHAAK YYMETILARL ADYPQELADV INFQDEEGET ALTIAARARS RRLVKALLDH GANPKIKNRD SRSAEDYILE DERFRSSPVP APNGGIGKAS TSAAAEKPLF APQLYFSEAA RLCGGQALTD ITSHMQSLAR SFDAELQGKE RDILQAKALL TNIHTEVTEN GRSITAITNQ AAPLEEKRRE LEALQASLKT RVKDALKKGY IGWLEGELVR EQRWENGELE GNEEEKAAVQ ALRDVPTGGQ EVVQAEEEKL RWEIEEKRKR RAMFVEKFVR AQTEAGTSEQ IAKYRKLVSA GLGGVSTNEV DELMNQLLEG LEEENDNQVY NTTAGESGPS SWVQ
|