Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNG01750 |
Symbol | |
ID | 3258751 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | + |
Start bp | 494725 |
End bp | 498660 |
Gene Length | 3936 bp |
Protein Length | 1028 aa |
Translation table | |
GC content | 50% |
IMG OID | 638257792 |
Product | transcription factor, putative |
Protein accession | XP_571902 |
Protein GI | 58269492 |
COG category | [K] Transcription |
COG ID | [COG5169] Heat shock transcription factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCATCCTCC ATAACCCGTT CTCGACACTC ATCAACTGCG TCCTCTGACA TCGCCCTGTT TTTCGCACTG GGACCAGCCT CGACATGTCC CGTGTGCCTT CAGGCTCTCA GCAGCCACAG TACACACAGC AGCCGCTCTA TCACCCATCC TCCGTCCACT CTCAATCGAC ACCCATACAA CAAACTCATC AGCAATATCA TCCTCATACA GGCCAGCCAA TGACCTTCCA AGGCGATGCT CACGCGTACT CCTACGACAG GCAGTCTCAG ATAAATACAT GGCAAGCAAA TCCAGGTGCG GCCGGTAGTA TGGAATACCA CAGTGCCGTT GCTGGTTCTT CGCGTGGAGC CGACCCGTAC CAGCCTTATT CCCAGTCTCA TCGCTCCTCA CCATCCCAAT CTGTGACGGC AAGTCGAACA CAACCTGCCT CAGCTCAACA ATACACCCCT TGGGCACCTG TGCAGACGAG TGCAACAACG CCATCCAGCC GAGGAGGTGG GGTTAAGCTG GAAGATTTGA TGAGTAGCGA CTCGCGTGCT GGTGGGAAAA ATCAGCCTTT ATCGGGTATC CCCCTCAGCG TCTCCGCATC TTTTGATACA CGATCACAAT TTAGTGCAGG TCCTCAGGGT GCCAGCGCAG AAAAGGAAAA AGACAAGGAA AGGGACAAGG CGACCCAGCA ACAAGGGCCT TCAGAATTCA TCAAAAAACT ATACAAGATG CTGGAAGAAG AACAGGCGCA ATACGGAAAT AGTCGGACAA AGAAGGGAGA AAAGGGAAAG AGAGGTTCAG TGGGTTGGGG AGCAAATGGG ACAAGCTTTG TGGTATGGGA TATGAACGAC TTCACTACGA AAATCCTGTA AATTTGATCC ACAACCCATT ACTGCATATA TTAATGCTAA CGCTCTGTAG GCCGCAGACC TTCAGACATT CCAATTTCTC TAGTTTTGTG CGACAGTTGA ATAAATATGG CTTTTCCAAG GTTTGTACGA CTGAAGACTA GACCGTGTGC ATATGCTAAC GTCGTGGCAG ATAAAGCACG TCGATGCTGG AACGGGTTCC ATCAAGGAAA ATGTAAATTT TTTTCTTCTT TGCTGATGCT TGATAGGATA CGTGACGCAA TCTAATGCTA ATGTTTCTGA TCGGTAGATT TGGGAGTTTC AACATCCGAA TTTCCAGGCT GGCGGTAAAT CAGATCTTGA GAGTATCAAA GTGAGTCTTC TCCGGCAGGT TGGAAGTATC GCTAACGTAG ACCAGCGCAA ACCCGTGGCG CCCAAGAAGG CGAACAACCA AGAAGGAGAT GAAAATTCTC CTCGTCATAT TGGTCTGAGC AATGAAGATC AGACCCGTAT GCACTTGATG GAAGACAGGA TCAATATGCT AGAGGATGCA CTCGCGAAAT CTAGGTCGGA AGCGGACGAG GCTAGAATGC GTGAGATGGC GATGATGGGT TTGGTAAGAG ATATGATTGG TCATATCGCC GCCAGCGAAA GCGGTAGGTT GTATCTGTTA AGCGATAGCG AATTACGCTA ATAAGTCCGA CAGAAGCGAT AGGATCACCG GCAGCTACGA CCGGCCATTC CCCTCGGGTG CTGCATCTTC TCAAATCCCT AGACAACCTC GCGAGAGCGT ACCCGCCCGA AATGTACCAC AGCAACATTG CTCGACAACC AACGACGATG CCTTACACCC CCGCCCAACC TCAAAGTCAC CAGTCCTTTC TGAATGCTGC ATACAACCCA GCTTTCAGCG GTAACAGTGT CGGCGGCGCG GTTCAAGCAC AAGCATCTCC TCATACAGAT GGTACAGGAA GTCGAGGGAG CTTTAGCGGG CCAAGCACAG GCGAGATCAA GCCTCCTGCA TCTTCTTCCC GCCTTCGAGC GGCTTCTAAC GCCGGCGCCG CTCCGTCTGG TAGCGTAAAC GCTTCTGTGC CTCTGGCGAG CCAGCCCTCT GGCAACCCGC CTGCGGGTCC TGTTGCACGC GCTGAAGATC CTGCCGAAGA TGTCATCGAG ATCCCTAACC AGATGCAAGA CATGTACGGC GGCGAGTCTG TCGGTATCGC TCCCCTTTTT GTAGAGACAC CTGCTTGGTT GACCGAAGGC ACGACGGTTC CTCTCACGAT GTACAGAAAA CAAAGTGATG GACAAGTCTT GAAAGCTGTT TACCAAGCGT TTGCAGCTGG GGGAAAATTA CCTGCTGGCG AAGGGCAAGA CGAGGATGGA AATGCGATCG CGTCAGGATC CAGTACGAGC GCGGCTGAAA TCATGGCAGG CTCAAGCCAG ATGGGTCTTC AGCAATCTTA CGGCGCTGGT TCAAGTACCT CCACTGGTAT CCCTCTTCCC ATGGACGGTT CACCATCTAG CGAGAGCAGT TCCCGCAAGG GCAAGGGCAA GAAGTCTCGC GGAACAAGTC TTAAAGAACA ACGTGAATCC AAGCTCAAGC CGCATTGGGC CCAGACGCCC AGAATTCTTG TCGTTGAGGA CGATATTGTT TACCGAACGC TATCGAAGAA GTTTTTGCAG AAGTTTGGCT GTGAGACGGA GACAGTAGAG AATGCGCAGG GTGCGGTGGA TAAAATGAAC GGGACAAAAT ATGATTTGGT GTTGATGGAC ATTTTCTTTG GGCCTAATAT GGATGGGTGA GTTGCAACCA CTTGGGCAAC GTTCGAGTCG TTGACGTAGA CTTGTAGACG AAAAGCTACC TCCTTGATTC GACAGTTCAA TAACTATACC CCTATTATTT CTATGACTTC CAATGCTCAA CCCCGTGAGT TTAACTTGCT GTATGACGTT GCGTATTCTG ACAATCAAGC AGAGGACGTG GATTCTTACT ATCAATCGGG TATGAACGAC ATCCTCGCAA AACCCTTTAC CAAAAACCAC CTTTTCACTA TCCTCGACAA GCACCTTGTC CACCTTCGAC ATGCTCAACT TTACGAGCGA CTTATCCCCC TCGGTGTCGG TGTCCCTCCT TTATCTGATC AGCATGTCCA GGAGGCCTTG GCTGTCAGCG CCGCGAATTT ACAAAACACC AATGGCCAGT TGCAGCTGGA GAACGGGCCA ACAGCGGAAA CGCAGAATGA GCAGCAAGAT TCAAACGGAG AGCTGGAGCT AGGGATTAGG AACCCATTGG CGGGAAGCGG TTGGAGTGAT GAAGCATATC AACTTGTATT AGCTGTGAGT AAACACTTGT AATCGGGAAC CCGTCCGTGA CAGAAATTGA TACGAAACTT AGCAATTCCT CCAAACTGGG GTGATGCCTG ATATCACGAC CATCTCTGCA GTGTCGTCCT CTCCTAATCC CAATGGCATC GACGGCACGA TCGGTACCAG CATCGTCTTT GGAGAATCTT CTGGTTTTAG CCGTAAACGA TCTATTGAAG CCATCAGTGA TGACAACTGG GACGGTCAAG CTCAAACATC CGCTTCTGTT CAAGCTCAAG CGCAAGCGCA GGCTCAGGCT CAGGCTCAGG CTCAGGCTCA AGCGCAAGCG CAGGCTCAGG CTCAGGCTCA GGCTCAAACT CAAGCCCAGG CTCAGAACAC GATCATGCAA GGTGATCAGC AGCAGGGCAT GTCATTACAG ATGCCAATGA ATGTGGGGAT GGGTATGGAA GTGAATATGG GCAACATGCC GGAGTCTGGT ATGATGAACC AAGATATAAG TAATGGCTCT GATGGCAGAG AGGCCAAGCG AGCTCGAGGT ATGGCGGGAT AGTTACAGCG TTTCCAGGTG GTGTTTGTCC CGATTATAAG CGATGAGTGA AATGGGACAA GTATACTCTT GAGAACATTT GTAGCAATTA TCGTGAAAGA ATCATAGGTT TTTTTGTTTG TGACCTTTTG GTCGCATATA GACGAAGCGG CAAAGCGCGT GTACTTTCTA TTATCTTATA TTTTGATATG TAGCGTAGGC CATGTGTTTA TGCACTATCT CATTCG
|
Protein sequence | MSRVPSGSQQ PQYTQQPLYH PSSVHSQSTP IQQTHQQYHP HTGQPMTFQG DAHAYSYDRQ SQINTWQANP GAAGSMEYHS AVAGSSRGAD PYQPYSQSHR SSPSQSVTAS RTQPASAQQY TPWAPVQTSA TTPSSRGGGV KLEDLMSSDS RAGGKNQPLS GIPLSVSASF DTRSQFSAGP QGASAEKEKD KERDKATQQQ GPSEFIKKLY KMLEEEQAQY GNSRTKKGEK GKRGSVGWGA NGTSFVVWDM NDFTTKILPQ TFRHSNFSSF VRQLNKYGFS KIWEFQHPNF QAGGKSDLES IKRKPVAPKK ANNQEGDENS PRHIGLSNED QTRMHLMEDR INMLEDALAK SRSEADEARM REMAMMGLVR DMIGHIAASE SDNLARAYPP EMYHSNIARQ PTTMPYTPAQ PQSHQSFLNA AYNPAFSGNS VGGAVQAQAS PHTDGTGSRG SFSGPSTGEI KPPASSSRLR AASNAGAAPS GSVNASVPLA SQPSGNPPAG PVARAEDPAE DVIEIPNQMQ DMYGGESVGI APLFVETPAW LTEGTTVPLT MYRKQSDGQV LKAVYQAFAA GGKLPAGEGQ DEDGNAIASG SSTSAAEIMA GSSQMGLQQS YGAGSSTSTG IPLPMDGSPS SESSSRKGKG KKSRGTSLKE QRESKLKPHW AQTPRILVVE DDIVYRTLSK KFLQKFGCET ETVENAQGAV DKMNGTKYDL VLMDIFFGPN MDGRKATSLI RQFNNYTPII SMTSNAQPQD VDSYYQSGMN DILAKPFTKN HLFTILDKHL VHLRHAQLYE RLIPLGVGVP PLSDQHVQEA LAVSAANLQN TNGQLQLENG PTAETQNEQQ DSNGELELGI RNPLAGSGWS DEAYQLVLAQ FLQTGVMPDI TTISAVSSSP NPNGIDGTIG TSIVFGESSG FSRKRSIEAI SDDNWDGQAQ TSASVQAQAQ AQAQAQAQAQ AQAQAQAQAQ AQAQTQAQAQ NTIMQGDQQQ GMSLQMPMNV GMGMEVNMGN MPESGMMNQD ISNGSDGREA KRARGMAG
|
| |