Gene CNG01750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNG01750 
Symbol 
ID3258751 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006692 
Strand
Start bp494725 
End bp498660 
Gene Length3936 bp 
Protein Length1028 aa 
Translation table 
GC content50% 
IMG OID638257792 
Producttranscription factor, putative 
Protein accessionXP_571902 
Protein GI58269492 
COG category[K] Transcription 
COG ID[COG5169] Heat shock transcription factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGCATCCTCC ATAACCCGTT CTCGACACTC ATCAACTGCG TCCTCTGACA TCGCCCTGTT 
TTTCGCACTG GGACCAGCCT CGACATGTCC CGTGTGCCTT CAGGCTCTCA GCAGCCACAG
TACACACAGC AGCCGCTCTA TCACCCATCC TCCGTCCACT CTCAATCGAC ACCCATACAA
CAAACTCATC AGCAATATCA TCCTCATACA GGCCAGCCAA TGACCTTCCA AGGCGATGCT
CACGCGTACT CCTACGACAG GCAGTCTCAG ATAAATACAT GGCAAGCAAA TCCAGGTGCG
GCCGGTAGTA TGGAATACCA CAGTGCCGTT GCTGGTTCTT CGCGTGGAGC CGACCCGTAC
CAGCCTTATT CCCAGTCTCA TCGCTCCTCA CCATCCCAAT CTGTGACGGC AAGTCGAACA
CAACCTGCCT CAGCTCAACA ATACACCCCT TGGGCACCTG TGCAGACGAG TGCAACAACG
CCATCCAGCC GAGGAGGTGG GGTTAAGCTG GAAGATTTGA TGAGTAGCGA CTCGCGTGCT
GGTGGGAAAA ATCAGCCTTT ATCGGGTATC CCCCTCAGCG TCTCCGCATC TTTTGATACA
CGATCACAAT TTAGTGCAGG TCCTCAGGGT GCCAGCGCAG AAAAGGAAAA AGACAAGGAA
AGGGACAAGG CGACCCAGCA ACAAGGGCCT TCAGAATTCA TCAAAAAACT ATACAAGATG
CTGGAAGAAG AACAGGCGCA ATACGGAAAT AGTCGGACAA AGAAGGGAGA AAAGGGAAAG
AGAGGTTCAG TGGGTTGGGG AGCAAATGGG ACAAGCTTTG TGGTATGGGA TATGAACGAC
TTCACTACGA AAATCCTGTA AATTTGATCC ACAACCCATT ACTGCATATA TTAATGCTAA
CGCTCTGTAG GCCGCAGACC TTCAGACATT CCAATTTCTC TAGTTTTGTG CGACAGTTGA
ATAAATATGG CTTTTCCAAG GTTTGTACGA CTGAAGACTA GACCGTGTGC ATATGCTAAC
GTCGTGGCAG ATAAAGCACG TCGATGCTGG AACGGGTTCC ATCAAGGAAA ATGTAAATTT
TTTTCTTCTT TGCTGATGCT TGATAGGATA CGTGACGCAA TCTAATGCTA ATGTTTCTGA
TCGGTAGATT TGGGAGTTTC AACATCCGAA TTTCCAGGCT GGCGGTAAAT CAGATCTTGA
GAGTATCAAA GTGAGTCTTC TCCGGCAGGT TGGAAGTATC GCTAACGTAG ACCAGCGCAA
ACCCGTGGCG CCCAAGAAGG CGAACAACCA AGAAGGAGAT GAAAATTCTC CTCGTCATAT
TGGTCTGAGC AATGAAGATC AGACCCGTAT GCACTTGATG GAAGACAGGA TCAATATGCT
AGAGGATGCA CTCGCGAAAT CTAGGTCGGA AGCGGACGAG GCTAGAATGC GTGAGATGGC
GATGATGGGT TTGGTAAGAG ATATGATTGG TCATATCGCC GCCAGCGAAA GCGGTAGGTT
GTATCTGTTA AGCGATAGCG AATTACGCTA ATAAGTCCGA CAGAAGCGAT AGGATCACCG
GCAGCTACGA CCGGCCATTC CCCTCGGGTG CTGCATCTTC TCAAATCCCT AGACAACCTC
GCGAGAGCGT ACCCGCCCGA AATGTACCAC AGCAACATTG CTCGACAACC AACGACGATG
CCTTACACCC CCGCCCAACC TCAAAGTCAC CAGTCCTTTC TGAATGCTGC ATACAACCCA
GCTTTCAGCG GTAACAGTGT CGGCGGCGCG GTTCAAGCAC AAGCATCTCC TCATACAGAT
GGTACAGGAA GTCGAGGGAG CTTTAGCGGG CCAAGCACAG GCGAGATCAA GCCTCCTGCA
TCTTCTTCCC GCCTTCGAGC GGCTTCTAAC GCCGGCGCCG CTCCGTCTGG TAGCGTAAAC
GCTTCTGTGC CTCTGGCGAG CCAGCCCTCT GGCAACCCGC CTGCGGGTCC TGTTGCACGC
GCTGAAGATC CTGCCGAAGA TGTCATCGAG ATCCCTAACC AGATGCAAGA CATGTACGGC
GGCGAGTCTG TCGGTATCGC TCCCCTTTTT GTAGAGACAC CTGCTTGGTT GACCGAAGGC
ACGACGGTTC CTCTCACGAT GTACAGAAAA CAAAGTGATG GACAAGTCTT GAAAGCTGTT
TACCAAGCGT TTGCAGCTGG GGGAAAATTA CCTGCTGGCG AAGGGCAAGA CGAGGATGGA
AATGCGATCG CGTCAGGATC CAGTACGAGC GCGGCTGAAA TCATGGCAGG CTCAAGCCAG
ATGGGTCTTC AGCAATCTTA CGGCGCTGGT TCAAGTACCT CCACTGGTAT CCCTCTTCCC
ATGGACGGTT CACCATCTAG CGAGAGCAGT TCCCGCAAGG GCAAGGGCAA GAAGTCTCGC
GGAACAAGTC TTAAAGAACA ACGTGAATCC AAGCTCAAGC CGCATTGGGC CCAGACGCCC
AGAATTCTTG TCGTTGAGGA CGATATTGTT TACCGAACGC TATCGAAGAA GTTTTTGCAG
AAGTTTGGCT GTGAGACGGA GACAGTAGAG AATGCGCAGG GTGCGGTGGA TAAAATGAAC
GGGACAAAAT ATGATTTGGT GTTGATGGAC ATTTTCTTTG GGCCTAATAT GGATGGGTGA
GTTGCAACCA CTTGGGCAAC GTTCGAGTCG TTGACGTAGA CTTGTAGACG AAAAGCTACC
TCCTTGATTC GACAGTTCAA TAACTATACC CCTATTATTT CTATGACTTC CAATGCTCAA
CCCCGTGAGT TTAACTTGCT GTATGACGTT GCGTATTCTG ACAATCAAGC AGAGGACGTG
GATTCTTACT ATCAATCGGG TATGAACGAC ATCCTCGCAA AACCCTTTAC CAAAAACCAC
CTTTTCACTA TCCTCGACAA GCACCTTGTC CACCTTCGAC ATGCTCAACT TTACGAGCGA
CTTATCCCCC TCGGTGTCGG TGTCCCTCCT TTATCTGATC AGCATGTCCA GGAGGCCTTG
GCTGTCAGCG CCGCGAATTT ACAAAACACC AATGGCCAGT TGCAGCTGGA GAACGGGCCA
ACAGCGGAAA CGCAGAATGA GCAGCAAGAT TCAAACGGAG AGCTGGAGCT AGGGATTAGG
AACCCATTGG CGGGAAGCGG TTGGAGTGAT GAAGCATATC AACTTGTATT AGCTGTGAGT
AAACACTTGT AATCGGGAAC CCGTCCGTGA CAGAAATTGA TACGAAACTT AGCAATTCCT
CCAAACTGGG GTGATGCCTG ATATCACGAC CATCTCTGCA GTGTCGTCCT CTCCTAATCC
CAATGGCATC GACGGCACGA TCGGTACCAG CATCGTCTTT GGAGAATCTT CTGGTTTTAG
CCGTAAACGA TCTATTGAAG CCATCAGTGA TGACAACTGG GACGGTCAAG CTCAAACATC
CGCTTCTGTT CAAGCTCAAG CGCAAGCGCA GGCTCAGGCT CAGGCTCAGG CTCAGGCTCA
AGCGCAAGCG CAGGCTCAGG CTCAGGCTCA GGCTCAAACT CAAGCCCAGG CTCAGAACAC
GATCATGCAA GGTGATCAGC AGCAGGGCAT GTCATTACAG ATGCCAATGA ATGTGGGGAT
GGGTATGGAA GTGAATATGG GCAACATGCC GGAGTCTGGT ATGATGAACC AAGATATAAG
TAATGGCTCT GATGGCAGAG AGGCCAAGCG AGCTCGAGGT ATGGCGGGAT AGTTACAGCG
TTTCCAGGTG GTGTTTGTCC CGATTATAAG CGATGAGTGA AATGGGACAA GTATACTCTT
GAGAACATTT GTAGCAATTA TCGTGAAAGA ATCATAGGTT TTTTTGTTTG TGACCTTTTG
GTCGCATATA GACGAAGCGG CAAAGCGCGT GTACTTTCTA TTATCTTATA TTTTGATATG
TAGCGTAGGC CATGTGTTTA TGCACTATCT CATTCG
 
Protein sequence
MSRVPSGSQQ PQYTQQPLYH PSSVHSQSTP IQQTHQQYHP HTGQPMTFQG DAHAYSYDRQ 
SQINTWQANP GAAGSMEYHS AVAGSSRGAD PYQPYSQSHR SSPSQSVTAS RTQPASAQQY
TPWAPVQTSA TTPSSRGGGV KLEDLMSSDS RAGGKNQPLS GIPLSVSASF DTRSQFSAGP
QGASAEKEKD KERDKATQQQ GPSEFIKKLY KMLEEEQAQY GNSRTKKGEK GKRGSVGWGA
NGTSFVVWDM NDFTTKILPQ TFRHSNFSSF VRQLNKYGFS KIWEFQHPNF QAGGKSDLES
IKRKPVAPKK ANNQEGDENS PRHIGLSNED QTRMHLMEDR INMLEDALAK SRSEADEARM
REMAMMGLVR DMIGHIAASE SDNLARAYPP EMYHSNIARQ PTTMPYTPAQ PQSHQSFLNA
AYNPAFSGNS VGGAVQAQAS PHTDGTGSRG SFSGPSTGEI KPPASSSRLR AASNAGAAPS
GSVNASVPLA SQPSGNPPAG PVARAEDPAE DVIEIPNQMQ DMYGGESVGI APLFVETPAW
LTEGTTVPLT MYRKQSDGQV LKAVYQAFAA GGKLPAGEGQ DEDGNAIASG SSTSAAEIMA
GSSQMGLQQS YGAGSSTSTG IPLPMDGSPS SESSSRKGKG KKSRGTSLKE QRESKLKPHW
AQTPRILVVE DDIVYRTLSK KFLQKFGCET ETVENAQGAV DKMNGTKYDL VLMDIFFGPN
MDGRKATSLI RQFNNYTPII SMTSNAQPQD VDSYYQSGMN DILAKPFTKN HLFTILDKHL
VHLRHAQLYE RLIPLGVGVP PLSDQHVQEA LAVSAANLQN TNGQLQLENG PTAETQNEQQ
DSNGELELGI RNPLAGSGWS DEAYQLVLAQ FLQTGVMPDI TTISAVSSSP NPNGIDGTIG
TSIVFGESSG FSRKRSIEAI SDDNWDGQAQ TSASVQAQAQ AQAQAQAQAQ AQAQAQAQAQ
AQAQTQAQAQ NTIMQGDQQQ GMSLQMPMNV GMGMEVNMGN MPESGMMNQD ISNGSDGREA
KRARGMAG