Gene CNB00120 details


Gene Information

Locus tag: CNB00120
Symbol:
ID: 3255945
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006684
Strand:
Start bp: 34177
End bp: 36659
Gene Length: 2483 bp
Protein Length: 783 aa
Translation table:
GC content: 51%
IMG OID: 638254665
Product: heat shock transcription factor 2, putative
Protein accession: XP_568758
Protein GI: 58262696
COG category: [K] Transcription
COG ID: [COG5169] Heat shock transcription factor
TIGRFAM ID:
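
The Gene Length value agrees with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. That convention is an assumption here, not something the record states, and the helper name `feature_length` is hypothetical. A minimal sketch of the arithmetic:

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Start bp and End bp as listed in the record.
gene_length = feature_length(34177, 36659)
print(gene_length)  # 2483

# Bases needed to encode 783 amino acids plus one stop codon.
coding_bases = (783 + 1) * 3
print(gene_length - coding_bases)  # 131
```

The 131 bp left over after the coding bases are accounted for is consistent with the record marking the gene as spliced.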


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.000900604
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCACAA ATCTATACGC TATAGCAGGC CCCTCAAAAC CCACAACTCC GACATCGACC 
CCTTCTCCAC GCTCCGAGCC GCCTTCACCG CTCAAATCAC TCACATCACT CCCGACAAAC
CCGCTCAACT CGCATGGCAC GTCTACCCCC AACACACTCA CAAATCAGCT GTCAAGCACA
GGAATAGGAA TATCCAAACC GGGCCTAAGT GTGGATGAGA ATGGAGAAGT CATGAAGGTG
CCCGCATTCT TGAACAAGCT GTATACGATG GTCAGTGATC CGGAGGTGGA CGACTTGATT
TACTGGGGAG AGAGTGGGGA TTCATTCTTT GGTACGTCGA TTTTGTTGAC ATTATACCAT
CTTGTCCTTT AGATATATTT TACTAAGCTT ACAAAATTAT AGTACCGAAT GCAGAGCTAT
TCGGGAGAGA ACTCTTACCG AGATGGTTTA AACATTCCAA CTTCTCAAGT TTTGTCCGTC
AACTCAACAT GTATGGGTTT CGTACGTTCA TTCATGTCTT TCCCTATCTT TATACCAACT
GACTTATCTT CCGATATCCA GACAAAGTCC CTCACCTTCA GTCTGGTGCC CTGAAGAATG
AAACGCCCAT CGAATTATGG GAGTTCGCAA ACCCTTATTT CAAACGCGGC CAACCCCAAC
TTCTCACCAA AGTAACTCGC AAAAACAACC GACCTTCAAA CTCTGGTGTT GGACCTTCAT
CTTCCGTTGG AGGTAGCGGA GCTGGTGGAG GAATGAGCAC CCGCTCTGCA TCTGCTGCTG
CTGCCTCTGG CTCTGCTTCC GGACAAATCC AGCAAGCCAT CAGTCAAGGC CATGAAGCTG
GTAACCATTC CACTTCAGGA AAATACCTTA TCACAGACGG TACCACCCCT GGCTCTGTCC
CTCCTTCCCA CACCTCCGCC GGTCCACTCA TCGCCCCTCA AACCCTCGAT CTTTCGGCAA
TCAATTCTGG TATCGCCGCC ATACGCCAAA CCCAAGCTTC CATCGCTACC GATCTCCGCA
AACTTCAGGC ATCTAACGAA GCGCTTTGGA GGCAGGCGTA TGAAACGCAG GAAAAGCAGA
GGAAACATGA AGAGACGATA GATTTGATTG TAAGCTTTTT GGAGAGGTTG TTTGGGACGG
AAGGGGAGGG ATTGAAGGGA TTGAAGGAGG CGATGAGAAG GGGGGTTGGA GTAAGAAGGG
ATAGGGATGG GAGAGAAGGT AGGGATTCAA GAGATTCGAG ATTTGCGGAG GACGACGATG
GGGGACAGAA GAAAAGAAGG AGGGTAGGAA TCGATAGGAT GATTGAGGGT GGCACCGGTG
ATGGAACAGG CGAACATGGT GAGATTGAAA GCCCAACATC AGACGATCGC CTCGTCGAGA
TCGGATCCAA CTCGGAATAT TCCATCCCAT CCGTCAAACG TACCTCCTCT TCCTCCCACC
CAATTTCCCT CGGTCAACTG GGTTCCTCCC GATTTACTGC GCTGCCTTCC GAAGATCCTT
CTCCTTCAGC TTCTGGACCT GGATCAACAT CTTACGAAGG TCTTCACACC ACACAAACTA
ATGCCCGTGG AGCTGGGGCT GACGTCAACG TGACCGACCC GACTTTAGGC ATGAACCACC
TCTCGCCTCT ATCCGATACC GATCCCCTCC TCCCGTCATC ATCCAACGCC CTCGCCCCAT
ACTCCTCTCA CCTCCCCTTC CCTTCTTCCA ACTCTAACCA ATCTAACTCA TTTAACCCAT
CTAACCCATC TTCCGCATGG GCCTCCAACC CTTCCCAACC CTTACTCTCA CCAACATCCG
CCGCAGCCGC CGCACACGCA TATAACCTCG ATCCTTCTCT GCTCCAAACC ACGATCGGGA
GTCTACTCCA AAGTCCTGCA GCGGCGCAAA TGTTTTTGAA TTCGTTAAGC GCCAGTGCAC
AAGGTCAGGC TTTGGCTTCG CACTCTCATC CCCATAATCC ATCTCCGCTG AACCCGAACC
CGAACGGCAA TGCCTCCACC TCGGCCTCTG CTTCTGCTCA TGGCATGAAT ACCGGAGGTA
TGGGAACAGG ATCAGGAACC AAAGACGTCG ACCCAACTCT CGCCCTTTTT TCCCCACTCC
CCTCCCATTC GTCGCTCACT TCCCAATCCA ACGACCTCTT GAAATCCTAC AGTGACGCCC
TCACAGTCGG AGAAGGCGTG GACAATTTAC AAGAGAGTAT CGATAGTCTG GTGAGGAGTA
TGGGGTTGGA TTTGCCTAAT GGTGGATCTT CTGTGGGTGT CGATGTCGGT GACGGGGCTG
GAGTTGGAAC AGAGACAGGG GAAGGGGATG GAGAGTTTAA TGTGGATGAA TTCTTGCAGG
GCTTGGCGAA GGAAGGGGAA GAAGAAGGAG AAAGGGAAGT AGAAGGGGAT GGGGGTGTGT
CAAGCTCAGG CGCAGGCGCA GGCGCAGAAA ATGGAAGGAA GGAAGATGTA ATTGCCCAAA
GTGGCCTCAA GTCGGAAAGT TAA
 
Protein sequence
MTTNLYAIAG PSKPTTPTST PSPRSEPPSP LKSLTSLPTN PLNSHGTSTP NTLTNQLSST 
GIGISKPGLS VDENGEVMKV PAFLNKLYTM VSDPEVDDLI YWGESGDSFF VPNAELFGRE
LLPRWFKHSN FSSFVRQLNM YGFHKVPHLQ SGALKNETPI ELWEFANPYF KRGQPQLLTK
VTRKNNRPSN SGVGPSSSVG GSGAGGGMST RSASAAAASG SASGQIQQAI SQGHEAGNHS
TSGKYLITDG TTPGSVPPSH TSAGPLIAPQ TLDLSAINSG IAAIRQTQAS IATDLRKLQA
SNEALWRQAY ETQEKQRKHE ETIDLIVSFL ERLFGTEGEG LKGLKEAMRR GVGVRRDRDG
REGRDSRDSR FAEDDDGGQK KRRRVGIDRM IEGGTGDGTG EHGEIESPTS DDRLVEIGSN
SEYSIPSVKR TSSSSHPISL GQLGSSRFTA LPSEDPSPSA SGPGSTSYEG LHTTQTNARG
AGADVNVTDP TLGMNHLSPL SDTDPLLPSS SNALAPYSSH LPFPSSNSNQ SNSFNPSNPS
SAWASNPSQP LLSPTSAAAA AHAYNLDPSL LQTTIGSLLQ SPAAAQMFLN SLSASAQGQA
LASHSHPHNP SPLNPNPNGN ASTSASASAH GMNTGGMGTG SGTKDVDPTL ALFSPLPSHS
SLTSQSNDLL KSYSDALTVG EGVDNLQESI DSLVRSMGLD LPNGGSSVGV DVGDGAGVGT
ETGEGDGEFN VDEFLQGLAK EGEEEGEREV EGDGGVSSSG AGAGAENGRK EDVIAQSGLK
SES
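
The sequences above are printed in blocks of ten characters separated by spaces. A minimal sketch, assuming that layout, of how the blocks could be joined into one string and a GC percentage computed. `clean_sequence` and `gc_percent` are hypothetical helpers, and the record does not state how its own GC content figure is calculated or rounded.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as printed in the record.
first_line = "ATGACCACAA ATCTATACGC TATAGCAGGC CCCTCAAAAC CCACAACTCC GACATCGACC "
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions gives a length and GC percentage that can be compared against the Gene Length and GC content fields.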