Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNB00120 |
Symbol | |
ID | 3255945 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | + |
Start bp | 34177 |
End bp | 36659 |
Gene Length | 2483 bp |
Protein Length | 783 aa |
Translation table | |
GC content | 51% |
IMG OID | 638254665 |
Product | heat shock transcription factor 2, putative |
Protein accession | XP_568758 |
Protein GI | 58262696 |
COG category | [K] Transcription |
COG ID | [COG5169] Heat shock transcription factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.000900604 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACAA ATCTATACGC TATAGCAGGC CCCTCAAAAC CCACAACTCC GACATCGACC CCTTCTCCAC GCTCCGAGCC GCCTTCACCG CTCAAATCAC TCACATCACT CCCGACAAAC CCGCTCAACT CGCATGGCAC GTCTACCCCC AACACACTCA CAAATCAGCT GTCAAGCACA GGAATAGGAA TATCCAAACC GGGCCTAAGT GTGGATGAGA ATGGAGAAGT CATGAAGGTG CCCGCATTCT TGAACAAGCT GTATACGATG GTCAGTGATC CGGAGGTGGA CGACTTGATT TACTGGGGAG AGAGTGGGGA TTCATTCTTT GGTACGTCGA TTTTGTTGAC ATTATACCAT CTTGTCCTTT AGATATATTT TACTAAGCTT ACAAAATTAT AGTACCGAAT GCAGAGCTAT TCGGGAGAGA ACTCTTACCG AGATGGTTTA AACATTCCAA CTTCTCAAGT TTTGTCCGTC AACTCAACAT GTATGGGTTT CGTACGTTCA TTCATGTCTT TCCCTATCTT TATACCAACT GACTTATCTT CCGATATCCA GACAAAGTCC CTCACCTTCA GTCTGGTGCC CTGAAGAATG AAACGCCCAT CGAATTATGG GAGTTCGCAA ACCCTTATTT CAAACGCGGC CAACCCCAAC TTCTCACCAA AGTAACTCGC AAAAACAACC GACCTTCAAA CTCTGGTGTT GGACCTTCAT CTTCCGTTGG AGGTAGCGGA GCTGGTGGAG GAATGAGCAC CCGCTCTGCA TCTGCTGCTG CTGCCTCTGG CTCTGCTTCC GGACAAATCC AGCAAGCCAT CAGTCAAGGC CATGAAGCTG GTAACCATTC CACTTCAGGA AAATACCTTA TCACAGACGG TACCACCCCT GGCTCTGTCC CTCCTTCCCA CACCTCCGCC GGTCCACTCA TCGCCCCTCA AACCCTCGAT CTTTCGGCAA TCAATTCTGG TATCGCCGCC ATACGCCAAA CCCAAGCTTC CATCGCTACC GATCTCCGCA AACTTCAGGC ATCTAACGAA GCGCTTTGGA GGCAGGCGTA TGAAACGCAG GAAAAGCAGA GGAAACATGA AGAGACGATA GATTTGATTG TAAGCTTTTT GGAGAGGTTG TTTGGGACGG AAGGGGAGGG ATTGAAGGGA TTGAAGGAGG CGATGAGAAG GGGGGTTGGA GTAAGAAGGG ATAGGGATGG GAGAGAAGGT AGGGATTCAA GAGATTCGAG ATTTGCGGAG GACGACGATG GGGGACAGAA GAAAAGAAGG AGGGTAGGAA TCGATAGGAT GATTGAGGGT GGCACCGGTG ATGGAACAGG CGAACATGGT GAGATTGAAA GCCCAACATC AGACGATCGC CTCGTCGAGA TCGGATCCAA CTCGGAATAT TCCATCCCAT CCGTCAAACG TACCTCCTCT TCCTCCCACC CAATTTCCCT CGGTCAACTG GGTTCCTCCC GATTTACTGC GCTGCCTTCC GAAGATCCTT CTCCTTCAGC TTCTGGACCT GGATCAACAT CTTACGAAGG TCTTCACACC ACACAAACTA ATGCCCGTGG AGCTGGGGCT GACGTCAACG TGACCGACCC GACTTTAGGC ATGAACCACC TCTCGCCTCT ATCCGATACC GATCCCCTCC TCCCGTCATC ATCCAACGCC CTCGCCCCAT ACTCCTCTCA CCTCCCCTTC CCTTCTTCCA ACTCTAACCA ATCTAACTCA TTTAACCCAT CTAACCCATC TTCCGCATGG GCCTCCAACC CTTCCCAACC CTTACTCTCA CCAACATCCG CCGCAGCCGC CGCACACGCA TATAACCTCG ATCCTTCTCT GCTCCAAACC ACGATCGGGA GTCTACTCCA AAGTCCTGCA GCGGCGCAAA TGTTTTTGAA TTCGTTAAGC GCCAGTGCAC AAGGTCAGGC TTTGGCTTCG CACTCTCATC CCCATAATCC ATCTCCGCTG AACCCGAACC CGAACGGCAA TGCCTCCACC TCGGCCTCTG CTTCTGCTCA TGGCATGAAT ACCGGAGGTA TGGGAACAGG ATCAGGAACC AAAGACGTCG ACCCAACTCT CGCCCTTTTT TCCCCACTCC CCTCCCATTC GTCGCTCACT TCCCAATCCA ACGACCTCTT GAAATCCTAC AGTGACGCCC TCACAGTCGG AGAAGGCGTG GACAATTTAC AAGAGAGTAT CGATAGTCTG GTGAGGAGTA TGGGGTTGGA TTTGCCTAAT GGTGGATCTT CTGTGGGTGT CGATGTCGGT GACGGGGCTG GAGTTGGAAC AGAGACAGGG GAAGGGGATG GAGAGTTTAA TGTGGATGAA TTCTTGCAGG GCTTGGCGAA GGAAGGGGAA GAAGAAGGAG AAAGGGAAGT AGAAGGGGAT GGGGGTGTGT CAAGCTCAGG CGCAGGCGCA GGCGCAGAAA ATGGAAGGAA GGAAGATGTA ATTGCCCAAA GTGGCCTCAA GTCGGAAAGT TAA
|
Protein sequence | MTTNLYAIAG PSKPTTPTST PSPRSEPPSP LKSLTSLPTN PLNSHGTSTP NTLTNQLSST GIGISKPGLS VDENGEVMKV PAFLNKLYTM VSDPEVDDLI YWGESGDSFF VPNAELFGRE LLPRWFKHSN FSSFVRQLNM YGFHKVPHLQ SGALKNETPI ELWEFANPYF KRGQPQLLTK VTRKNNRPSN SGVGPSSSVG GSGAGGGMST RSASAAAASG SASGQIQQAI SQGHEAGNHS TSGKYLITDG TTPGSVPPSH TSAGPLIAPQ TLDLSAINSG IAAIRQTQAS IATDLRKLQA SNEALWRQAY ETQEKQRKHE ETIDLIVSFL ERLFGTEGEG LKGLKEAMRR GVGVRRDRDG REGRDSRDSR FAEDDDGGQK KRRRVGIDRM IEGGTGDGTG EHGEIESPTS DDRLVEIGSN SEYSIPSVKR TSSSSHPISL GQLGSSRFTA LPSEDPSPSA SGPGSTSYEG LHTTQTNARG AGADVNVTDP TLGMNHLSPL SDTDPLLPSS SNALAPYSSH LPFPSSNSNQ SNSFNPSNPS SAWASNPSQP LLSPTSAAAA AHAYNLDPSL LQTTIGSLLQ SPAAAQMFLN SLSASAQGQA LASHSHPHNP SPLNPNPNGN ASTSASASAH GMNTGGMGTG SGTKDVDPTL ALFSPLPSHS SLTSQSNDLL KSYSDALTVG EGVDNLQESI DSLVRSMGLD LPNGGSSVGV DVGDGAGVGT ETGEGDGEFN VDEFLQGLAK EGEEEGEREV EGDGGVSSSG AGAGAENGRK EDVIAQSGLK SES
|
| |