Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_44099 |
Symbol | HSF1 |
ID | 7203865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 960374 |
End bp | 961889 |
Gene Length | 1516 bp |
Protein Length | 428 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | heat shock factor, DNA-binding |
Protein accession | XP_002186160 |
Protein GI | 219113153 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.52958 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATAAACAGTC GTCCCGACAG GTTTGTTGTA GTCTCTTCGT TGCTCTTCTA CGATGGATTA TTTGAGTATT GAAGAACACG CGGCCGCCTC CGCCGTCGCA GCGCTTGGTG GCAACCAGGA GAACAACAAA AGGTCCCGGG ACGACTCTCC CGATGCTGCC GGTGGTTGCC ACAAGCTTGC CCGAGTTGAA GACGAGCAGC TCGTTGCTGC CGGAGGTATG GGTCCTCCTC TTCCTCCTGC TGAAAATGTG GCACAAGGCG CCATGCTCCA GCCTTACCCG ATGTTCTACT ACCGAGACTT TTCTACTGAA TCGGATCCGG ACGCCTTGAC CCCCTTGACT CCTCCGGGGC GTGTTCCGAA CTTCCCTGCG AAGATGCATT CAATTCTCAG CCGTCCTGAT CTTGCGGATG TGATCTGCTG GATGCCCCAT GGCCGATCTT GGCGCGTTTT GAAGCCTCGG GAGTTTGAGA TTCGTGTAAT TCCCACTTAT TTTGAACACG CCAAGTTCTC ATCCTTCATT CGTCAGGCGA ATGGATGGGG ATTCCGTCGA ATCACCCAAG GTCGTGATCG CAACTCTTAC TACCACGAGT TGTTCCTCCG TGGACTTCCC CACCTCTGCA AGCAGATGAA GCGCCCAGGT GTTGCGCAAA AGCAAGCTGC CGATCCTGAG CACGAGCCTG ATCTTTACAA AGTCTCCGAG ATGTACGCGG TTCCCGAAAA AGCTGAGGAT GACTCGATTC TTCTCCAGTG CACGCTTCAG GGAGGCCCGA AGGCCCGCAT GCCCATCTAC TCGGGCGCCC TGAACAACTC CTCTCTCAAG GATTTCAAGA TGCCTGGTGT TGAAACTGCG TCGTTGACTC CTCGCGACCA GCAAGCACTT AGCGCGTTTC AACAGTCCCT CGGTGCATCC GAGAGTCAGT TCAAGTCTAT GAGCTTTTCC ACCACAACTC CTCAAGCCAC CCCGCTTGTA CTTCCGCAGC AGTCTTACAT GACGCCGAAT GCTGCCCCCG TCAATATTCG ACCAAACGTG GCCACCACAG AAGGAACAAA TAACAATATG TCAGCTCTCA TGGCAGCTAA TCAATTGGCT TTTTCTCAAC CCAACATGGC CGCGGCCTTC CAAGCAAGTT CCGCTGCATC GCAGTTTGCA GCAGGATTCG CTGCGGCAAC CGCTTTGAGT CATCAACAGT TCCAAACAAT GCTAGGACAG TTTGGAGTAG CTGCCCAACC TACCCAAGTT TCGGTTCAAC AACCGATGAG CATCCAGCAA CAACCCGGGG AGCAACAGCC GCCCATTCAG GTGCAGCAAC AGCAATCAAT GATTGCGAAC ACGAACTAGG AATACGTTTT CCGTTTACTT TACGTTACGA CGACTAGAAC GCGGCCACGA GCTCTCCTCA AGTCCTTTTC ATCTTTCCCT TGACACACTC TTTTGCACAA CTAACTGGCA TTGCATGATT TTAGCTTCAC ATTAATTTCT CTTAACTTTA CGGTTAGTTT GAGTTTCTGA TGAAAA
|
Protein sequence | MDYLSIEEHA AASAVAALGG NQENNKRSRD DSPDAAGGCH KLARVEDEQL VAAGGMGPPL PPAENVAQGA MLQPYPMFYY RDFSTESDPD ALTPLTPPGR VPNFPAKMHS ILSRPDLADV ICWMPHGRSW RVLKPREFEI RVIPTYFEHA KFSSFIRQAN GWGFRRITQG RDRNSYYHEL FLRGLPHLCK QMKRPGVAQK QAADPEHEPD LYKVSEMYAV PEKAEDDSIL LQCTLQGGPK ARMPIYSGAL NNSSLKDFKM PGVETASLTP RDQQALSAFQ QSLGASESQF KSMSFSTTTP QATPLVLPQQ SYMTPNAAPV NIRPNVATTE GTNNNMSALM AANQLAFSQP NMAAAFQASS AASQFAAGFA AATALSHQQF QTMLGQFGVA AQPTQVSVQQ PMSIQQQPGE QQPPIQVQQQ QSMIANTN
|
| |