Gene PHATR_44099 details


Gene Information

Locus tag: PHATR_44099
Symbol: HSF1
ID: 7203865
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011671
Strand:
Start bp: 960374
End bp: 961889
Gene Length: 1516 bp
Protein Length: 428 aa
Translation table:
GC content: 52%
IMG OID:
Product: heat shock factor, DNA-binding
Protein accession: XP_002186160
Protein GI: 219113153
COG category:
COG ID:
TIGRFAM ID:
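
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the coding sequence ends with a single stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 960374
end_bp = 961889
protein_length_aa = 428

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus one stop codon.
cds_length_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that are not part of the coding sequence.
non_coding_bp = gene_length_bp - cds_length_bp

print(gene_length_bp)  # 1516
print(cds_length_bp)   # 1287
print(non_coding_bp)   # 229
```

Under these assumptions the computed gene length matches the listed 1516 bp, and the gene span is 229 bp longer than the coding sequence implied by the 428 aa protein. Because the record lists the gene as unspliced, that difference would be flanking untranslated sequence rather than introns.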


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.52958
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATAAACAGTC GTCCCGACAG GTTTGTTGTA GTCTCTTCGT TGCTCTTCTA CGATGGATTA 
TTTGAGTATT GAAGAACACG CGGCCGCCTC CGCCGTCGCA GCGCTTGGTG GCAACCAGGA
GAACAACAAA AGGTCCCGGG ACGACTCTCC CGATGCTGCC GGTGGTTGCC ACAAGCTTGC
CCGAGTTGAA GACGAGCAGC TCGTTGCTGC CGGAGGTATG GGTCCTCCTC TTCCTCCTGC
TGAAAATGTG GCACAAGGCG CCATGCTCCA GCCTTACCCG ATGTTCTACT ACCGAGACTT
TTCTACTGAA TCGGATCCGG ACGCCTTGAC CCCCTTGACT CCTCCGGGGC GTGTTCCGAA
CTTCCCTGCG AAGATGCATT CAATTCTCAG CCGTCCTGAT CTTGCGGATG TGATCTGCTG
GATGCCCCAT GGCCGATCTT GGCGCGTTTT GAAGCCTCGG GAGTTTGAGA TTCGTGTAAT
TCCCACTTAT TTTGAACACG CCAAGTTCTC ATCCTTCATT CGTCAGGCGA ATGGATGGGG
ATTCCGTCGA ATCACCCAAG GTCGTGATCG CAACTCTTAC TACCACGAGT TGTTCCTCCG
TGGACTTCCC CACCTCTGCA AGCAGATGAA GCGCCCAGGT GTTGCGCAAA AGCAAGCTGC
CGATCCTGAG CACGAGCCTG ATCTTTACAA AGTCTCCGAG ATGTACGCGG TTCCCGAAAA
AGCTGAGGAT GACTCGATTC TTCTCCAGTG CACGCTTCAG GGAGGCCCGA AGGCCCGCAT
GCCCATCTAC TCGGGCGCCC TGAACAACTC CTCTCTCAAG GATTTCAAGA TGCCTGGTGT
TGAAACTGCG TCGTTGACTC CTCGCGACCA GCAAGCACTT AGCGCGTTTC AACAGTCCCT
CGGTGCATCC GAGAGTCAGT TCAAGTCTAT GAGCTTTTCC ACCACAACTC CTCAAGCCAC
CCCGCTTGTA CTTCCGCAGC AGTCTTACAT GACGCCGAAT GCTGCCCCCG TCAATATTCG
ACCAAACGTG GCCACCACAG AAGGAACAAA TAACAATATG TCAGCTCTCA TGGCAGCTAA
TCAATTGGCT TTTTCTCAAC CCAACATGGC CGCGGCCTTC CAAGCAAGTT CCGCTGCATC
GCAGTTTGCA GCAGGATTCG CTGCGGCAAC CGCTTTGAGT CATCAACAGT TCCAAACAAT
GCTAGGACAG TTTGGAGTAG CTGCCCAACC TACCCAAGTT TCGGTTCAAC AACCGATGAG
CATCCAGCAA CAACCCGGGG AGCAACAGCC GCCCATTCAG GTGCAGCAAC AGCAATCAAT
GATTGCGAAC ACGAACTAGG AATACGTTTT CCGTTTACTT TACGTTACGA CGACTAGAAC
GCGGCCACGA GCTCTCCTCA AGTCCTTTTC ATCTTTCCCT TGACACACTC TTTTGCACAA
CTAACTGGCA TTGCATGATT TTAGCTTCAC ATTAATTTCT CTTAACTTTA CGGTTAGTTT
GAGTTTCTGA TGAAAA
 
Protein sequence
MDYLSIEEHA AASAVAALGG NQENNKRSRD DSPDAAGGCH KLARVEDEQL VAAGGMGPPL 
PPAENVAQGA MLQPYPMFYY RDFSTESDPD ALTPLTPPGR VPNFPAKMHS ILSRPDLADV
ICWMPHGRSW RVLKPREFEI RVIPTYFEHA KFSSFIRQAN GWGFRRITQG RDRNSYYHEL
FLRGLPHLCK QMKRPGVAQK QAADPEHEPD LYKVSEMYAV PEKAEDDSIL LQCTLQGGPK
ARMPIYSGAL NNSSLKDFKM PGVETASLTP RDQQALSAFQ QSLGASESQF KSMSFSTTTP
QATPLVLPQQ SYMTPNAAPV NIRPNVATTE GTNNNMSALM AANQLAFSQP NMAAAFQASS
AASQFAAGFA AATALSHQQF QTMLGQFGVA AQPTQVSVQQ PMSIQQQPGE QQPPIQVQQQ
QSMIANTN
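
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of one way to do that; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first displayed line of the gene sequence and the last line of the protein sequence are used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a display-formatted sequence block into one uppercase string."""
    # str.split() with no argument drops spaces, tabs and newlines alike.
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence in this record.
first_gene_line = (
    "ATAAACAGTC GTCCCGACAG GTTTGTTGTA GTCTCTTCGT TGCTCTTCTA CGATGGATTA"
)
# Last displayed line of the protein sequence in this record.
last_protein_line = "QSMIANTN"

gene_fragment = clean_sequence(first_gene_line)
protein_fragment = clean_sequence(last_protein_line)

print(len(gene_fragment))     # 60
print(len(protein_fragment))  # 8
print(gc_percent("ATGC"))     # 50.0
```

Applied to the complete gene and protein blocks, the same cleaning step should yield strings whose lengths can be compared with the listed Gene Length and Protein Length fields, and `gc_percent` on the full gene sequence can be compared with the listed GC content.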