Gene PHATR_44200 details


Gene Information

Locus tag: PHATR_44200
Symbol: HSF3
ID: 7204113
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011671
Strand:
Start bp: 1290821
End bp: 1292970
Gene Length: 2150 bp
Protein Length: 457 aa
Translation table:
GC content: 48%
IMG OID:
Product: heat shock factor, DNA-binding
Protein accession: XP_002186220
Protein GI: 219113273
COG category:
COG ID:
TIGRFAM ID:
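
The length fields above can be cross-checked against the coordinates. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention); the helper name `span_length` is hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the Start bp / End bp fields of this record.
gene_length = span_length(1290821, 1292970)
print(gene_length)  # 2150, matching the Gene Length field

# A 457 aa protein needs 457 codons plus one stop codon.
coding_nt = (457 + 1) * 3
print(coding_nt)  # 1374
```

Under these assumptions, the 1374 coding nucleotides are fewer than the 2150 bp gene span, which is consistent with the record marking the gene as spliced; the remainder would be intronic or untranslated sequence.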


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.106256
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAGCAACTCC CCGTATTCCT ACGACCAACC TCGTTTGTTG TTGAAACCCC ACGGACCAAG 
TGTCTAGCAG CAATCCGCAA ATCACTCATA CTATTATAGA CATCTGACAA GCCACCATGG
CGAAGACCAA TCCACAAAGC GTAGCGACTT CGATCGGCGA ACCTTCACCG GACGTTCCCA
TCTTTCTACG GAGTGAGTAA AGCACTTTCC AGTGCGATAG ATTTATCGCC GTAGGATTTC
TACGCGTTTC GTCGTAGACG ATCCGAAAAT GTCCGTTTGC AATGGAGCGA CCCGCGGAAA
TGACGCTTGA GCAACGATGG AATGCAGGAA TGGACGAAGG ATACACGCAT TTTCTCTGTC
TCATCTCACA CTTGCCTTCC TCCCATTCTT GATTTTACAG AAACCTACTA CATGATCGAT
CAATGCGACG ATGAAATCGC CTGTTGGTCC GAGGACGGCA CCACGTTTGT GGTGAAAGAC
CCAGATCGTT TCGAACGGAC AATCATCCCG CAATACTTCA AGCATTCCAA GTTTAGCAGT
TTCGTAAGGG TACGTTGGTG ATTGCTGGCT TGCGGACGGA TTTCTAGGCT GTTCAATGCG
CTCGTCAGGG AAATTTGGCC CCCTATGGGA ATCCCGGGAT TGGCACTGCC GAACGCGTTT
GATTTCTCTG TGTTCTCACG CCTTTGTTCC TTTCCCCTTT TTTTTGGCAG CAACTTAACT
TTTATTCCTT TCGCAAGATC AAGTACGCTG ACACTATTCG CATTGATCCC AAGCTGGAGG
CCGAAACGGC TAATTATTGG CGATTTCGAC ACGAAAATTT CCAGAAGGGC AAACCGGAAC
TTTTGACTGA GATTAAACGC ATGAACGGAC AGAAAGCCCC TACGTCTCCC TCGACTTCTA
GTTCCAGCAG CGTTTCCACT ACGCAAGGGA ACAAAGTAGG TGTGACAGCA TCTGCCAAAG
TAACGCCCGA TCCTGACGGT GCCAGCAAAG CTAGCAAGTC AGAAGTGCAG TCGCTTCAGA
AACGCATCGA AGAAATGACC AAAAACATTG ATCAGCTTAC GGCTATGGTT CAGAAGGTTT
CCCTAAAGCA GGAAGAGGAG GATCAAGTAG GATCCAAGCG TAAAAAGACG GAATTATTGA
TCAAGACGGA AGACTTTCAA ACTGCTTTTA ACGGAGACCT TATGATGGAT GTCAATGGAG
AATCGGATCA ATTGGTCCGT CCGGACGACA TGTTTAGTAA CATGGAGCTC GACGAGATTA
TAACAGCGAC AGGAAACAAA GATCTATCCT CATCTGTGGA AGCGGAGGCA GCCGCTTTGT
CCATAACACC ACCGTCCCCC GTTAAGTCAA CCGTACCCAT GATCCGCGAG ACTTCCGTCA
ACACTCAGGT CAGCGACAAT GAGTTTGTGG ATCAGCTTTT CACCGCTTTC AACGAAGACT
CAGATGATCT TTTGCAGATG GAACCACCTT CCTTTGGTCT CGACTCCGCC AATCGACCAG
ATGCCGAACT CATGAAGAGA TTAAGTGACG CACTCATGTT TTTGCCTCGG GATATTCAAG
TAATGATTGT TGAGCGATTG ATCGCCGCCA TTACTTCCAC CGAATCCCTC AAGCCCCTTT
CCAAGGAAAA CACATCGAAG TCTTTGCAAG TGCAATCTGC TATGACTCCT TCGACATCCA
TTCCGCAGAC TTTGGAAGAG GAGAAACAGG ACGTGCCCAT GCCCTTGGCT GCCGCCACTT
TGGCCGCATT GTTGCACCAC TATAGTAGCC AAATTCAGGC CCAGCAACAG GGTAACAAGA
GCAAAAAACC CCAGAACGTT CAAAAGTCTA TTCCGGTCAT TCCAGTACAT GCTTAAGTAG
CACGAATACT GTGACACTTG TTTGGGGTTA TCGCACGGAA CACTGTGTGT GGGGCCTACT
ACATATAGGC ACGCGCGTCA AGGGTACAAT TGGGAAATTA TGTATAGACC ACTTGGCGCG
TCACCGATCT TGGAGTCGAG AGGGACGGTT TTATGCCTTA CACGGAGATG CTCGATCCTC
TTCTTAGATA TCTTTCTCCT TTAGCCTTTC TCTAAGGTCA GCTTTTTACT TATATAACGA
TTATATAGTA CACGCTATAA CGAATATAAG CTTTACAATC TATTTCTTTT
 
Protein sequence
MQEWTKDTRI FSVSSHTCLP PILDFTETYY MIDQCDDEIA CWSEDGTTFV VKDPDRFERT 
IIPQYFKHSK FSSFVRQLNF YSFRKIKYAD TIRIDPKLEA ETANYWRFRH ENFQKGKPEL
LTEIKRMNGQ KAPTSPSTSS SSSVSTTQGN KVGVTASAKV TPDPDGASKA SKSEVQSLQK
RIEEMTKNID QLTAMVQKVS LKQEEEDQVG SKRKKTELLI KTEDFQTAFN GDLMMDVNGE
SDQLVRPDDM FSNMELDEII TATGNKDLSS SVEAEAAALS ITPPSPVKST VPMIRETSVN
TQVSDNEFVD QLFTAFNEDS DDLLQMEPPS FGLDSANRPD AELMKRLSDA LMFLPRDIQV
MIVERLIAAI TSTESLKPLS KENTSKSLQV QSAMTPSTSI PQTLEEEKQD VPMPLAAATL
AALLHHYSSQ IQAQQQGNKS KKPQNVQKSI PVIPVHA
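
Both sequences above are printed in space-separated groups of 10 residues, so they need to be cleaned before lengths or base composition can be checked. The following is a minimal Python sketch of that step; the function names `clean_sequence` and `gc_fraction` are hypothetical helpers. It does not claim to reproduce the record's 48% GC figure, since the record does not say which region or rounding was used.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C among A/C/G/T bases; 0.0 if none are present."""
    counted = [base for base in dna if base in "ACGT"]
    if not counted:
        return 0.0
    return sum(base in "GC" for base in counted) / len(counted)


# Last line of the protein sequence above, as a small example.
protein_tail = clean_sequence("AALLHHYSSQ IQAQQQGNKS KKPQNVQKSI PVIPVHA")
print(len(protein_tail))  # 37

# First line of the gene sequence above.
first_line = clean_sequence(
    "CAGCAACTCC CCGTATTCCT ACGACCAACC TCGTTTGTTG TTGAAACCCC ACGGACCAAG"
)
print(len(first_line))  # 60
print(gc_fraction(first_line))
```

Applying `clean_sequence` to the full gene and protein blocks and taking `len` of the results gives a direct check against the Gene Length and Protein Length fields.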