Gene Information |
Locus tag | PHATR_44200 |
Symbol | HSF3 |
ID | 7204113 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 1290821 |
End bp | 1292970 |
Gene Length | 2150 bp |
Protein Length | 457 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | heat shock factor, DNA-binding |
Protein accession | XP_002186220 |
Protein GI | 219113273 |
COG category | |
COG ID | |
TIGRFAM ID | |
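The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, an assumption that is consistent with the reported Gene Length of 2150 bp. The function name span_length is hypothetical and is not part of any database API. The second figure is the minimum number of coding nucleotides needed for a 457 aa protein plus one stop codon. It is smaller than the genomic span, which is what one would expect for a gene flagged as spliced, although the record itself does not say how the remaining bases divide between introns and any untranslated regions.

```python
# Values are copied from the record above. The helper name is hypothetical.

def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end_bp - start_bp + 1


START_BP = 1290821
END_BP = 1292970
PROTEIN_LENGTH_AA = 457

gene_length = span_length(START_BP, END_BP)

# Each amino acid is encoded by one codon (3 nt); add one codon for the stop.
coding_nt = 3 * (PROTEIN_LENGTH_AA + 1)

print(f"Genomic span: {gene_length} bp")
print(f"Minimum coding length: {coding_nt} nt")
print(f"Span not accounted for by coding sequence: {gene_length - coding_nt} bp")
```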
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.106256 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAGCAACTCC CCGTATTCCT ACGACCAACC TCGTTTGTTG TTGAAACCCC ACGGACCAAG TGTCTAGCAG CAATCCGCAA ATCACTCATA CTATTATAGA CATCTGACAA GCCACCATGG CGAAGACCAA TCCACAAAGC GTAGCGACTT CGATCGGCGA ACCTTCACCG GACGTTCCCA TCTTTCTACG GAGTGAGTAA AGCACTTTCC AGTGCGATAG ATTTATCGCC GTAGGATTTC TACGCGTTTC GTCGTAGACG ATCCGAAAAT GTCCGTTTGC AATGGAGCGA CCCGCGGAAA TGACGCTTGA GCAACGATGG AATGCAGGAA TGGACGAAGG ATACACGCAT TTTCTCTGTC TCATCTCACA CTTGCCTTCC TCCCATTCTT GATTTTACAG AAACCTACTA CATGATCGAT CAATGCGACG ATGAAATCGC CTGTTGGTCC GAGGACGGCA CCACGTTTGT GGTGAAAGAC CCAGATCGTT TCGAACGGAC AATCATCCCG CAATACTTCA AGCATTCCAA GTTTAGCAGT TTCGTAAGGG TACGTTGGTG ATTGCTGGCT TGCGGACGGA TTTCTAGGCT GTTCAATGCG CTCGTCAGGG AAATTTGGCC CCCTATGGGA ATCCCGGGAT TGGCACTGCC GAACGCGTTT GATTTCTCTG TGTTCTCACG CCTTTGTTCC TTTCCCCTTT TTTTTGGCAG CAACTTAACT TTTATTCCTT TCGCAAGATC AAGTACGCTG ACACTATTCG CATTGATCCC AAGCTGGAGG CCGAAACGGC TAATTATTGG CGATTTCGAC ACGAAAATTT CCAGAAGGGC AAACCGGAAC TTTTGACTGA GATTAAACGC ATGAACGGAC AGAAAGCCCC TACGTCTCCC TCGACTTCTA GTTCCAGCAG CGTTTCCACT ACGCAAGGGA ACAAAGTAGG TGTGACAGCA TCTGCCAAAG TAACGCCCGA TCCTGACGGT GCCAGCAAAG CTAGCAAGTC AGAAGTGCAG TCGCTTCAGA AACGCATCGA AGAAATGACC AAAAACATTG ATCAGCTTAC GGCTATGGTT CAGAAGGTTT CCCTAAAGCA GGAAGAGGAG GATCAAGTAG GATCCAAGCG TAAAAAGACG GAATTATTGA TCAAGACGGA AGACTTTCAA ACTGCTTTTA ACGGAGACCT TATGATGGAT GTCAATGGAG AATCGGATCA ATTGGTCCGT CCGGACGACA TGTTTAGTAA CATGGAGCTC GACGAGATTA TAACAGCGAC AGGAAACAAA GATCTATCCT CATCTGTGGA AGCGGAGGCA GCCGCTTTGT CCATAACACC ACCGTCCCCC GTTAAGTCAA CCGTACCCAT GATCCGCGAG ACTTCCGTCA ACACTCAGGT CAGCGACAAT GAGTTTGTGG ATCAGCTTTT CACCGCTTTC AACGAAGACT CAGATGATCT TTTGCAGATG GAACCACCTT CCTTTGGTCT CGACTCCGCC AATCGACCAG ATGCCGAACT CATGAAGAGA TTAAGTGACG CACTCATGTT TTTGCCTCGG GATATTCAAG TAATGATTGT TGAGCGATTG ATCGCCGCCA TTACTTCCAC CGAATCCCTC AAGCCCCTTT CCAAGGAAAA CACATCGAAG TCTTTGCAAG TGCAATCTGC TATGACTCCT TCGACATCCA TTCCGCAGAC TTTGGAAGAG GAGAAACAGG ACGTGCCCAT GCCCTTGGCT GCCGCCACTT TGGCCGCATT GTTGCACCAC TATAGTAGCC AAATTCAGGC CCAGCAACAG GGTAACAAGA 
GCAAAAAACC CCAGAACGTT CAAAAGTCTA TTCCGGTCAT TCCAGTACAT GCTTAAGTAG CACGAATACT GTGACACTTG TTTGGGGTTA TCGCACGGAA CACTGTGTGT GGGGCCTACT ACATATAGGC ACGCGCGTCA AGGGTACAAT TGGGAAATTA TGTATAGACC ACTTGGCGCG TCACCGATCT TGGAGTCGAG AGGGACGGTT TTATGCCTTA CACGGAGATG CTCGATCCTC TTCTTAGATA TCTTTCTCCT TTAGCCTTTC TCTAAGGTCA GCTTTTTACT TATATAACGA TTATATAGTA CACGCTATAA CGAATATAAG CTTTACAATC TATTTCTTTT
|
Protein sequence | MQEWTKDTRI FSVSSHTCLP PILDFTETYY MIDQCDDEIA CWSEDGTTFV VKDPDRFERT IIPQYFKHSK FSSFVRQLNF YSFRKIKYAD TIRIDPKLEA ETANYWRFRH ENFQKGKPEL LTEIKRMNGQ KAPTSPSTSS SSSVSTTQGN KVGVTASAKV TPDPDGASKA SKSEVQSLQK RIEEMTKNID QLTAMVQKVS LKQEEEDQVG SKRKKTELLI KTEDFQTAFN GDLMMDVNGE SDQLVRPDDM FSNMELDEII TATGNKDLSS SVEAEAAALS ITPPSPVKST VPMIRETSVN TQVSDNEFVD QLFTAFNEDS DDLLQMEPPS FGLDSANRPD AELMKRLSDA LMFLPRDIQV MIVERLIAAI TSTESLKPLS KENTSKSLQV QSAMTPSTSI PQTLEEEKQD VPMPLAAATL AALLHHYSSQ IQAQQQGNKS KKPQNVQKSI PVIPVHA
|
| |