Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_55070 |
Symbol | HSF2 |
ID | 7198250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | - |
Start bp | 155300 |
End bp | 156914 |
Gene Length | 1615 bp |
Protein Length | 387 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | DNA-binding heat shock factor |
Protein accession | XP_002184407 |
Protein GI | 219128411 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000184897 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAAAGACAAG CACCATTAAT ATAATATCGA CTCGCTCGTT CGAGTATGTG CAAACCAGAC GAGACCCACA CCATCAATCC ACTCTCTCTT GGAAATACTA GTCTTCTTCT TTCTCTTCTA ATCTCAAAGG TCGAAGAAGC AGCCGCTGCA CTTGCGCAAG AAAGTCCAAA GCGACGTCCT TCGCTCTGAG CGCAGGGCAG GAGGGGTTAA CCAGTTCGGA ACCCGTGAAA ATACAGCCAG TTGTGCTGAG TGCAGGAGTC AGCAGCCCGC TCATAGACGG AGCTCTTTGT CCAAAGGATA GATCAGGTCT ACGTCGTGCA GCTGCGACAG CCGCAACGTC CAAAATAATG ACTTACGCAT CTCCCAATAA TGTTGTCTCT GCTTCGAGTG ATCGAGACAG TAGCAATGAT GAATGCTCGA AAACTACTGA CCGACCCAGA AGACGCAAAC GGAAAGCTCG AAATCATTAT GGAGAATGTG AATCTACAGA TACTGGACAC AGGAGGGTTT ACGTCAACCA TAACTATCAT GATTACGCCG GTTGCCCCAA TCGAGTCCTC AACGAAGTTT ATACACTTGA ATCTATCGAG GCTAGAAAAA GCAGGGGCGG CACCAGCACT CCTTTCCCGA AAGTATTGCA TCGGATGCTG GATCAAGCTG AGGTGGATGG TTTTAGTGAA ATTGTGTCGT GGCAACCTCA CGGACGTGCA TTTTTAGTGC ACGACCAGGC TCGGTTCGTT GCGGAAGTCA TGCCACGTTT CTTTCGCCAG ACGCGGTTCT CATCCTTCCA GCGCCAATTG AGTCTCTACG GCTTTTTGCG TTTGACCCGT AAAGGAGCCG ACCACAACGC ATACTACCAT GAGCTCTGTC TACGAGGTAT GCCTGAGTTG CTCGCCCAGA TGCAGCGCAC CCGTATCAAG GGATACTGGG TGCGGCAGTC CTCCTCTCCG GAAAGTGAGC CAGACTTTTA CAGCATGCCG ACCGTCGAAG AATCTATTGA AAACCGACCT CTTCAATCCG ACAGTACTCA AGCACTTTCA AGTGGCAATA TTATTATTGA TGGTAATGAT TCCCTTGAGC CCATTCCGGT AAACCCGGTT TCCAACTTGG GCGTATCCAG AGCGCCTAGT TTTGGTAAGC TCGAACCAGG CCAATTCGCT GCTTATGGCG GTTGTATGAA ACTACCTCCA ATGCCGCCCC TTCGGGATGG TCCTCTCTGG TCATCAGCTG CATACTTGAA AGAATTCCCT GTGGACGAAG ATATGTCAAT CAGTATCGAT GCATTTATTG ACTCCATTGC TCCACATGCG TCGAATCCGT CCGACACTGC TGTTCTAGAA CCGCTACCAG CCTACTCTTC TACCAGTGAC ATGAGAGAAG ATCTAGTCAA CTTTTTGTCG AATGTCGACC TTAGCTCAGA AGACGGAGAC AACAGCGAAT TTTACCGGAA TGAAGAACAT TTGGAGAGGT TGCACAGCAG CAAGCAAGCA CAAAATGGGA TGGAAATGTG ATCAAAGTTC CGTTTCTTCG GCGGACTTTC CATCGAGGGC AAGGCATGCA TCTGCAGACA ATATTTACTG TTGGTTAAAA GCTTAGTGAA AAGTTCATGA TACAA
|
Protein sequence | MTYASPNNVV SASSDRDSSN DECSKTTDRP RRRKRKARNH YGECESTDTG HRRVYVNHNY HDYAGCPNRV LNEVYTLESI EARKSRGGTS TPFPKVLHRM LDQAEVDGFS EIVSWQPHGR AFLVHDQARF VAEVMPRFFR QTRFSSFQRQ LSLYGFLRLT RKGADHNAYY HELCLRGMPE LLAQMQRTRI KGYWVRQSSS PESEPDFYSM PTVEESIENR PLQSDSTQAL SSGNIIIDGN DSLEPIPVNP VSNLGVSRAP SFGKLEPGQF AAYGGCMKLP PMPPLRDGPL WSSAAYLKEF PVDEDMSISI DAFIDSIAPH ASNPSDTAVL EPLPAYSSTS DMREDLVNFL SNVDLSSEDG DNSEFYRNEE HLERLHSSKQ AQNGMEM
|
| |