Gene PHATRDRAFT_55070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_55070 
SymbolHSF2 
ID7198250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011691 
Strand
Start bp155300 
End bp156914 
Gene Length1615 bp 
Protein Length387 aa 
Translation table 
GC content49% 
IMG OID 
ProductDNA-binding heat shock factor 
Protein accessionXP_002184407 
Protein GI219128411 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000184897 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GAAAGACAAG CACCATTAAT ATAATATCGA CTCGCTCGTT CGAGTATGTG CAAACCAGAC 
GAGACCCACA CCATCAATCC ACTCTCTCTT GGAAATACTA GTCTTCTTCT TTCTCTTCTA
ATCTCAAAGG TCGAAGAAGC AGCCGCTGCA CTTGCGCAAG AAAGTCCAAA GCGACGTCCT
TCGCTCTGAG CGCAGGGCAG GAGGGGTTAA CCAGTTCGGA ACCCGTGAAA ATACAGCCAG
TTGTGCTGAG TGCAGGAGTC AGCAGCCCGC TCATAGACGG AGCTCTTTGT CCAAAGGATA
GATCAGGTCT ACGTCGTGCA GCTGCGACAG CCGCAACGTC CAAAATAATG ACTTACGCAT
CTCCCAATAA TGTTGTCTCT GCTTCGAGTG ATCGAGACAG TAGCAATGAT GAATGCTCGA
AAACTACTGA CCGACCCAGA AGACGCAAAC GGAAAGCTCG AAATCATTAT GGAGAATGTG
AATCTACAGA TACTGGACAC AGGAGGGTTT ACGTCAACCA TAACTATCAT GATTACGCCG
GTTGCCCCAA TCGAGTCCTC AACGAAGTTT ATACACTTGA ATCTATCGAG GCTAGAAAAA
GCAGGGGCGG CACCAGCACT CCTTTCCCGA AAGTATTGCA TCGGATGCTG GATCAAGCTG
AGGTGGATGG TTTTAGTGAA ATTGTGTCGT GGCAACCTCA CGGACGTGCA TTTTTAGTGC
ACGACCAGGC TCGGTTCGTT GCGGAAGTCA TGCCACGTTT CTTTCGCCAG ACGCGGTTCT
CATCCTTCCA GCGCCAATTG AGTCTCTACG GCTTTTTGCG TTTGACCCGT AAAGGAGCCG
ACCACAACGC ATACTACCAT GAGCTCTGTC TACGAGGTAT GCCTGAGTTG CTCGCCCAGA
TGCAGCGCAC CCGTATCAAG GGATACTGGG TGCGGCAGTC CTCCTCTCCG GAAAGTGAGC
CAGACTTTTA CAGCATGCCG ACCGTCGAAG AATCTATTGA AAACCGACCT CTTCAATCCG
ACAGTACTCA AGCACTTTCA AGTGGCAATA TTATTATTGA TGGTAATGAT TCCCTTGAGC
CCATTCCGGT AAACCCGGTT TCCAACTTGG GCGTATCCAG AGCGCCTAGT TTTGGTAAGC
TCGAACCAGG CCAATTCGCT GCTTATGGCG GTTGTATGAA ACTACCTCCA ATGCCGCCCC
TTCGGGATGG TCCTCTCTGG TCATCAGCTG CATACTTGAA AGAATTCCCT GTGGACGAAG
ATATGTCAAT CAGTATCGAT GCATTTATTG ACTCCATTGC TCCACATGCG TCGAATCCGT
CCGACACTGC TGTTCTAGAA CCGCTACCAG CCTACTCTTC TACCAGTGAC ATGAGAGAAG
ATCTAGTCAA CTTTTTGTCG AATGTCGACC TTAGCTCAGA AGACGGAGAC AACAGCGAAT
TTTACCGGAA TGAAGAACAT TTGGAGAGGT TGCACAGCAG CAAGCAAGCA CAAAATGGGA
TGGAAATGTG ATCAAAGTTC CGTTTCTTCG GCGGACTTTC CATCGAGGGC AAGGCATGCA
TCTGCAGACA ATATTTACTG TTGGTTAAAA GCTTAGTGAA AAGTTCATGA TACAA
 
Protein sequence
MTYASPNNVV SASSDRDSSN DECSKTTDRP RRRKRKARNH YGECESTDTG HRRVYVNHNY 
HDYAGCPNRV LNEVYTLESI EARKSRGGTS TPFPKVLHRM LDQAEVDGFS EIVSWQPHGR
AFLVHDQARF VAEVMPRFFR QTRFSSFQRQ LSLYGFLRLT RKGADHNAYY HELCLRGMPE
LLAQMQRTRI KGYWVRQSSS PESEPDFYSM PTVEESIENR PLQSDSTQAL SSGNIIIDGN
DSLEPIPVNP VSNLGVSRAP SFGKLEPGQF AAYGGCMKLP PMPPLRDGPL WSSAAYLKEF
PVDEDMSISI DAFIDSIAPH ASNPSDTAVL EPLPAYSSTS DMREDLVNFL SNVDLSSEDG
DNSEFYRNEE HLERLHSSKQ AQNGMEM