Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41417 |
Symbol | HSP70F |
ID | 7199263 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | - |
Start bp | 113865 |
End bp | 116583 |
Gene Length | 2719 bp |
Protein Length | 841 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | heat shock protein Hsp70 |
Protein accession | XP_002185428 |
Protein GI | 219130555 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.325838 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTCGG TAGTTGGTGT GGATTTGGGA TACCAAAACA GCGTGATTGC TGCGGCGGGA CGGGGTGGCG TCGATGTCAT TCTCAACGGG AATTCCAACC GTTTGAATCC GTACGTACCC AAATACTAGA TAACAATGTA GATGCGTGTG GTTGATTTCG GCGGAAGAAG AGATATGCGC TCGGTACCCG TCCGTATCGA ATTGTACCGT ACTTGGACCA ATCGATTCTT GTGCTCACCC CGTTTGTGGC TTGCTTATGT GTTCGTTTGT TCGTTCATTC ATTCGCTTGT ACTTGTCCAA CAGGTCCATG GTGGGCTTTG ACGAGAGTCG CAAGATGGGT GAACTCGCCA CGTCCGGAGC CTCGAGTAAT TACAAGTACA CCGTGACGGC CATGAAGCGC TTGATCGGTT TGGCCTTTGA CGATCCCGTC GCGACGCTGG AAATGCAGCG TCTGCCGCTC CAGTTTTGTC GTGTTCCGCA CGCGGACGGC GTCGACTCGA TTGGCGTACA AACGTCCAAG GACCCCGACG CGAACGACAG CGCGACTACC GTCGTGCCCA TGGAAGCCGT GGCCGGAATG ATGGTCCGAC ACATGGGGAC CATCGTGGCG CAAAAGATCG CGCAAGAGAC CAACACGTCC GTGGAAGCCA ACATGCCGCA AGACTGGGTA TTGACCATTC CCTCCTACTA CACCGACGCA CAACGTAGAG CCTTGCTCGC GGGGTGTGCC ATGGTGGGAC TTACCGGTGT CCAACGACTC CTGCACGAAA CCACCGCCAC GGCCCTCGCC TACGGGATCT TCAAGGATCT CAAGAAGGAG TTCCAGGCGG ATCAACCCAC ACACGTGCTC TTTCTCGATA TGGGCGCCTC GGCCTACACC GTATCGCTCG TGGCCTTTGA ACCGGGCAAA CTCATCGTCA AGAGTACCAC CGGGGACGCC AATCTCGGTG GACGCGATTT CGACTGGATG ATCGTTACCT GGATGGCGAA CAAATTTGCC GAAAAGTTCG GAGCCAAGCT GTCCGGCAAT CCACTCGATC GTCCCAAAAC GGTTCTCAAA CTCCTCGCCG CCGCCGAAAA GGCCAAGAAA ACACTCAGTC CGCAGGGCGT CAAGGAAGCC CGCATCAATC TAGAAATGCT CATGGACGAT TTGGATTTTA GCATTACCCT AACGGCAGCC GAGTACGAGC AAATGTGCGA ACCACTCCTG GCGCGCTTGG AAGCTCCCAT TGTCCAGGCC TTGGCCGAAG GCAAGCTCAC GGCGGCCGAT TTGCACTCGG TCGAAATTGT GGGTGGTTCG ACCCGTATCG GTTGCGTCAA ACGGGCCCTC ACGGGATTCC TCACCAACAG TGGCGCCGGC GCGGCCGCCA CGGAACTGCT CTCCACGACG CTCAACGCCG ACGAAGCCGT CGCCCGCGGC GCGGCCCTGC AATCCGCTAT TCTCTCACCC CGATTCAAGG TCCTCCCCTA CGACATTCAC GAATTCCAAG CCTGGCCCAT TCAACTGCGT TGGGACGAGG ACGCCAACGA CGAGGCACAA GGCATGGAAG TGGACGCCAC CACGGGGGCC CAGCCGACCA ACGCGGTCGT CATGTTCGAC CGCGGTTTGT CCTTTCCCAT TGTGCGTCGT GTGACATTGA AACGCAACCA GGGAACCTTC GCCGTGCAGG CGGAATACAA CGAAAAGGCC CTCGAGTACG GTTTGCCGGC GTCGGGGAAC GCGATTGCGA CCTTTTCCGT GCAGGCGCCC ATATCCGAAG AAGCCAAAAA GATTCGCGTC AATGTCAAAC AGGATATTCA CGGTATCATC CAACTGAGTT CGGCACAAAT GGTGGAAGAG ATTGCCGACG AGGAAGAACC CGAGTCGTCG GGTGCTGCAC CCCTCAAGGA CGGTGAGGAA GCCGCCGCTC CGGAAAACAA GAAGAAAAAG GTCAAAAAGA CCAATTTAGT GTTTACTACG ACGCGTCCGT TGGATTGGAC CGAGGCCGAA ATCCAAAAGG CGTACCAGGC CGAACTGGCC ATGGCGCTCA AAGACAAACT CGTGCAGGAA ACGTCGGACA AACGTAACGA ACTCGAGTCC TACATTTACG ATATGCGGGA CAAGATTGGT TCAGAATCGG CTCTGGGATC GTTCGGTACC GACGCGGAAA AGGCAGCCTT TATTACCCAA AACGAGGCGA TGGAGAATTG GTTGTACGAG GACGGGTTCG ACGCGAGCAA GGAAACGTAC GCTACCAAAC TGAAGGAATT GCAAAAATTG GGCAGTCCCA TGGAGCGCCG TCAGGCCGAA CAAGAAGGTC GGCCGGCGGC CGTGAGTACT TTGCAGCAGA GTCTGGAAAA GTACCAGAAT TGGGTCAACC AGGAAGCGAC GGACGAGGCC TACGCACACA TTACCGACGA TGAACGTCAG CGTGTCCAGA GCAAATGTGA CGAAATCAGT GCCTGGATGT ACGACATGTT GGATCAACAG GGTGCTCTTC CCAATCATCA GGACGCGGTT TTGACCGTGT TTGATTTGCA GGCCAAGAAC AAGGACTTGA TCGATACCTG TGGACCAGTC TTGCGCAAGC CCAAGCCGGC GGCGCCGAAA AAGGAGCCCG AACCCGCGGC GCAGCCGGAA GGCGAAGCCA AGACGGCCGA CGAGGTTCCT CCCCCCGAGC CCATGGAGGG AGTCGAAGAA AGCGAAGCGA AACCGGCCGA AACTATGGAG ACGGATTAG
|
Protein sequence | MSSVVGVDLG YQNSVIAAAG RGGVDVILNG NSNRLNPSMV GFDESRKMGE LATSGASSNY KYTVTAMKRL IGLAFDDPVA TLEMQRLPLQ FCRVPHADGV DSIGVQTSKD PDANDSATTV VPMEAVAGMM VRHMGTIVAQ KIAQETNTSV EANMPQDWVL TIPSYYTDAQ RRALLAGCAM VGLTGVQRLL HETTATALAY GIFKDLKKEF QADQPTHVLF LDMGASAYTV SLVAFEPGKL IVKSTTGDAN LGGRDFDWMI VTWMANKFAE KFGAKLSGNP LDRPKTVLKL LAAAEKAKKT LSPQGVKEAR INLEMLMDDL DFSITLTAAE YEQMCEPLLA RLEAPIVQAL AEGKLTAADL HSVEIVGGST RIGCVKRALT GFLTNSGAGA AATELLSTTL NADEAVARGA ALQSAILSPR FKVLPYDIHE FQAWPIQLRW DEDANDEAQG MEVDATTGAQ PTNAVVMFDR GLSFPIVRRV TLKRNQGTFA VQAEYNEKAL EYGLPASGNA IATFSVQAPI SEEAKKIRVN VKQDIHGIIQ LSSAQMVEEI ADEEEPESSG AAPLKDGEEA AAPENKKKKV KKTNLVFTTT RPLDWTEAEI QKAYQAELAM ALKDKLVQET SDKRNELESY IYDMRDKIGS ESALGSFGTD AEKAAFITQN EAMENWLYED GFDASKETYA TKLKELQKLG SPMERRQAEQ EGRPAAVSTL QQSLEKYQNW VNQEATDEAY AHITDDERQR VQSKCDEISA WMYDMLDQQG ALPNHQDAVL TVFDLQAKNK DLIDTCGPVL RKPKPAAPKK EPEPAAQPEG EAKTADEVPP PEPMEGVEES EAKPAETMET D
|
| |