Gene Information |
Locus tag | PHATRDRAFT_54019 |
Symbol | HSP70A |
ID | 7196800 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1695430 |
End bp | 1697709 |
Gene Length | 2280 bp |
Protein Length | 653 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | protein heat shock protein Hsp70 |
Protein accession | XP_002177351 |
Protein GI | 219111199 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAAAAGAGCG AAGCGAAACG CATTTTAATA AACTTTTTGG AAAACAGCAA GCAAATACAA ACATGAGTGG TACGTTCTGC TGTTGTGTTG ACAATGACAA ACAGGAACTG TTAGAATAAG TAGTGAGAGT GCGCGAGAGA ATAGCAATGG ATTGCAGCAG CTCATACGAA AAGTTTGTGT AGGACCGGTC ATTTGCGTTG CCTACACGAT TCGACCAACC TCCCTAACTC ACACTCTTCT TATTGTTCTT GCCGACTATA GTCACTGGAG AAAGCGTCGG TATCGATTTG GGTACCACCT ACAGTTGTGT TGGAGTTTGG CAAAATGATC GCGTTGAAAT CATTGCCAAC GACCAGGGAA ACCGTACGAC TCCTTCGTAC GTTGCCTTTA CCGAGACTGA GCGTCTCATC GGTGATGCCG CCAAGTCGCA GGCTGCCATG AACGCTCACA ACACCGTCTT CGATGCCAAG CGTCTTATTG GTCGCAAGTT CACCGATGCC GGTGTTCAAG GGGACATGAA ACATTGGCCT TTCAAGGTCG TTTCCGGTCC GGGAGGCACC CCCATTATCG AAGTTGACTA CAAGGGAGAA AGCAAGCAGT TCAAAGCTGA AGAAATTTCC AGTATGGTTC TCCAGAAGAT GAAGGAGATT GCCGAGGCCT ATCTTGGAAA GGAAGTGAAG AACGCTGTAG TCACCGTTCC CGCTTATTTC AACGACTCCC AGCGCCAGGC TACCAAGGAC GCCGGTGCCA TTTCTGGACT GAATGTCCTC CGCATCATCA ACGAGCCGAC GGCCGCTGCT ATTGCGTATG GTCTCGACCA GAAGGGCGAA GAGAAGAATG TTCTCATCTT TGATCTTGGT GGCGGTACTT TTGATGTTTC TCTTTTGACT ATTGAAGAGG GAATCTTCGA AGTCAAGGCC ACCGCCGGAG ACACGCATTT GGGTGGAGAA GATTTCGACA ATCGTCTCGT CGACTATTTC CTCCAGGACT TCAAACGTCG TCACCGCAAG GATATGTCGC AGAACCAGCG CTCCCTCCGT CGTCTTCGCA CGGCTTGCGA ACGCGCAAAG CGTACTCTTT CGTCCTCCAC CCAGGCCCAT ATTGAGATCG ATTCCCTCTT TGACGGTATC GATTTCAATT CTACCATCAC CCGTGCCCGT TTTGAAGATT TGTGTATGGA CTACTTCAAG AAGTGCATGG AGCCTTGCGA AAAGGTTTTG CGCGATTCCA AGATTGCCAA GGGCCAGGTT GACGAAATTG TCCTCGTCGG AGGTTCCACC CGTATCCCTA AGGTGCAATC CATGCTTTCC GAGTTCTTTA ACGGCAAAGA GCCCTGCAAG TCCATCAACC CCGATGAAGC TGTAGCTTAC GGTGCCACAG TTCAGGCTGC GATTCTCTCC GGAGCTGACA AGAGTGAGAA GCTCTCTGAG CTCTTGCTTT TGGACGTTAC CCCTTTGTCG CTCGGTTTGG AAACCGCTGG AGGTGTGATG ACCACCCTCA TCAAGCGTAA TACGACCGTC CCGGCCAAAA AGACCCAGAC TTTCTCCACC TACGCCGACA ACCAGCCCGG TGTACTCATT CAGGTCTTTG AGGGCGAACG TTCCATGACC AAGGACAACA ACTTGCTCGG AAAGTTCAAT CTCGACGGCA TTCCGCCCAT GCCCCGTGGA CAACCCCAAA TTGATGTCAC CTTTGATATT GACGCTAACG GTATCCTCAA CGTATCGGCC ATTGAGAAGT CTACTGGAAA AGAAAACAAG ATTACTATCA CCAACGATAA AGGCCGTCTT TCTCAGGATG 
AAATTGAGCG CATGGTGTCC GAAGCCGAGA AATACAAGGC CGAGGATGAC GCCAACAAGA ACCGCATTGA AGCCAAGAAC GGACTCGAGA ATTACTGCTA CAGTCTCAAA ACTTCCATCA GCTCGGAAGA AGTCAAGGAC AAGATCCCTG CTGACGACAA GACTGCGCTC GAAGCTGCCA TTGAAGACGC GATCAAGTGG CTCGACGCGA ACCCGACCGC CGAAAAAGAA GAATACGAAG AGAAGCAAAA GAGCCTCGAA GGTATTGCGA TGCCCATTTT GCAGAGCATG GGTGGCGGTG CCGGTGGCAT GCCCGATATG GGTGGTGCTG GAGGTATGCC TGACATGGGC GGCGCTGGTG GGGCTCCTCC GTCCGCCGAT CCGGCCTCGG GACCTACCAT TGAAGAAATC GATTAAGCGT CGTTTCCATT TTCTTGATAG AATCCAGAAT GATTATAAAA TAATTACGTT TTGATATGAT
|
Protein sequence | MSVTGESVGI DLGTTYSCVG VWQNDRVEII ANDQGNRTTP SYVAFTETER LIGDAAKSQA AMNAHNTVFD AKRLIGRKFT DAGVQGDMKH WPFKVVSGPG GTPIIEVDYK GESKQFKAEE ISSMVLQKMK EIAEAYLGKE VKNAVVTVPA YFNDSQRQAT KDAGAISGLN VLRIINEPTA AAIAYGLDQK GEEKNVLIFD LGGGTFDVSL LTIEEGIFEV KATAGDTHLG GEDFDNRLVD YFLQDFKRRH RKDMSQNQRS LRRLRTACER AKRTLSSSTQ AHIEIDSLFD GIDFNSTITR ARFEDLCMDY FKKCMEPCEK VLRDSKIAKG QVDEIVLVGG STRIPKVQSM LSEFFNGKEP CKSINPDEAV AYGATVQAAI LSGADKSEKL SELLLLDVTP LSLGLETAGG VMTTLIKRNT TVPAKKTQTF STYADNQPGV LIQVFEGERS MTKDNNLLGK FNLDGIPPMP RGQPQIDVTF DIDANGILNV SAIEKSTGKE NKITITNDKG RLSQDEIERM VSEAEKYKAE DDANKNRIEA KNGLENYCYS LKTSISSEEV KDKIPADDKT ALEAAIEDAI KWLDANPTAE KEEYEEKQKS LEGIAMPILQ SMGGGAGGMP DMGGAGGMPD MGGAGGAPPS ADPASGPTIE EID
|
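The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon; the record does not state the convention, but this one reproduces the listed Gene Length. It also assumes one stop codon follows the 653 codons of the listed protein. The function names `gene_length` and `coding_nucleotides` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_nucleotides(protein_length_aa: int) -> int:
    """Nucleotides needed to encode the protein, ASSUMING one stop codon."""
    return 3 * (protein_length_aa + 1)


# Values copied from the record above.
length = gene_length(1695430, 1697709)
coding = coding_nucleotides(653)
non_coding = length - coding  # bases in the gene span that are not coding

print(length, coding, non_coding)
```

Under these assumptions the span is 2280 bp, which matches the Gene Length field, and the coding portion accounts for 1962 of those bases. The remaining 318 bases are non-coding, which is consistent with the record marking the gene as spliced.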