Gene Information |
Locus tag | PHATRDRAFT_55890 |
Symbol | Hsp70_2 |
ID | 7202385 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 258678 |
End bp | 261517 |
Gene Length | 2840 bp |
Protein Length | 732 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181519 |
Protein GI | 219122370 |
COG category | |
COG ID | |
TIGRFAM ID | |
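
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based inclusive coordinates, that the coding sequence consists of the protein's codons plus one stop codon, and that the annotated gene span contains no untranslated regions. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 258678
end_bp = 261517
protein_length_aa = 732

# Assumption: coordinates are 1-based and inclusive,
# so the span length is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: coding length = three bases per residue plus one stop codon.
coding_length_bp = protein_length_aa * 3 + 3

# Assumption: the gene span holds only coding sequence and introns (no UTRs),
# so the remainder is an estimate of total intron length.
non_coding_bp = gene_length_bp - coding_length_bp

print("gene length (bp):", gene_length_bp)
print("coding length (bp):", coding_length_bp)
print("non-coding remainder (bp):", non_coding_bp)
```

Under these assumptions the computed span matches the listed Gene Length of 2840 bp. The coding length is shorter than the span, which is consistent with the "Is gene spliced | Yes" field.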
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGCATC TTCCATCCTC CTCCACCCTC CTAACCTGCG TGTCGGTACT TCTCTCCGGA GCTCACCCCG CAAAAGCATC ATGGCTAGCT CGTAGGACAG TAGAAAAGCC GACTCTGGCA CGAATCCATG AGCAGCGAGA CTCTACGGAC CGGAAATCGA GGGCGCCCTT CTTACATCGT CGACAGAATC ACTACGCGCA TCATCCCCAC GGGCTACTCA CTACCCTCCG TGGTGGCGCC AGTAATGCAG ACAACAAAAT GGACGGTCCT TGTATCGGAA TTGATCTCGG AACGACGTAT TCGTGTGTCG CTGTCTGGAG AAATTCTCGG GTGGACGTGT GTCCGAACGA ACAAGGTAAT CGGATTACCC CTTCGTACGT GGCTTTCGGC AAGGACGGCA CCCGACTAAT TGGCGACGCT GCGAAGAATC AGGCACCGTC CAATCCAACG GGTACGTTTT TTGACGTGAA ACGCCTGATT GGTCGCAAGT ACAACGATGC TACCGTGCAA AAGGACAAGA CGCTCTTTCC GTTTTCGATC GAAAAGGGTC CCGACGACAA ACCCTTGCTC GGATTGCCGA CCGAGTTAAA AAAGCAGCAA GGCAAATCTC AATATACTCC AGAAGAAGTA TCGGGAATGA TTCTGCGCAA ACTCAAGGAA ACCGCGGAAA CCTTTTTGGG ATGCGAAGTG AAACACGCCG TCGTAACCGT CCCAGCGTAC TTTAACGACG CCCAACGACA GGCGACGAAA GACGCCGGAA CAATTGCCGG TCTTAAGATC GAACGAGTCA TTAATGAACC CACCGCAGCC GCAATTGCCT ACGGACTAGA TAAGCAAGAT GTCGAAGAGA ATGTGTTGGT TTTTGATCTT GGCGGTGGAA CGTTCGACGT GACTCTGCTC TCGATCGACC ACGGAGTCTT TGAAGTGCGT GCTACGTCGG GAAACACACA TCTGGGAGGT GAAGATTTTG ATCAACGTCT CATGGAGTAC TGCATGAGTG TTTTCAAACG TCAGTCTGGA ATTGATGTTT CCGGGGACAA ACGAGCGATT CAACGACTTC GCAAACAATG TGAACTGGCC AAGCGGACGC TATCGACTCA AACTTCCGCC ACGATCGATT GTGACGCTTT TTCCAACGGT GTTGACTTTA CCACCACCAT TTCGCGAGCC AAATTTGAAG AATTGAACAT CGACTTATTC AAAAAGACTA TGGTTCCTGT TACTCAAGTC CTGAAAGATG CGGGAATGAG CAAAAGCGAG ATTGACGAAA TTATTCTCGT TGGTGGTTCT ACTCGAATTC CAAAAGTCCA GCAGATGTTG ACGGAGTACT TTGGCGGCAA GGAACTGAAT AAGGGGATCA ACCCGGATGA AGCCGTGGTA CGTATACCTT TTGGAAGGAA AAAATGGTTC AGGGACTGGT GCTACTGACG AAAACTTCGT AACGCTTGGC TACTGAAGAG GGCTTTGCGA CCGGAGTAGT TACCGGGCCT ACTCGCCGAG TCGCTCCTGA TGCGAACAGT TACACATAAA CAACAACCGA AGCACAACAA CTCCATGGGA CTATTGCCGA ATGACTAGAA GTTTTCCGTA TGCTGACACG CTGCTGTTTG TATTTTTGTA GGCCTACGGT GCTGCGGTCC AGGGAGGAAT TTTGAGCGGC GACGCGGCCG AGGCGACCAA GGATGTGTTG TTACTTGATG TCGCACCCTT GTCACTAGGC ATTGAAACCG CCGGTGGCGT CATGACTCCC TTGATCAAAC GGGGTACCAC AATTCCGGTC AAAAAGTCCC AAGTATTTTC 
GACGTACGCT GATAATCAAC CCGGCGTGAA TATTCAAGTT TTTGAAGGCG AACGGTCCAT GACTAAGTCC AACCGGCTTT TGGGGCAATT CGAGTTGGCG GGCATCCCAC CCGCTCCACG CGGTGTACCC CAAATTGAAG TTAGTTTTGA CGTGGATGCC AATGGAATTC TAAGCATATC GGCATCCGAC AAAGGGACTG GTAAATTGGA AACCTTAACC ATCACCAGTG AAAAGGGTCG GCTTTCGGAA GAAGAAATTG AACGAATGAT ACAAGAAGCG GAACAGTTCG CCGATCAGGA TGCCGCCGAA AAGGAAAAGG TCCAGGCTCG GAATGATTGT GAAGCATATT TGTACAATCT GAAAAACTCC ATGAATGATG CTCTTCAGGA TAAGCTTTCT GCGGAAGACA AAGAAACACT ATCACAGGCC ATTGAGGAAG GTTTGGTGTG GTTGGAAAAC AACCCCGCGG CGGAAAAAGA CGCGTACGAT GCCAAGCAAA AGGAGGTGGA ACAAGTTGCT AATCCCATTC TCAAGCGAGC CTACGAAAGC ACCAGTGATG GTGCGGCCGC TGCTGACAAT GATTTTTTAG GTGAAGATTT GGACGGTGTA GATGACGGCC CGAGTGTAGA AGAAGTAGAT TAACCATAGC ATAGAACACA GACAAGCTCT GAAAGCTTCC TTCGATACCA ACTCAATCTT TCGGTTTTCG AGTTGGCTCG TGAAGCCACT GCCAAGAATC AGCCTCCATG TCCGGTAGTT GAGCCCAAAA ACTGCGTACG TGCTGGACGC CATCCCACTG ATACTCACCA GTCGTCTGCA CTCCAGCAAA GGGACCCAAT ACTGGATCTT GTAGGAGTTC GTAAGTGTCG TATTCCCTCT CGGTCAACAC ATGCTTCAAA ATTGTTTTGT ACGAGTTACT CCAATCAACG TTTGCAAACC GTGTGGCTAT CACTTGCAAA GTCTGTACCT GAGCAACATC TTTCATCCCC AACAGTGTAC ACGGCACGCC AGGATAAGCC AAGGCGCAAA CAAGCGCAAG
|
Protein sequence | MVHLPSSSTL LTCVSVLLSG AHPAKASWLA RRTVEKPTLA RIHEQRDSTD RKSRAPFLHR RQNHYAHHPH GLLTTLRGGA SNADNKMDGP CIGIDLGTTY SCVAVWRNSR VDVCPNEQGN RITPSYVAFG KDGTRLIGDA AKNQAPSNPT GTFFDVKRLI GRKYNDATVQ KDKTLFPFSI EKGPDDKPLL GLPTELKKQQ GKSQYTPEEV SGMILRKLKE TAETFLGCEV KHAVVTVPAY FNDAQRQATK DAGTIAGLKI ERVINEPTAA AIAYGLDKQD VEENVLVFDL GGGTFDVTLL SIDHGVFEVR ATSGNTHLGG EDFDQRLMEY CMSVFKRQSG IDVSGDKRAI QRLRKQCELA KRTLSTQTSA TIDCDAFSNG VDFTTTISRA KFEELNIDLF KKTMVPVTQV LKDAGMSKSE IDEIILVGGS TRIPKVQQML TEYFGGKELN KGINPDEAVA YGAAVQGGIL SGDAAEATKD VLLLDVAPLS LGIETAGGVM TPLIKRGTTI PVKKSQVFST YADNQPGVNI QVFEGERSMT KSNRLLGQFE LAGIPPAPRG VPQIEVSFDV DANGILSISA SDKGTGKLET LTITSEKGRL SEEEIERMIQ EAEQFADQDA AEKEKVQARN DCEAYLYNLK NSMNDALQDK LSAEDKETLS QAIEEGLVWL ENNPAAEKDA YDAKQKEVEQ VANPILKRAY ESTSDGAAAA DNDFLGEDLD GVDDGPSVEE VD