Gene Information |
Locus tag | PHATRDRAFT_46333 |
Symbol | HSP70B |
ID | 7201514 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 959314 |
End bp | 961495 |
Gene Length | 2182 bp |
Protein Length | 681 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | protein heat shock protein |
Protein accession | XP_002180736 |
Protein GI | 219119972 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
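The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the protein is encoded by one uninterrupted reading frame ending in a single stop codon (the record lists the gene as not spliced). The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 959314
end_bp = 961495
protein_length_aa = 681

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus one stop codon, no introns.
coding_bp = 3 * (protein_length_aa + 1)

# Bases inside the gene span that the coding frame does not account for.
noncoding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, noncoding_bp)  # 2182 2046 136
```

Under these assumptions the computed span matches the listed Gene Length of 2182 bp, and the 136 bp remainder is consistent with the gene sequence beginning upstream of the ATG start codon.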
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000192711 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCACCTTTAC CGTTGCGCTG CCATTACACT GTTCTCCCAC AGCAGTTCTC TAGACATAGA CAAACACTGC TTGCTTGTCA TGTCTATTCG GATCTCTCTG CTTGAAAGGG TGAACAAGAA TTGACCTTGG TGGACTATGC TGCCACGAGT TGTTGACGGT CCTTCCATCG GCATCGACCT CGGAACGACA AATTGTGCTG TCGCGGTCTG GGATTCTACT CGGGGTCATC CGAAATGGAT GCGACTAGCT AACATAGCGA CACCGCCACG CAATTCTAGC AAGATAGGTC GCGTTGTTCC CTCCGCTGTC CTCTTCCTGA CGCGAGATGC AGCAGCACTA CACAACCTTT TGGACGAGGC TCAAGATGTC GACGGGATTC TGGAACGATC TGATCTGGTG GCTCTCGTTG GCAACAGTGC AGTGAAAATC CTGGAGAAAT CTCAAGCACG TGAGATTGAA ATGCCTTTCT CACCGGCTCA GGTATCGGCC GCTTTCGTGG CCAGTGTCAA GCGATTGATT GGCGCAGCCA ACTCGATAGC TTTTCGGAAT AGCGATTTTT TGGACTCGCT CCCCTACCGA GTAGTCTCGG GCGGAACGGA AGAAAATAAT CTATATCTCG AGATAACTCC TCTGGGGTCC TCCGAAACAG TCCTCGTGAC GCCAACGCAA GTATCAGCCG TTTTGTTACA ATCGCTGCGT TTATCTGCCG GACGGTACCT AAGGCTTTGT GCTGCAAAAA AGCAGTTGAA AGTTCCCGGG GATGCTAGAG AGCCTTGCTG TCACGCTGTG GTCGGTGTGC CTGCGCAGTA TGGTCGGGCG CAGCGTAGTC TGATAGAGCG AGCTTGCCGA ATTGCCGGCT TTACGGGTCG CGTGCTGTTA CTTACAGAAT CAACTGCTGC CGCCATTGCC TACGGCTTAA CGGTCGGAAT TACTATTGCA ACAACGAAAA CGATTCTAGT ATTCGATATG GGAGGTGGGA CAACCGATAT TACAATTGCT GAAATGCATC CACCAGCTTT AGAAGCGCCA ACGTCAGTCA ATGCCGACTT TGAGGTCAAG GTGACGTTCG GAGACCAAAG GTTAGGCGGG GATGATATGG ATGCAGCCCT TTCAAGATTG GTATGGCAGC GTCTGAAAGT GCATCCGTCA GACTGTAGTT TGCATACACA GCGTGAGGTT TTACTCCACG GCAAGCAAGC CAAAGAAGCA TTGTGTGGCA ATGCTGAGCA TGGCGATCTG CAACCAGTCA ATTCATACTC AATGACTGTT CATGGCCGGT CAATATGTTT GACGCGAAAG GATTTCGAGG CTGTGATTGA GCCGCTAATA CATCGAGCAA AGAAGTTGAT CCAAGAGGCG ATACGGCAGT ATAAAGCTAC TTCTTGCACT TCGATTATGA CTGATCAAAC CGACGCTGCT ATAACATTCG ACGAAGTTCT TCTCGTCGGT GGAGCCACCC GTGTTCCGGC AGTACGGTCA CTGCTCAAAC AACTTTTCCC TCCGCCGGTC CCTCCCGAAT TGTGTTTGTC TTTGAATGCC ATGGCTGCAG TGGCACAAGG AACGGCTATT CAAGCCGCTT TGTGGTCCGG TTTGATACCA AGATACGACA TTGAATCTGC TCTAATGCTG GATACCGTAC CACACGCAAT AGGAGTGCGT TTGAGTGAAG CACACTTCAT TGAGGTCATT AAGAAAGGAT CACCTTTGCC AGCGACCGGA TTTGCTCCGT TTCAGTTAGC TGATGCTCGA CAAGCGGGCG TTACAGTACA GGCCGTGGAG CAAGTCGACA TCGAAACCTA 
TGAAAGCATT GGAGACTTCA CCTTTTTGCT GCATCGGATG ACGAAGGCAC AGCTGGCAAA CTTGGGGAAC GACGCACGAT TGGTAGATGT GGGCATGAAA CTGAACGCGC AAGGAGAGTT TGTTGTTTCA ATTTTGGACC CTCACGATCC CGAGCACGTT AAGAGACGTT TAAATCATGA AAAAATACGT GCGGCATCTG CCGATGGATC AAAAGCGGGC CATAGTGTTT TGAATACCTA CACCGCAACT GCAGAGGTTT GTGAAGGTGA AGCGTTATTG ACGGATCAAC TCGTGTTGTG CTTTGTCTGT ATACTATTGT TTGTGGTCTA CGTTATGGTC AAACTTTTAG TTGCCGAACC AATTTCTGTC CTGCAGCCAT GA
|
Protein sequence | MLPRVVDGPS IGIDLGTTNC AVAVWDSTRG HPKWMRLANI ATPPRNSSKI GRVVPSAVLF LTRDAAALHN LLDEAQDVDG ILERSDLVAL VGNSAVKILE KSQAREIEMP FSPAQVSAAF VASVKRLIGA ANSIAFRNSD FLDSLPYRVV SGGTEENNLY LEITPLGSSE TVLVTPTQVS AVLLQSLRLS AGRYLRLCAA KKQLKVPGDA REPCCHAVVG VPAQYGRAQR SLIERACRIA GFTGRVLLLT ESTAAAIAYG LTVGITIATT KTILVFDMGG GTTDITIAEM HPPALEAPTS VNADFEVKVT FGDQRLGGDD MDAALSRLVW QRLKVHPSDC SLHTQREVLL HGKQAKEALC GNAEHGDLQP VNSYSMTVHG RSICLTRKDF EAVIEPLIHR AKKLIQEAIR QYKATSCTSI MTDQTDAAIT FDEVLLVGGA TRVPAVRSLL KQLFPPPVPP ELCLSLNAMA AVAQGTAIQA ALWSGLIPRY DIESALMLDT VPHAIGVRLS EAHFIEVIKK GSPLPATGFA PFQLADARQA GVTVQAVEQV DIETYESIGD FTFLLHRMTK AQLANLGNDA RLVDVGMKLN AQGEFVVSIL DPHDPEHVKR RLNHEKIRAA SADGSKAGHS VLNTYTATAE VCEGEALLTD QLVLCFVCIL LFVVYVMVKL LVAEPISVLQ P
|
| |