Gene Information |
Locus tag | PHATRDRAFT_49949 |
Symbol | HSP70G |
ID | 7198546 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | - |
Start bp | 400185 |
End bp | 403547 |
Gene Length | 3363 bp |
Protein Length | 936 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | protein heat shock protein |
Protein accession | XP_002184797 |
Protein GI | 219129229 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.816724 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTGGTGTGT CGTCGTCGTC TTTCCCGTCG TTCGCCCATC ACTCTCCACA TATAATTTAC TGTCAATCAC GACAACAACG ACGCTTGGGA AAAGCTTGTA AACAGTCAAG TTGAGTACTA CGGTTGACTC TATCTATTAC CAGTAGTGTT ATTGTTACCG GTGTTCATTG GAATTACTGC TTTTTGATTT CCTCATGAGA CTGCACAGTC ACCGTGCCGT CGTGGCGGCG GCATCCTTCT GCCTGTGGAC TTGCTTGGCC TCGTTCTCGG GAACGGTGGA AGGTCGTGCC ATTCTCGGTG TCGACTTGGG CTCACTCTAC ATGAAAGTGG CACTCGTACA GTCGGGGAGT CCGTTGGAAA TCGTTACCAA TTTACACGCC AAGCGCAAGA CGGAACAAAT GATTCTATTT GATCAACAAC AACGCTTTTA CGGGGCCGAC GCATCCGCAC TCTTGGCTCG CAAGTCCACC AAAACACCCT CCGCAATGTC CGTACTGCTC GGACGAGACG AGCAACATCC CACCGTGCGA GTAAGTCATG TATAAATGAA AATTCCGGCG GTGCAGTGCT GTACAAACAG TACAATACAG TACGGTACGG TCCAAAGTAC CGGACACTGT TTTACATGGC AGCCTACTAT CTTCAAAATC CTCGATTTCC ATCCTCACGG TTTGCTTTCT CGTTGACTTT GTTTTCTCGT TTTCTTGCGT TGCGTTGCCC TGAAATTCTT CATCCCCTAC AGGTCCTTGC GGAACGTCAC TACCCCGTCC GTCCCGTCTA CAACGAAACC CGTGCGGGAG TGACCCTCAC CGTGGACGGT GTGGAGTTCA CACCGGAAGA GCTCGTCGCC ATGGTACTCA GTCACGCCGT CGATATATCC GTCGCTTACG CCACAGAACA AGGATCCACC ATTGCCCCAC CCAAGGATGT CATGCTGACC GTTCCCTCCT ACGCCACACA ACCGGAACGG CAGGCTTTGT TGGATGCGGC GGGACTCGCC GAACTCAACG TACTCGGACT CATTGACGAG AATACCGCTT CTGCCCTCCA CTACGCCATG GACAAGTCCT TTGAAACACC GCAGCTTATC ATCTTCTACA ACATGGGCGC ATCCGCACTC CAGGTTTCAC TCATTCGTTT CTTCAACTAC GAACAACCGC AAAAGTTCGG TAAGCCCAAA ACGGTACCCG CTCTGGAAGT GCTCGGGAAA TCTTGGGACG CCACCTTGGG TGGACAGGCC TTTGATCAAA TCGTAGTGGA ATACCTAGCG GACGAATTTA ACAAGGCCTG GCACGCCTCC ACCGGAAAGA CCGAGCAGGA CGTCCGGAGC TTCCCCCGTG CCATGATTAA GTTGCGTCTC CAGGCCAACA AGGTCAAGCA CGTCCTGTCC GCCAATTCCG AAATCCCCGT CTACATGGAA GCCGTTCACG ACGACGTTGC CCTCTCGACC ACCATGACCC GGGAACAACT CGAATTACTC GCCTCCTCGC TCTGGGCACG AGCCATTCAA CCCGTCACGG ACGTCCTCCA GCAAGCCAAT GTGACGTTGG AGGAGTTGAC CATATTGGAA TTGCTCGGTG GAGGTATGCG GGTACCGCGG ATACAGACGG AACTGATCGA AAACGGACTC GGCGGCAACG CCGCCCTGTT GGGCAAACAC ATCAATTCCG ACGAATCCAT GGCCTTGGGT GCCGCCTTTG CCGGAGCCAA CATTTCCACC GCCTTTCGAG TCCGTCAGGT TGGCATGACT GATCTTAACC CGTTTGCCTT GTCCGTAACT TTGACCAATC TCCCGGACGG 
CGACGACACC GCATCGGAGG CTTCCAATGA CGAATGGAGT AAGAAAGCGA CCATTTTTAA AGCGTTTGGC AAAGTGGGTG TTAAGAAAAC TATCGCCTTT ACGCACGATA CCGATGTTCA CTGCGCCTTG GACTACGATA CAGATGGCGA AGCGAGCGTG TTGCCGGCAG GATCACAGAC GGCTCTAGAA CGGTACAGGA TATCGGGCGT AGCCGCCTTT GCGAAGGAAA TGGCCGACAA GGGTCTGGGT AAACCCAAAC TTTCCTTGCA GTTTGAATTG AGTGCTTCCG GTATCACTGC CCTCGTGAAA GCCGAAGCTG CAGTGGAAGA AACCTACACT GTCGAAGAGG AAGTCGAAGT CGAAGACGAT GGCGTCACCA ACACGACCGA GGACGAGAGC GAAGAAGAAA AGAAGAACGA TACCGAAGCT GATGGCGAAA ACAAAACGGA CGATTCGCAT GAGGTGAAAA AGGAGAAGAA AACAATAAAG GTGCAAAAGG TACGTTTTGG ACAACGGGTT GACTATGTTT AGTACAGCCG ATTTCGTTGC CTCTCTGATG ATCTACTAAC CCCACAGACG TTGTTTGGCT TACGCCAACG CCGACCTGCT TTTGTCGTAA AGGAAAAGAA ACGCCTGCAC AAGAAGGAGC TCACGGTGGA TACGTATCAT GTTGGTCGCG TAACTCCATA TTCCGCGGAG CTGCTGGCAG CATCGAAGGC GAAGCTCCTC GAGATGGCTC GAAACGACAA AGAACGCATG ATGTTGGAAG AAGCGAAAAA CCGCGTCGAA TCGTACATTT ACTACATCAA GAATAAACTC ACCGACGATG AAGAGGAAAT CGGCACGGTG TCAACCAAGG AGCAACGAGA AGAGTGCCAA AAAGCGGCCG AAGCGGCTGA AGAATGGTTG TACGACGACG GCTACTCAGC GGACCTGGCT ACAATGGAGG ACAAGTATGC CGAATTGTCG GCACCCTTCG AAAAAATCAT GCTCCGTGTG AAAGAGACTG CCGCTCGTCC GGAAGCCGTA AAGGTACTGG AGAAGAAATT GGAGGAGGTT GAAGCTCTCA TCAAGAAGTG GGAGACTTCC ATGCCGCAGG TAACAGAGGA GGAGCGAACG AAAGTCTTGG ATCAGGTGGA GGAGGTTCGC AAGTGGATTA CCAAGGTCGA AGGCATGCAA GCCAAGAAGA AGCCTCATGA CGAACCAGCT TTTGTGAGTG CGGACGTACC GTTGCAAGCA AAGGACTTGG AACTCATGGT AGTTCGACTA AGCAAGAAAC CCAAGCCCAA GCCACCCAAA AAAAAGAAAG ATGACAAGAA GCCTGGTAAC TCGACAGACG CTATGGAGGA CACTGAGGAT CCGGCCGCGG TGAACAAAAC CGAAGCCAAT TCGAGCGAAA GCGCTGAAAA CAAGAGCGAA GCCGAAGGGG CGCAGAGCAA CGAAACACTT CCGGAACCCA CGAGTGGCGA CGCGGGCATG GATGAAGAGC TGTAGATAGG AAGGTCTTTT AGTTACTTTC AGTCAATTAC TGAATCCATA GGGATAGGAT GTTATCGTAC GTG
|
Protein sequence | MRLHSHRAVV AAASFCLWTC LASFSGTVEG RAILGVDLGS LYMKVALVQS GSPLEIVTNL HAKRKTEQMI LFDQQQRFYG ADASALLARK STKTPSAMSV LLGRDEQHPT VRVLAERHYP VRPVYNETRA GVTLTVDGVE FTPEELVAMV LSHAVDISVA YATEQGSTIA PPKDVMLTVP SYATQPERQA LLDAAGLAEL NVLGLIDENT ASALHYAMDK SFETPQLIIF YNMGASALQV SLIRFFNYEQ PQKFGKPKTV PALEVLGKSW DATLGGQAFD QIVVEYLADE FNKAWHASTG KTEQDVRSFP RAMIKLRLQA NKVKHVLSAN SEIPVYMEAV HDDVALSTTM TREQLELLAS SLWARAIQPV TDVLQQANVT LEELTILELL GGGMRVPRIQ TELIENGLGG NAALLGKHIN SDESMALGAA FAGANISTAF RVRQVGMTDL NPFALSVTLT NLPDGDDTAS EASNDEWSKK ATIFKAFGKV GVKKTIAFTH DTDVHCALDY DTDGEASVLP AGSQTALERY RISGVAAFAK EMADKGLGKP KLSLQFELSA SGITALVKAE AAVEETYTVE EEVEVEDDGV TNTTEDESEE EKKNDTEADG ENKTDDSHEV KKEKKTIKVQ KTLFGLRQRR PAFVVKEKKR LHKKELTVDT YHVGRVTPYS AELLAASKAK LLEMARNDKE RMMLEEAKNR VESYIYYIKN KLTDDEEEIG TVSTKEQREE CQKAAEAAEE WLYDDGYSAD LATMEDKYAE LSAPFEKIML RVKETAARPE AVKVLEKKLE EVEALIKKWE TSMPQVTEEE RTKVLDQVEE VRKWITKVEG MQAKKKPHDE PAFVSADVPL QAKDLELMVV RLSKKPKPKP PKKKKDDKKP GNSTDAMEDT EDPAAVNKTE ANSSESAENK SEAEGAQSNE TLPEPTSGDA GMDEEL
|
| |
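The coordinate and length fields in the record above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the source database: the helper names `parse_record` and `span_length` are hypothetical, the field values are copied from the record, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon.

```python
# Hypothetical helpers for illustration; values are copied from the record above.

def parse_record(text):
    """Parse 'Field | value |' lines into a dict, skipping rows with no value."""
    fields = {}
    for line in text.splitlines():
        parts = [p.strip() for p in line.split("|")]
        if len(parts) >= 2 and parts[0] and parts[1]:
            fields[parts[0]] = parts[1]
    return fields


def span_length(start_bp, end_bp):
    """Length of a closed interval [start_bp, end_bp] (assumed 1-based, inclusive)."""
    return end_bp - start_bp + 1


RECORD = """\
Strand | - |
Start bp | 400185 |
End bp | 403547 |
Gene Length | 3363 bp |
Protein Length | 936 aa |
Translation table | |
"""

fields = parse_record(RECORD)
start = int(fields["Start bp"])
end = int(fields["End bp"])
gene_len = int(fields["Gene Length"].split()[0])
protein_len = int(fields["Protein Length"].split()[0])

# The reported gene length should match the coordinate span.
print(span_length(start, end) == gene_len)  # True

# Assumption: one stop codon follows the 936 codons of the protein.
coding_bp = (protein_len + 1) * 3
# Bases in the gene span that are not accounted for by coding codons.
print(gene_len - coding_bp)  # 552
```

Under these assumptions, the 552 bp difference fits the record's "Is gene spliced | Yes" entry, since a spliced gene's genomic span also contains non-coding sequence; the sketch does not determine how that sequence is distributed.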