Gene PHATRDRAFT_49949 details


Gene Information

Locus tag: PHATRDRAFT_49949
Symbol: HSP70G
ID: 7198546
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011693
Strand:
Start bp: 400185
End bp: 403547
Gene Length: 3363 bp
Protein Length: 936 aa
Translation table:
GC content: 52%
IMG OID:
Product: protein heat shock protein
Protein accession: XP_002184797
Protein GI: 219129229
COG category:
COG ID:
TIGRFAM ID:
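The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which matches the listed gene length), and it assumes one stop codon follows the 936 codons of the protein. The function names are hypothetical and not part of any database tooling.

```python
# Values copied from the gene record.
START_BP = 400185
END_BP = 403547
PROTEIN_LENGTH_AA = 936


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(protein_length_aa: int) -> int:
    """Bases needed to encode the protein, assuming one trailing stop codon."""
    return 3 * (protein_length_aa + 1)


gene_length_bp = span_length(START_BP, END_BP)
coding_bp = coding_length(PROTEIN_LENGTH_AA)
# Remainder of the span: introns plus any untranslated flanking sequence.
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, non_coding_bp)
```

Under these assumptions the span works out to 3363 bp, of which 2811 bp are coding. The remaining 552 bp are consistent with the record marking the gene as spliced.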


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.816724
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTTGGTGTGT CGTCGTCGTC TTTCCCGTCG TTCGCCCATC ACTCTCCACA TATAATTTAC 
TGTCAATCAC GACAACAACG ACGCTTGGGA AAAGCTTGTA AACAGTCAAG TTGAGTACTA
CGGTTGACTC TATCTATTAC CAGTAGTGTT ATTGTTACCG GTGTTCATTG GAATTACTGC
TTTTTGATTT CCTCATGAGA CTGCACAGTC ACCGTGCCGT CGTGGCGGCG GCATCCTTCT
GCCTGTGGAC TTGCTTGGCC TCGTTCTCGG GAACGGTGGA AGGTCGTGCC ATTCTCGGTG
TCGACTTGGG CTCACTCTAC ATGAAAGTGG CACTCGTACA GTCGGGGAGT CCGTTGGAAA
TCGTTACCAA TTTACACGCC AAGCGCAAGA CGGAACAAAT GATTCTATTT GATCAACAAC
AACGCTTTTA CGGGGCCGAC GCATCCGCAC TCTTGGCTCG CAAGTCCACC AAAACACCCT
CCGCAATGTC CGTACTGCTC GGACGAGACG AGCAACATCC CACCGTGCGA GTAAGTCATG
TATAAATGAA AATTCCGGCG GTGCAGTGCT GTACAAACAG TACAATACAG TACGGTACGG
TCCAAAGTAC CGGACACTGT TTTACATGGC AGCCTACTAT CTTCAAAATC CTCGATTTCC
ATCCTCACGG TTTGCTTTCT CGTTGACTTT GTTTTCTCGT TTTCTTGCGT TGCGTTGCCC
TGAAATTCTT CATCCCCTAC AGGTCCTTGC GGAACGTCAC TACCCCGTCC GTCCCGTCTA
CAACGAAACC CGTGCGGGAG TGACCCTCAC CGTGGACGGT GTGGAGTTCA CACCGGAAGA
GCTCGTCGCC ATGGTACTCA GTCACGCCGT CGATATATCC GTCGCTTACG CCACAGAACA
AGGATCCACC ATTGCCCCAC CCAAGGATGT CATGCTGACC GTTCCCTCCT ACGCCACACA
ACCGGAACGG CAGGCTTTGT TGGATGCGGC GGGACTCGCC GAACTCAACG TACTCGGACT
CATTGACGAG AATACCGCTT CTGCCCTCCA CTACGCCATG GACAAGTCCT TTGAAACACC
GCAGCTTATC ATCTTCTACA ACATGGGCGC ATCCGCACTC CAGGTTTCAC TCATTCGTTT
CTTCAACTAC GAACAACCGC AAAAGTTCGG TAAGCCCAAA ACGGTACCCG CTCTGGAAGT
GCTCGGGAAA TCTTGGGACG CCACCTTGGG TGGACAGGCC TTTGATCAAA TCGTAGTGGA
ATACCTAGCG GACGAATTTA ACAAGGCCTG GCACGCCTCC ACCGGAAAGA CCGAGCAGGA
CGTCCGGAGC TTCCCCCGTG CCATGATTAA GTTGCGTCTC CAGGCCAACA AGGTCAAGCA
CGTCCTGTCC GCCAATTCCG AAATCCCCGT CTACATGGAA GCCGTTCACG ACGACGTTGC
CCTCTCGACC ACCATGACCC GGGAACAACT CGAATTACTC GCCTCCTCGC TCTGGGCACG
AGCCATTCAA CCCGTCACGG ACGTCCTCCA GCAAGCCAAT GTGACGTTGG AGGAGTTGAC
CATATTGGAA TTGCTCGGTG GAGGTATGCG GGTACCGCGG ATACAGACGG AACTGATCGA
AAACGGACTC GGCGGCAACG CCGCCCTGTT GGGCAAACAC ATCAATTCCG ACGAATCCAT
GGCCTTGGGT GCCGCCTTTG CCGGAGCCAA CATTTCCACC GCCTTTCGAG TCCGTCAGGT
TGGCATGACT GATCTTAACC CGTTTGCCTT GTCCGTAACT TTGACCAATC TCCCGGACGG
CGACGACACC GCATCGGAGG CTTCCAATGA CGAATGGAGT AAGAAAGCGA CCATTTTTAA
AGCGTTTGGC AAAGTGGGTG TTAAGAAAAC TATCGCCTTT ACGCACGATA CCGATGTTCA
CTGCGCCTTG GACTACGATA CAGATGGCGA AGCGAGCGTG TTGCCGGCAG GATCACAGAC
GGCTCTAGAA CGGTACAGGA TATCGGGCGT AGCCGCCTTT GCGAAGGAAA TGGCCGACAA
GGGTCTGGGT AAACCCAAAC TTTCCTTGCA GTTTGAATTG AGTGCTTCCG GTATCACTGC
CCTCGTGAAA GCCGAAGCTG CAGTGGAAGA AACCTACACT GTCGAAGAGG AAGTCGAAGT
CGAAGACGAT GGCGTCACCA ACACGACCGA GGACGAGAGC GAAGAAGAAA AGAAGAACGA
TACCGAAGCT GATGGCGAAA ACAAAACGGA CGATTCGCAT GAGGTGAAAA AGGAGAAGAA
AACAATAAAG GTGCAAAAGG TACGTTTTGG ACAACGGGTT GACTATGTTT AGTACAGCCG
ATTTCGTTGC CTCTCTGATG ATCTACTAAC CCCACAGACG TTGTTTGGCT TACGCCAACG
CCGACCTGCT TTTGTCGTAA AGGAAAAGAA ACGCCTGCAC AAGAAGGAGC TCACGGTGGA
TACGTATCAT GTTGGTCGCG TAACTCCATA TTCCGCGGAG CTGCTGGCAG CATCGAAGGC
GAAGCTCCTC GAGATGGCTC GAAACGACAA AGAACGCATG ATGTTGGAAG AAGCGAAAAA
CCGCGTCGAA TCGTACATTT ACTACATCAA GAATAAACTC ACCGACGATG AAGAGGAAAT
CGGCACGGTG TCAACCAAGG AGCAACGAGA AGAGTGCCAA AAAGCGGCCG AAGCGGCTGA
AGAATGGTTG TACGACGACG GCTACTCAGC GGACCTGGCT ACAATGGAGG ACAAGTATGC
CGAATTGTCG GCACCCTTCG AAAAAATCAT GCTCCGTGTG AAAGAGACTG CCGCTCGTCC
GGAAGCCGTA AAGGTACTGG AGAAGAAATT GGAGGAGGTT GAAGCTCTCA TCAAGAAGTG
GGAGACTTCC ATGCCGCAGG TAACAGAGGA GGAGCGAACG AAAGTCTTGG ATCAGGTGGA
GGAGGTTCGC AAGTGGATTA CCAAGGTCGA AGGCATGCAA GCCAAGAAGA AGCCTCATGA
CGAACCAGCT TTTGTGAGTG CGGACGTACC GTTGCAAGCA AAGGACTTGG AACTCATGGT
AGTTCGACTA AGCAAGAAAC CCAAGCCCAA GCCACCCAAA AAAAAGAAAG ATGACAAGAA
GCCTGGTAAC TCGACAGACG CTATGGAGGA CACTGAGGAT CCGGCCGCGG TGAACAAAAC
CGAAGCCAAT TCGAGCGAAA GCGCTGAAAA CAAGAGCGAA GCCGAAGGGG CGCAGAGCAA
CGAAACACTT CCGGAACCCA CGAGTGGCGA CGCGGGCATG GATGAAGAGC TGTAGATAGG
AAGGTCTTTT AGTTACTTTC AGTCAATTAC TGAATCCATA GGGATAGGAT GTTATCGTAC
GTG
 
Protein sequence
MRLHSHRAVV AAASFCLWTC LASFSGTVEG RAILGVDLGS LYMKVALVQS GSPLEIVTNL 
HAKRKTEQMI LFDQQQRFYG ADASALLARK STKTPSAMSV LLGRDEQHPT VRVLAERHYP
VRPVYNETRA GVTLTVDGVE FTPEELVAMV LSHAVDISVA YATEQGSTIA PPKDVMLTVP
SYATQPERQA LLDAAGLAEL NVLGLIDENT ASALHYAMDK SFETPQLIIF YNMGASALQV
SLIRFFNYEQ PQKFGKPKTV PALEVLGKSW DATLGGQAFD QIVVEYLADE FNKAWHASTG
KTEQDVRSFP RAMIKLRLQA NKVKHVLSAN SEIPVYMEAV HDDVALSTTM TREQLELLAS
SLWARAIQPV TDVLQQANVT LEELTILELL GGGMRVPRIQ TELIENGLGG NAALLGKHIN
SDESMALGAA FAGANISTAF RVRQVGMTDL NPFALSVTLT NLPDGDDTAS EASNDEWSKK
ATIFKAFGKV GVKKTIAFTH DTDVHCALDY DTDGEASVLP AGSQTALERY RISGVAAFAK
EMADKGLGKP KLSLQFELSA SGITALVKAE AAVEETYTVE EEVEVEDDGV TNTTEDESEE
EKKNDTEADG ENKTDDSHEV KKEKKTIKVQ KTLFGLRQRR PAFVVKEKKR LHKKELTVDT
YHVGRVTPYS AELLAASKAK LLEMARNDKE RMMLEEAKNR VESYIYYIKN KLTDDEEEIG
TVSTKEQREE CQKAAEAAEE WLYDDGYSAD LATMEDKYAE LSAPFEKIML RVKETAARPE
AVKVLEKKLE EVEALIKKWE TSMPQVTEEE RTKVLDQVEE VRKWITKVEG MQAKKKPHDE
PAFVSADVPL QAKDLELMVV RLSKKPKPKP PKKKKDDKKP GNSTDAMEDT EDPAAVNKTE
ANSSESAENK SEAEGAQSNE TLPEPTSGDA GMDEEL
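Both sequences above are printed in space-separated blocks of 10 residues, 60 per line, so they need to be joined before any length or composition check. The following is a minimal sketch of one way to do that; `join_blocks` and `gc_fraction` are hypothetical helper names, and only the first gene line and the last protein line are pasted in as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces and newlines) from a blocked sequence."""
    return "".join(text.split())


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (case-insensitive)."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# Sample input copied from the record: first gene line, last protein line.
first_gene_line = (
    "TTTGGTGTGT CGTCGTCGTC TTTCCCGTCG TTCGCCCATC ACTCTCCACA TATAATTTAC "
)
last_protein_line = "ANSSESAENK SEAEGAQSNE TLPEPTSGDA GMDEEL"

gene_chunk = join_blocks(first_gene_line)
protein_tail = join_blocks(last_protein_line)

print(len(gene_chunk), len(protein_tail))
print(round(gc_fraction(gene_chunk), 2))
```

Applied to the full gene sequence, `len(join_blocks(...))` should agree with the listed gene length, and `gc_fraction` gives a value that can be compared with the listed GC content.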