Gene Information |
Locus tag | OSTLU_32019 |
Symbol | HSP70G |
ID | 5002444 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | - |
Start bp | 131721 |
End bp | 134452 |
Gene Length | 2732 bp |
Protein Length | 711 aa |
Translation table | |
GC content | 61% |
IMG OID | 640417865 |
Product | Heat Shock Protein 70, cytosolic |
Protein accession | XP_001418384 |
Protein GI | 145347872 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0443] Molecular chaperone |
TIGRFAM ID | [TIGR02350] chaperone protein DnaK |
| |
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0878343 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCGACGCGCT TCGCCGCGCG CACCGCACAC GATGGCGCCC AGCAAGAAGG GAGGCAAGAC CGCCGCCGCG CCCGTCGTCG CGCACGACGA GGACGATCTG ATTCAGGTGC GTCGCGGAAC GCGATCGCGA CCGCGACGCG ACGCGCGCGG ACGGATGGAT GGATGGCGGC GCGCGACCGA TCGGACGACG GCGCGCGCGA TGGGGCGCGC GGGACGATCG ACGGCGCGGT GAAACGCGCG CAAAGGACGC GCGACGCGAT CGGGACGCGC GGGAGACCGA AGGCGCGGCG GACGCGCGGG AGGGGCGACG ATGGGGATTT CGGATAGGGC CGGGCGCGGC CCTTCGGGCG CGGGGCGAAG ACGCGCGAGG GACGACGCGC GAGGGAGGCG CGAGCGCGAC GCGAGGTGGG GCGACGCGCG ATGGGACGCG AGAGGGGGGT GAGAGACGGG ACGCGAGAGG CGGTGGAGAC GGTCGACGGA GAGAGACGCG CGATGTGGGA GACTGACGAG GCGGACGACG ACGACGCGCG CGCAGGTCAC GCCGGCGATG AAGCAGCAAG CGGCGAAGTT TAAGGCGGCT GGGAACAACG CGTTCTCCGC CGGTCGATAC TACGCCGCCA CGGCTGAGTT CACCAAGGCG ATTGAGTGCG ATCCGTACGA TCACATTTTC TACTCTAACC GCTCCGCGTG CTACGCCAAC TTGGACCAAC ACTCCAAGGC GTGCGCGGAT GCGCGCCGAT GCATCGAGTT GCGCCCGGAT TTCGCCAAGG GGTACTCTCG TCTCGGTTTC GCGCTCTTCA AGGCTGGTTT CTTGCACGAC GCCATGAATG CGTACGAGCG CGGTTTGACC GTGGACCCGA AGAACAACAA CTTGATTGAA GGTTTGGGTG AAGCCAAGTT GGCGCAAAAG GCTAAGATTG AAGCGGCCAA GTTGAGCGCG CAAATGGATA ACGTCACGCT CGACGAGCAC GTCATCGGTA TCGATCTCGG TACGACGTAC TCTTGCGTGT CCGTGTGGAG AAACGGTGAA GCGCACGTGC TGACGAACGC CGAAGGCGAC CGTACCACCC CGTCTTGGGT TGCGTTCACC GAGCAAGGCC GCCTCGTCGG CGACGCCGCC AAGCGCCAAG CCGCCATCAA CCCGAAGAAC ACGCTGTTCA ACATCAAGCG TATCATCGGT CGTCAATACA GCGAGTGCGC TCACGAGCTC GAATTAATGC CGTTCGATGT CAAAGAAGGC GAAGGCGGCA AGCCGATCGT CTCCGTTGAC GTGAACGGCG AAAAGAAGGA CTTTGCCCCG GAGCAAATTT CCGCCATGGT TTTGCAAAAG ATGAAGGCGA CTGCCGAGGC GCAGCTCGGT GTCCCGATCA CCAAGGCTGT CGTCACCGTG CCGGCGTATT TCAACGATGC CCAGCGTCGC CAAACCAAGG ATGCCGGTGC CATCGCGGGT CTCGACGTCT TGCGTATCAT CAACGAGCCG ACCGCGGCGG CGCTCGCGTA CGGCCTCGAT CGCCGCGAAG GCGAAAACGG CGAAGTCATC AAGAACCAAT GCATCTTGGT CTTCGATCTC GGTGGTGGTA CCTTCGATGT GTCCTTGTTG AACTTGCAAG ACGGCGTCTT CGAAGTGCTC TCCACCGCCG GTGACACGCA CTTGGGTGGT GAAGATTTCG ACACGTCCCT CGCGGCTTTC GCGCAAAAGG AGATTGAAAA GGAGCGCGGC GCCGACATCT TCACCGGCGA TGAAAAGGCT CTTCGCAAGT TGCGCACGGC GTGCGAAAAG GCCAAGCGCG AGTTGTCCGT 
GGCCAACCAC GCCAACATCG AATGCTTCAT CGGTGAAATC GAAATCAACA TGAAGATTAC CCGTGAACAA TTCGAGAAGG TTTGCGAACC GACGTTCCAA CGCTGCCTCG ACTCTGTCAA GCGCGTGCTC AGCGACGCCG GCAAAAAGAA GGAAGAAGTC GACGAAATCG TCCTCGTCGG TGGTTCCACC CGTGTCCCGC GCGTGCAAGG TATCCTCACC GAATACTTCG ATGGCAAGAC CCTCAACAAG TCTGTCCACC CCGATGAGGC GGTGGCGTAC GGTGCGGCCG TGCAAGGTGC GATTCTCGCG GGCGTCCGCG ACAAGCAGAC TTCTCGCGTT CTCCTCATGG ATGTCGTTCC GCTTTCCCTC GGTGTCGAGT GCGAAGGTCG CCAATTCGCC AAGGTTGTGC AACGCAACAC TGCGATTCCG TGCAAAAAGA AGAGCGAGTT CACCACCGTC TATGATAACC AAGACGAGAT TGATGTGCGC ATTTTCGAAG GCGAACGCTC CAACACCGAC GGCAACCACT TGCTCGGCGA GTTCCAAATT TCTGGCATCG AGCGCGCTTC CGCGGGCGAA CCGAAGATTG ATGTCACTTT CGAGGTCAAC ACCAACGGTT TGTTGACCGT CACCGCCAAG GATCGCGTCA CCGGCGTCGA GGCCAACGTG TCCTTGCAAC ACGACCGCGG CCGTTTGACT GCGGAAGAGA TCGAGCGCAT GTGCGCCGAA GCCGAAGCCA TGGCGGAAGA AGATGAGCGC CTCGCGCGCA TGCGTGAGTA CGAAGGCACC GACTAGGCGA TAAAAAGTCA AGTTCTGCGT CCTTTTAGAT AGCCGCGTCG GTCCCGAGTC GCTCTGCGCG CCTCGACGAC GGTTCGCGCC ACCCACGTAA CCATTCTTTC TCACCGCAAC ATATGATCTA GTAGCTCATG TT
|
Protein sequence | MAPSKKGGKT AAAPVVAHDE DDLIQVTPAM KQQAAKFKAA GNNAFSAGRY YAATAEFTKA IECDPYDHIF YSNRSACYAN LDQHSKACAD ARRCIELRPD FAKGYSRLGF ALFKAGFLHD AMNAYERGLT VDPKNNNLIE GLGEAKLAQK AKIEAAKLSA QMDNVTLDEH VIGIDLGTTY SCVSVWRNGE AHVLTNAEGD RTTPSWVAFT EQGRLVGDAA KRQAAINPKN TLFNIKRIIG RQYSECAHEL ELMPFDVKEG EGGKPIVSVD VNGEKKDFAP EQISAMVLQK MKATAEAQLG VPITKAVVTV PAYFNDAQRR QTKDAGAIAG LDVLRIINEP TAAALAYGLD RREGENGEVI KNQCILVFDL GGGTFDVSLL NLQDGVFEVL STAGDTHLGG EDFDTSLAAF AQKEIEKERG ADIFTGDEKA LRKLRTACEK AKRELSVANH ANIECFIGEI EINMKITREQ FEKVCEPTFQ RCLDSVKRVL SDAGKKKEEV DEIVLVGGST RVPRVQGILT EYFDGKTLNK SVHPDEAVAY GAAVQGAILA GVRDKQTSRV LLMDVVPLSL GVECEGRQFA KVVQRNTAIP CKKKSEFTTV YDNQDEIDVR IFEGERSNTD GNHLLGEFQI SGIERASAGE PKIDVTFEVN TNGLLTVTAK DRVTGVEANV SLQHDRGRLT AEEIERMCAE AEAMAEEDER LARMREYEGT D
|