Gene Information |
Locus tag | OSTLU_24612 |
Symbol | CLPB1 |
ID | 5001884 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | + |
Start bp | 795886 |
End bp | 799120 |
Gene Length | 3235 bp |
Protein Length | 923 aa |
Translation table | |
GC content | 60% |
IMG OID | 640417305 |
Product | chaperone, Hsp100 family, ClpB-type |
Protein accession | XP_001417871 |
Protein GI | 145346802 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0542] ATPases with chaperone activity, ATP-binding subunit |
TIGRFAM ID | [TIGR03346] ATP-dependent chaperone ClpB |
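
The length fields in the table above can be cross-checked against each other. The sketch below is illustrative only: the variable names are made up for this example, and it assumes that the coordinates are inclusive at both ends and that the gene span covers the coding sequence plus its stop codon. Under those assumptions, the bases not accounted for by the protein would be non-coding, which is consistent with the gene being marked as spliced.

```python
# Values copied from the Gene Information table above.
start_bp = 795886
end_bp = 799120
gene_length_bp = 3235
protein_length_aa = 923

# Assumption: coordinates are inclusive at both ends, so span = end - start + 1.
span_bp = end_bp - start_bp + 1

# Coding bases implied by the protein: one codon per residue plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Assumption: bases in the gene span that are not coding are intronic.
non_coding_bp = gene_length_bp - coding_bp

print("span matches reported gene length:", span_bp == gene_length_bp)
print("coding bases:", coding_bp)
print("non-coding bases:", non_coding_bp)
```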
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.311804 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTCGG CGCACGCGCT CAAGCTCTCG CCCTCGTCCG GCTCCACGTC GACGCACCAC GCCTCGCCGT CGCCCACGGG CGCGCACCGC GCGACGTTTC GTCGCCCGAT GGCGCGCTCG ACGACGCTCA CCGCGTCCGC GCCGCGCGCG CCGCGCGCGA TGCGCGCCGA CACCCGACAC CGCGCGGCGT CGGCGTCGAC GCGAACGCGC AGCGACCGCG CGCGCGCGTC GACGTCGTCG AGCGTCAGCG GACGCGCGTC GCGCTCGACG CGCGCGTCTC CGCGCGCGCG CGCGTCGCGC TCGACGACGA CGACGAATCG AGGCGTCATG GGCCGGGCGC GACGCGACGT GACGGCACAC GCGCGCGCGA GCGACGGCGA TGGCGACGAC GGCGACCGGG ACGACGACGA CGACGCGGGC GGGCGTCGCG CGCGCGCGGC GCGCGCGCGC GACGGCGACG CGTCGAACGG CGACGCGCGT CGGGAGGGGC GTCGAAGACG CGCGACGCGA TGACGAAGCC TCGGCGCGAC GACGAGGGCG CGCGGTCGCG CGCGCGAGCG CGGGCGCGGG CGGGAGCGGA GGGAAGAAGA TAAGCCAGAA TGAGTTCACG GCGCGCGCGT GGGACGCGAT CGTGCGGGCG CCGGAGGTGG CGAAACAGAG CAAACAACAA ATCGTGGAGA CGGAGCACGT GTGCGAGGCG CTGTGCTCGC AAAAGGACGC GTTCGCGATG CGGATATTCG CCCAAGCGGG AGTGAAGGAT TTGAAATTAG TGATATCGAG AACGCGTGAT TTCATCGCGG GGCAGCCGCA GGTGAGCGGG GCGGCGCAAC AGGTGCTGGG GAGGTTTTTG GAGTCGCTCG TGGACGACGC GCGGACGATT TCGAGCGGCA TGTCGGACGA GTTCGTGGCG GTGGAACATT TGGTGCTCGC GCTGGCGCGA GACGAGAGAT TTGGGAAAGG ATTGATGGCG GATTTGGGGA TCACGTACGC GAATCTCGAG GCCGCGGTGA TCACGCTGCG TCGGGGAGAG AACGTGACGG ATCAAGACGC CGAGGATAAG TACGAGGCGC TGAAAAAGTA CTCGCGGGAT TTGACGGAGG AGGCTCGGGC GGGGAAGTTG GATCCCGTCA TCGGCCGAGA CGATGAAATT CGTCGCACGA TTCAAATCTT AAGTCGTCGC TCGAAGAATA ATCCGGTGTT GATCGGAGAA CCCGGGGTAG GGAAGACGGC GGTGGCGGAG GGACTGGCGC AACGCGTCGT TCGAGGCGAC GTCCCGACGT CTCTCCAAGA CGTGAAAATC ATGTCATTGG ACATGGGTTT GCTCATCGCC GGCGCAAAGT TTCGAGGGGA GTTTGAAGAT AGGCTAAAGG CGGTGATGAA GGAAGTTTCC GACTCCATGG GGAAAATTAT CCTTTTCATC GACGAGATTC ACACCGTCGT CGGCGCCGGA GGCGGAGGCG GCGGGGGCAA CGGCATGGAC GCGGGTAACT TGCTCAAACC CATGCTCGGG CGAGGTGAGC TGCGATGCAT CGGAGCAACG ACGTTGGATG AGTATCGCAC GTACATCGAG AAAGATCCCG CGCTCGAACG TCGATTCCAA CAAGTTATAA TCGCGCAGCC GACGGTGGAG GACACGATCA GTATCTTGCG AGGTTTACGC GAACGCTACG AGCTTCATCA CGGGGTCTCG ATCTCTGATT CTGCCTTGGT CGAAGCGGCG ACGCTCAGCG ACCGATACAT CGCCGATCGC TTCTTGCCGG ACAAGGCGAT CGATCTCGTA GATGAGTCCG CAGCCAAGCT 
CAAGATGGAA ATCACGTCGA AACCGACGGT TTTGGATGAA ATCGATCGCG AGATTTTGAA ACTTCAAATG GAGAAGATAT CGTTGTCGCG ACCAGGCGCT TCGCGAGACG CGCGCTCGAT CCAATCCAAA GTTGAGAAGC TCGACAGCGA TCTGAAGGCG CTCACCGAAA AGCAGTCGGT GCTCAACGAT CAATGGCAAG GTGAACAAAA CAAGTTGAAG GCGATTCAGA CTTTGAAGGA AGAAATTGAT TCGGTTACGA ACAGCATTCA GCGCGCCGAG CGTGAGTATG ACTTGAATAA AGCGGCTGAG TTGAAGTACG GTACGCTCAT GACGTTGCAG CGAAGACTAA ACGAGGCAGA GGAAGTGCTC GAGCTCGCTA CTTCGGAGGG ACCGACTCTT TTGCGCGACG AAGTCACCGA GGCCGACATC GCGGACGTCA TCTCCAAGTG GACCGGTATT CCCGTCGCCA AGCTCCAGCA AGGCGAGCGG GAAAAACTTC TTGATTTACC CGCCGAACTT CACAAGCGAG TCGTCGGCCA AGATGAAGCC GTGCAGTCTG TGTGCGAGGC TATTCAGCGC TCTCGCGCGG GTCTCTCTGA CCCGAACCGC CCGATTGCGA GTTTCATGTT CCTCGGCCCC ACGGGCGTGG GTAAAACGGA GCTGTGCAAA ACATTGGCAA ACTTTTTGTT CAACACCGAG GAAGCGATGA TTCGCATCGA TATGAGCGAG TATATGGAAA AACACTCTGT GAGTCGCTTG ATTGGCGCTC CGCCCGGGTA CGTCGGTTTC GAGGAAGGCG GTCAACTCAC CGAGGCGGTA CGACACCGTC CGTATTCTGT CGTACTTTTC GACGAGATGG AAAAAGCACA CGGTGATGTT TTCAACGTAT TGCTTCAGAT TTTGGACGAT GGTCGCGTCA CCGATTCGCA AGGGCGTTTG ATCAACTTCA AAAACACTAT TCTCATCATG ACGTCAAACA TCGGTAGTCA ATATGTGCTC GACACCAACG AAGCTTCGAA AGAAACTAGA CGCGAGCGTG TGATGGATGC TGTGCGTGGA CACTTCCGAC CGGAGTTCAT CAACCGTGTG GACGAGTGGA TCGTCTTTGA TCCGCTTGCG AAGGATCAAG TCACCGCCAT CGTTCGACAA CAAGTCGAAC GCGTCACCTC CCGTCTCGCT GATCGTAAGA TTGGACTCCG AGTCTCCGAC GAAGCCGTTG CGTTACTCTC CGACACCGGC TACGACCCCG CATTCGGCGC TCGTCCCGTG AAGCGCGCGG TGCAGAGCTT ATTAGAAACC GCCGTAGCCC AAGCCATCCT TCGAGGCGAC GTCAACGAGG ATCAAACCGC CGTCGTCGAC GTCGACCCGT CGTCCACGGG GAAGCTTGTC GTCACCGCAA AAGATTCTCC CAAGTCTGCA AACGTCATCG CATGA
|
Protein sequence | MASAHALKLS PSSGSTSTHH ASPSPTGAHR ATAGAGGSGG KKISQNEFTA RAWDAIVRAP EVAKQSKQQI VETEHVCEAL CSQKDAFAMR IFAQAGVKDL KLVISRTRDF IAGQPQVSGA AQQVLGRFLE SLVDDARTIS SGMSDEFVAV EHLVLALARD ERFGKGLMAD LGITYANLEA AVITLRRGEN VTDQDAEDKY EALKKYSRDL TEEARAGKLD PVIGRDDEIR RTIQILSRRS KNNPVLIGEP GVGKTAVAEG LAQRVVRGDV PTSLQDVKIM SLDMGLLIAG AKFRGEFEDR LKAVMKEVSD SMGKIILFID EIHTVVGAGG GGGGGNGMDA GNLLKPMLGR GELRCIGATT LDEYRTYIEK DPALERRFQQ VIIAQPTVED TISILRGLRE RYELHHGVSI SDSALVEAAT LSDRYIADRF LPDKAIDLVD ESAAKLKMEI TSKPTVLDEI DREILKLQME KISLSRPGAS RDARSIQSKV EKLDSDLKAL TEKQSVLNDQ WQGEQNKLKA IQTLKEEIDS VTNSIQRAER EYDLNKAAEL KYGTLMTLQR RLNEAEEVLE LATSEGPTLL RDEVTEADIA DVISKWTGIP VAKLQQGERE KLLDLPAELH KRVVGQDEAV QSVCEAIQRS RAGLSDPNRP IASFMFLGPT GVGKTELCKT LANFLFNTEE AMIRIDMSEY MEKHSVSRLI GAPPGYVGFE EGGQLTEAVR HRPYSVVLFD EMEKAHGDVF NVLLQILDDG RVTDSQGRLI NFKNTILIMT SNIGSQYVLD TNEASKETRR ERVMDAVRGH FRPEFINRVD EWIVFDPLAK DQVTAIVRQQ VERVTSRLAD RKIGLRVSDE AVALLSDTGY DPAFGARPVK RAVQSLLETA VAQAILRGDV NEDQTAVVDV DPSSTGKLVV TAKDSPKSAN VIA
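
The sequences above are displayed in space-separated blocks of ten letters. A minimal sketch of how such a string might be rejoined and its GC content computed follows; the helper names `join_blocks` and `gc_percent` are hypothetical, and the excerpt is only the first three blocks of the gene sequence, so its GC value is not expected to equal the 60% reported for the whole gene.

```python
def join_blocks(grouped: str) -> str:
    """Remove the whitespace that separates the 10-letter display blocks."""
    return "".join(grouped.split())


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    seq = seq.upper()
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence above, used only as a short excerpt.
excerpt = "ATGGCTTCGG CGCACGCGCT CAAGCTCTCG"
sequence = join_blocks(excerpt)

print(len(sequence), round(gc_percent(sequence), 1))
```

To check the full record, the whole gene sequence string would be passed through `join_blocks` in the same way before calling `gc_percent`.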