Gene OSTLU_24612 details

Gene Information

Locus tag: OSTLU_24612
Symbol: CLPB1
ID: 5001884
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 795886
End bp: 799120
Gene Length: 3235 bp
Protein Length: 923 aa
Translation table:
GC content: 60%
IMG OID: 640417305
Product: chaperone, Hsp100 family, ClpB-type
Protein accession: XP_001417871
Protein GI: 145346802
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0542] ATPases with chaperone activity, ATP-binding subunit
TIGRFAM ID: [TIGR03346] ATP-dependent chaperone ClpB
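
The coordinate and length fields above agree with each other if Start bp and End bp are read as 1-based, inclusive positions on the replicon. The record does not state that convention, so it is an assumption here. A minimal Python sketch of the check follows; the function name `span_length` is hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


gene_length = span_length(795886, 799120)
print(gene_length)  # 3235, matching the Gene Length field

# Protein Length is 923 aa. Counting one stop codon, the coding sequence
# would be (923 + 1) * 3 nt. The difference from the gene span is a rough
# estimate of the spliced-out portion, assuming the span runs from the
# start codon to the stop codon.
coding_nt = (923 + 1) * 3
print(gene_length - coding_nt)
```

Under those assumptions the second value is only an estimate of the total intron length. The record itself says only that the gene is spliced.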


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.311804
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTCGG CGCACGCGCT CAAGCTCTCG CCCTCGTCCG GCTCCACGTC GACGCACCAC 
GCCTCGCCGT CGCCCACGGG CGCGCACCGC GCGACGTTTC GTCGCCCGAT GGCGCGCTCG
ACGACGCTCA CCGCGTCCGC GCCGCGCGCG CCGCGCGCGA TGCGCGCCGA CACCCGACAC
CGCGCGGCGT CGGCGTCGAC GCGAACGCGC AGCGACCGCG CGCGCGCGTC GACGTCGTCG
AGCGTCAGCG GACGCGCGTC GCGCTCGACG CGCGCGTCTC CGCGCGCGCG CGCGTCGCGC
TCGACGACGA CGACGAATCG AGGCGTCATG GGCCGGGCGC GACGCGACGT GACGGCACAC
GCGCGCGCGA GCGACGGCGA TGGCGACGAC GGCGACCGGG ACGACGACGA CGACGCGGGC
GGGCGTCGCG CGCGCGCGGC GCGCGCGCGC GACGGCGACG CGTCGAACGG CGACGCGCGT
CGGGAGGGGC GTCGAAGACG CGCGACGCGA TGACGAAGCC TCGGCGCGAC GACGAGGGCG
CGCGGTCGCG CGCGCGAGCG CGGGCGCGGG CGGGAGCGGA GGGAAGAAGA TAAGCCAGAA
TGAGTTCACG GCGCGCGCGT GGGACGCGAT CGTGCGGGCG CCGGAGGTGG CGAAACAGAG
CAAACAACAA ATCGTGGAGA CGGAGCACGT GTGCGAGGCG CTGTGCTCGC AAAAGGACGC
GTTCGCGATG CGGATATTCG CCCAAGCGGG AGTGAAGGAT TTGAAATTAG TGATATCGAG
AACGCGTGAT TTCATCGCGG GGCAGCCGCA GGTGAGCGGG GCGGCGCAAC AGGTGCTGGG
GAGGTTTTTG GAGTCGCTCG TGGACGACGC GCGGACGATT TCGAGCGGCA TGTCGGACGA
GTTCGTGGCG GTGGAACATT TGGTGCTCGC GCTGGCGCGA GACGAGAGAT TTGGGAAAGG
ATTGATGGCG GATTTGGGGA TCACGTACGC GAATCTCGAG GCCGCGGTGA TCACGCTGCG
TCGGGGAGAG AACGTGACGG ATCAAGACGC CGAGGATAAG TACGAGGCGC TGAAAAAGTA
CTCGCGGGAT TTGACGGAGG AGGCTCGGGC GGGGAAGTTG GATCCCGTCA TCGGCCGAGA
CGATGAAATT CGTCGCACGA TTCAAATCTT AAGTCGTCGC TCGAAGAATA ATCCGGTGTT
GATCGGAGAA CCCGGGGTAG GGAAGACGGC GGTGGCGGAG GGACTGGCGC AACGCGTCGT
TCGAGGCGAC GTCCCGACGT CTCTCCAAGA CGTGAAAATC ATGTCATTGG ACATGGGTTT
GCTCATCGCC GGCGCAAAGT TTCGAGGGGA GTTTGAAGAT AGGCTAAAGG CGGTGATGAA
GGAAGTTTCC GACTCCATGG GGAAAATTAT CCTTTTCATC GACGAGATTC ACACCGTCGT
CGGCGCCGGA GGCGGAGGCG GCGGGGGCAA CGGCATGGAC GCGGGTAACT TGCTCAAACC
CATGCTCGGG CGAGGTGAGC TGCGATGCAT CGGAGCAACG ACGTTGGATG AGTATCGCAC
GTACATCGAG AAAGATCCCG CGCTCGAACG TCGATTCCAA CAAGTTATAA TCGCGCAGCC
GACGGTGGAG GACACGATCA GTATCTTGCG AGGTTTACGC GAACGCTACG AGCTTCATCA
CGGGGTCTCG ATCTCTGATT CTGCCTTGGT CGAAGCGGCG ACGCTCAGCG ACCGATACAT
CGCCGATCGC TTCTTGCCGG ACAAGGCGAT CGATCTCGTA GATGAGTCCG CAGCCAAGCT
CAAGATGGAA ATCACGTCGA AACCGACGGT TTTGGATGAA ATCGATCGCG AGATTTTGAA
ACTTCAAATG GAGAAGATAT CGTTGTCGCG ACCAGGCGCT TCGCGAGACG CGCGCTCGAT
CCAATCCAAA GTTGAGAAGC TCGACAGCGA TCTGAAGGCG CTCACCGAAA AGCAGTCGGT
GCTCAACGAT CAATGGCAAG GTGAACAAAA CAAGTTGAAG GCGATTCAGA CTTTGAAGGA
AGAAATTGAT TCGGTTACGA ACAGCATTCA GCGCGCCGAG CGTGAGTATG ACTTGAATAA
AGCGGCTGAG TTGAAGTACG GTACGCTCAT GACGTTGCAG CGAAGACTAA ACGAGGCAGA
GGAAGTGCTC GAGCTCGCTA CTTCGGAGGG ACCGACTCTT TTGCGCGACG AAGTCACCGA
GGCCGACATC GCGGACGTCA TCTCCAAGTG GACCGGTATT CCCGTCGCCA AGCTCCAGCA
AGGCGAGCGG GAAAAACTTC TTGATTTACC CGCCGAACTT CACAAGCGAG TCGTCGGCCA
AGATGAAGCC GTGCAGTCTG TGTGCGAGGC TATTCAGCGC TCTCGCGCGG GTCTCTCTGA
CCCGAACCGC CCGATTGCGA GTTTCATGTT CCTCGGCCCC ACGGGCGTGG GTAAAACGGA
GCTGTGCAAA ACATTGGCAA ACTTTTTGTT CAACACCGAG GAAGCGATGA TTCGCATCGA
TATGAGCGAG TATATGGAAA AACACTCTGT GAGTCGCTTG ATTGGCGCTC CGCCCGGGTA
CGTCGGTTTC GAGGAAGGCG GTCAACTCAC CGAGGCGGTA CGACACCGTC CGTATTCTGT
CGTACTTTTC GACGAGATGG AAAAAGCACA CGGTGATGTT TTCAACGTAT TGCTTCAGAT
TTTGGACGAT GGTCGCGTCA CCGATTCGCA AGGGCGTTTG ATCAACTTCA AAAACACTAT
TCTCATCATG ACGTCAAACA TCGGTAGTCA ATATGTGCTC GACACCAACG AAGCTTCGAA
AGAAACTAGA CGCGAGCGTG TGATGGATGC TGTGCGTGGA CACTTCCGAC CGGAGTTCAT
CAACCGTGTG GACGAGTGGA TCGTCTTTGA TCCGCTTGCG AAGGATCAAG TCACCGCCAT
CGTTCGACAA CAAGTCGAAC GCGTCACCTC CCGTCTCGCT GATCGTAAGA TTGGACTCCG
AGTCTCCGAC GAAGCCGTTG CGTTACTCTC CGACACCGGC TACGACCCCG CATTCGGCGC
TCGTCCCGTG AAGCGCGCGG TGCAGAGCTT ATTAGAAACC GCCGTAGCCC AAGCCATCCT
TCGAGGCGAC GTCAACGAGG ATCAAACCGC CGTCGTCGAC GTCGACCCGT CGTCCACGGG
GAAGCTTGTC GTCACCGCAA AAGATTCTCC CAAGTCTGCA AACGTCATCG CATGA
 
Protein sequence
MASAHALKLS PSSGSTSTHH ASPSPTGAHR ATAGAGGSGG KKISQNEFTA RAWDAIVRAP 
EVAKQSKQQI VETEHVCEAL CSQKDAFAMR IFAQAGVKDL KLVISRTRDF IAGQPQVSGA
AQQVLGRFLE SLVDDARTIS SGMSDEFVAV EHLVLALARD ERFGKGLMAD LGITYANLEA
AVITLRRGEN VTDQDAEDKY EALKKYSRDL TEEARAGKLD PVIGRDDEIR RTIQILSRRS
KNNPVLIGEP GVGKTAVAEG LAQRVVRGDV PTSLQDVKIM SLDMGLLIAG AKFRGEFEDR
LKAVMKEVSD SMGKIILFID EIHTVVGAGG GGGGGNGMDA GNLLKPMLGR GELRCIGATT
LDEYRTYIEK DPALERRFQQ VIIAQPTVED TISILRGLRE RYELHHGVSI SDSALVEAAT
LSDRYIADRF LPDKAIDLVD ESAAKLKMEI TSKPTVLDEI DREILKLQME KISLSRPGAS
RDARSIQSKV EKLDSDLKAL TEKQSVLNDQ WQGEQNKLKA IQTLKEEIDS VTNSIQRAER
EYDLNKAAEL KYGTLMTLQR RLNEAEEVLE LATSEGPTLL RDEVTEADIA DVISKWTGIP
VAKLQQGERE KLLDLPAELH KRVVGQDEAV QSVCEAIQRS RAGLSDPNRP IASFMFLGPT
GVGKTELCKT LANFLFNTEE AMIRIDMSEY MEKHSVSRLI GAPPGYVGFE EGGQLTEAVR
HRPYSVVLFD EMEKAHGDVF NVLLQILDDG RVTDSQGRLI NFKNTILIMT SNIGSQYVLD
TNEASKETRR ERVMDAVRGH FRPEFINRVD EWIVFDPLAK DQVTAIVRQQ VERVTSRLAD
RKIGLRVSDE AVALLSDTGY DPAFGARPVK RAVQSLLETA VAQAILRGDV NEDQTAVVDV
DPSSTGKLVV TAKDSPKSAN VIA
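
Both sequences above are laid out in space-separated blocks of ten characters, six blocks per line, so they need to be joined before any length or composition check. The following is a minimal Python sketch of that cleanup and of a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the excerpt is only the first line of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First line of the gene sequence above, used as a short excerpt.
excerpt = "ATGGCTTCGG CGCACGCGCT CAAGCTCTCG CCCTCGTCCG GCTCCACGTC GACGCACCAC"
dna = clean_sequence(excerpt)
print(len(dna))  # 60
print(round(gc_fraction(dna), 2))
```

Applied to the full gene sequence, `len(clean_sequence(...))` can be compared with the Gene Length field and `gc_fraction` with the GC content field. The same cleanup works for the protein sequence, whose length can be checked against the Protein Length field.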