Gene OSTLU_29402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_29402 
SymbolCLPC1 
ID5006733 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009374 
Strand
Start bp174788 
End bp178075 
Gene Length3288 bp 
Protein Length840 aa 
Translation table 
GC content60% 
IMG OID640422154 
Productchaperone, Hsp100 family, ClpC-type 
Protein accessionXP_001422509 
Protein GI145356586 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0542] ATPases with chaperone activity, ATP-binding subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0120883 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCGCGATGCG CGCGCACGTC ACCCTCCGCG CGGCGACGCC CGCGCGCGCC GCCGCGACCG 
TGCGCGCGGG CCGACGACCG TCCGCGATGC GCGCCGACGC GGCGCCGACG CGCGCGACGA
GCGCGCGCGT CGCGTACGGC AACGCGTCGA CGACGACGCT CAAGCTCGAC GCGCGAGGCG
CGACGCGAAG GCCGACCCTA GGCGGGGGGC GTCGACGATT CGAGACGCGC GCGATGTTTG
AGGTGCGCGC GACGCGACCG CGAGGGCGGG GCACGCGAGC GTGTATTTTG ATGCGTCGCG
GACGCGGCGC GGAGGGGGAG GGGCGACGCG GGAGAGGACG AGGGACTGAC GAGACGATGG
CTTCGATCGA CGCGCGCGCA GCGGTTCACG GAGAAGGCCA TCAAGGTGGT GATGTTGGCG
CAGGAGGAGG CGCGGCGATT GGGGCACAAC TTCGTCGGGA CCGAACAGGT GAGTGGAGAG
AATATTGAAC GCGCGAACGG GCGCGGATTC GGGGCGCGGG GAAGGGGTAG GGCGGCGTCG
GGGGGGACGC GGAGGACGCG ACGTTAGGTT TAAGCGGGAA GGCGCGGCGC GCGGGGAACG
CGCGGCGGCG TCGCGGGCGC GAGGAACGCG AGGACGGCGC GCGAGGACGC GTGGGAGGGG
TGGAAGATGC GCGCGCGCGC GCGCGACGGC GCGATACGGA CGGTGACTGA CGAGACGGCG
TTCGCGACGC GACAGATCAT GCTCGGTTTG ATTGGCGAAG GCACCGGCAT CGCGGCCAAG
GTGCTCAAGT CGATGGGCAT TTCGTTGAAG GAAGCGCGCA TCGAGGTTGA GAAGATTATT
GGACGCGGTT CCGGTTTCGT CGCGGTGGAG ATTCCGTTCA CGCCGCGCGC GAAGCGCGTG
TTGGAGCTCG CGCTCGAGGA GGCGCGCCAG TTGGGCCACA ACTACATCGG CACCGAACAC
TTGTTGCTCG GTTTGCTCCG TGAAGGCGAG GGCGTGGCTG CGCGCGTGCT CGAAAACCTC
GACGCCGACC CGGCAAAGAT TCGTTCTCAA GTGATTCGCA TGGTTGGTGA GACGCAAGAA
GCCGTCGGCG CGGGCGCCGG CGGCGGCCAA GGCGCGCAAT CCGGCTCCAA GACGCCGACT
TTGGAAGAGT TCGGTAGCGA CCTCACCAAG AAGGCTGAAG AGGGTAAGCT CGATCCGTGC
ATCGGTCGTG CGAACGAAAT CGTTCGCGTC ACGCAAATTC TCGGTCGCCG TACCAAGAAC
AACCCGTGCT TGATTGGTGA ACCGGGCGTC GGTAAGTCTG CCATTGCCGA AGGTCTCGCG
CAAAAGATTG CCGCCAACGA CGTTCCGGAT ACTCTCGACT CCAAGCGTAT GATGACGCTC
GATATGGGTT TGCTCGTCGC CGGTACCAAG TACCGTGGTG AGTTCGAGGA GCGTCTCAAG
AAGCTCATGG ACGAAGTGAA GAGTGATGAA AACATCATCC TTTTCATCGA CGAAGTCCAC
ACGCTCATCG GCGCCGGCGC GGCGGAGGGC GCCATCGATG CGGCGAACAT CTTGAAGCCG
GCCTTGGCGC GTGGTGAACT CCAGTGCATC GGTGCGACGA CTATTGACGA GTACCGCAAG
CACATCGAGA AGGATCCGGC GCTCGAGCGT CGTTTCCAAC CGGTTCAAGT TCCGGAGCCG
TCGGTGGATG AAACCATTCA AATTCTCCGC GGACTCCGCG AGCGTTACGA ATTGCACCAC
AAGCTCAAGT ACGATGACGA CGCTTTGATC GCCGCCGCAA AGTTCTCGAG CCAGTACATC
TCCGATCGTT TCTTGCCGGA CAAGGCGATC GATCTCATCG ATGAGGCTGG TTCTCGCGTG
CGACTGGAAA ACGCCGCCCT CCCGGAGGAA GCCAAGGAAC TCGATAAGGA GCTCAAGGCT
TTGATGAAGG AGAAGGATAC GGCCATCCGC TCTCAGGATT TCGAAGCCGC TGGTGGCCTT
CGTGATCGCG AAGTCGAGCT CAGAGCTCAA ATCAAGCAAA TCACCGAGCG AAAGCAAGAG
GAGAACAAGG CGAAGGCTGA ATCCGGTGAC GCGTCTGGTC CGACGGTGGT CGAGCAAGAC
ATTGCCGATA TTGTCGCCGC CTGGACTGGT ATCCCCGTCG ATAAGGTGTC TTCCGATGAA
GGTACCCGAT TGATGGACAT GGAAGAAACC CTTCACAAGC GATTGGTTGG TCAAGAAGAA
GCCGTCGTGG CGTGCGCTCG CGCCATTCGC CGCGCCCGTA CCGGCTTGAA GAACCCGAAC
CGTCCGGTCG CGTCCTTCAT CTTCTCCGGT CCCACTGGTG TCGGTAAATC CGAGCTCGCC
AAGTCCCTCT CTGCCTTTTA CTTCGGTTCC GAAGAAGCCA TGGTTCGTCT TGATATGTCC
GAATTCATGG AGCGCCACAC TGTTTCCAAG CTCATCGGTT CCCCGCCGGG TTACGTCGGT
TACTCTGAGG GTGGTCAGCT CACCGAGGCT GTTCGGCGGC GTCCGTACAC CCTCGTGCTT
TTCGATGAAA TCGAAAAGGC GCACCCGGAT GTGTTCAACA TGATGCTTCA AATCCTCGAA
GACGGTCGCT TGACCGACTC CAAGGGTCGC GTGATTGACT TCAAGAACAC CCTCATCATC
ATGACGTCGA ACGTTGGTGC GTCGGCGATT GAAAAGGGTG GCGGTGGATT GGGCTTCCAA
CTCGACGACA ACGCGGAGGA TCAGTCCTAC AACCGCATTA AGAGCTTAGT CATGGAAGAC
TTGAAGAACT ACTTCCGACC GGAATTCCTC AACCGTCTTG ACGAAATCAT CGTGTTCCGT
CAACTCAACA AGCAAGAAGT GCGAGAAATT GCGTACATCA TGCTCGAGAA CGTCTTCAAG
CGCCTTAAGG AGAAGGAAAT CGTCCTCGAA TGCACGGAGC GATTCAAGGA TCGTCTCGTC
GACGAGGGTT TCTCTCCGGC GTACGGTGCG CGTCCTTTGC GTCGCGCCAT CATGCGCTTG
CTTGAGGACA ACCTCTCCGA GAAGATGCTC ACGGGTGAGA TTTCGGAGGG TTCGTCGTGC
ATTATGGACG TGAACGCGGA AGGTGAAATC ACGGTGTTGA CTGGCGACGG TAGAGAGCTG
AAGGCTGGTA GCGCGATCGG TGGTCCGGCT GGCATCGCGT AATTCTAGGC TGCGAGAACG
CTTAAAAAAT GATTGATTAC AAAATGACTT TTGTTCTTTG GGGGTCACCC CATCATCAGC
GTCGGAGCCT GTGTTATTAG GCGATTTAGA TTACGTGTCT CTCACGCA
 
Protein sequence
MFERFTEKAI KVVMLAQEEA RRLGHNFVGT EQIMLGLIGE GTGIAAKVLK SMGISLKEAR 
IEVEKIIGRG SGFVAVEIPF TPRAKRVLEL ALEEARQLGH NYIGTEHLLL GLLREGEGVA
ARVLENLDAD PAKIRSQVIR MVGETQEAVG AGAGGGQGAQ SGSKTPTLEE FGSDLTKKAE
EGKLDPCIGR ANEIVRVTQI LGRRTKNNPC LIGEPGVGKS AIAEGLAQKI AANDVPDTLD
SKRMMTLDMG LLVAGTKYRG EFEERLKKLM DEVKSDENII LFIDEVHTLI GAGAAEGAID
AANILKPALA RGELQCIGAT TIDEYRKHIE KDPALERRFQ PVQVPEPSVD ETIQILRGLR
ERYELHHKLK YDDDALIAAA KFSSQYISDR FLPDKAIDLI DEAGSRVRLE NAALPEEAKE
LDKELKALMK EKDTAIRSQD FEAAGGLRDR EVELRAQIKQ ITERKQEENK AKAESGDASG
PTVVEQDIAD IVAAWTGIPV DKVSSDEGTR LMDMEETLHK RLVGQEEAVV ACARAIRRAR
TGLKNPNRPV ASFIFSGPTG VGKSELAKSL SAFYFGSEEA MVRLDMSEFM ERHTVSKLIG
SPPGYVGYSE GGQLTEAVRR RPYTLVLFDE IEKAHPDVFN MMLQILEDGR LTDSKGRVID
FKNTLIIMTS NVGASAIEKG GGGLGFQLDD NAEDQSYNRI KSLVMEDLKN YFRPEFLNRL
DEIIVFRQLN KQEVREIAYI MLENVFKRLK EKEIVLECTE RFKDRLVDEG FSPAYGARPL
RRAIMRLLED NLSEKMLTGE ISEGSSCIMD VNAEGEITVL TGDGRELKAG SAIGGPAGIA