Gene OSTLU_15385 details


Gene Information

Locus tag: OSTLU_15385
Symbol: ClpP4
ID: 5001853
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 294237
End bp: 295196
Gene Length: 960 bp
Protein Length: 319 aa
Translation table:
GC content: 64%
IMG OID: 640417274
Product: chloroplast Clp protease, subunit of ClpP peptidase complex
Protein accession: XP_001417979
Protein GI: 145347023
COG category: [O] Posttranslational modification, protein turnover, chaperones
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0740] Protease subunit of ATP-dependent Clp proteases
TIGRFAM ID: [TIGR00493] ATP-dependent Clp protease, proteolytic subunit ClpP
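
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that encodes no residue.
    """
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(294237, 295196)
residues = expected_protein_length(length)
print(length, residues)
```

Under these assumptions the result agrees with the record: 960 bp corresponds to 320 codons, one of which is the stop codon, leaving 319 residues.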


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.0468078
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.200159
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGACTCG TGACGCGCGC GACGACGACG GCGCCGCGCG GAAGCGCGAC GCGACGCGAC 
GCGACGACGC GGGCGACGCG ATCGAGGCTA GGTTTATCGC GCGCGCGGCG CGCGGACGGG
CGCGGACGCG CGACGCGCGT CGCGCGCGCG GGAGGAGCGT TCTACGAGAA CCCGAACGAA
ATCGCGCACG TCGACGGCGT GAACGAGGCG CAGAGAGAGA TTTACGACGC GGTGAACGCG
AGCGGGGCGC CGGGCGCGCG CGCGGGGCCG GTGGTGCCGC GAGACCAAAG CGGGTCGCCG
TTCGATAGCT TGCTGAGGAA CCGAATCGTG TTTCTCGGCT CGCAGGTCGA TGATTTTAAC
GCCGATGCGG TGATCTCGCA GTTGCTGCTG CTCGATCAGC AAGATCCGAC GAAGGAGATT
AAACTCTTCA TCAACTCTCC GGGAGGGAGC GTGACGGCGG GGATGGGGAT TTACGACGCG
ATGCAGTTTT GCCGAGCGCC CGTGAGCACG GTGTGCCTTG GACTCGCGGC GTCCATGGGG
GCGTTCTTGC TCGCGAGCGG GGAGAAGGGT CGCCGCATGT CGATGCCGAA CGCGCGCATC
ATGATTCACC AGCCGCTCGG CGGGGCGCAA GGGAGCGCCG TGGACATCGA AATCCAAGCG
AAGGAAATCA TGCACCACAA GGCCAACTTG AACAGACTCA TGGCGTTCCA CACCGGACAA
GACGTTAAGA CGATAGACGA AGACACCGAT CGCGATCGCT ACATGTCGCC GCTCGAGGCG
AAGCAGTACG GAGTCATCGA CATCGTCATC GGCGGCGACG ACGCCGGCTT GAAGATCGAG
GGGTCGTTCA CGGAAAAGCT CAAGACGAAG AAGGATTACG TGGCGTGGGG TAACGACGGT
AACGACGGTT CGTCTTCTGC GCGATTCACC GGCGACACGC AGGATGCGAA GCTCAACTAA
 
Protein sequence
MGLVTRATTT APRGSATRRD ATTRATRSRL GLSRARRADG RGRATRVARA GGAFYENPNE 
IAHVDGVNEA QREIYDAVNA SGAPGARAGP VVPRDQSGSP FDSLLRNRIV FLGSQVDDFN
ADAVISQLLL LDQQDPTKEI KLFINSPGGS VTAGMGIYDA MQFCRAPVST VCLGLAASMG
AFLLASGEKG RRMSMPNARI MIHQPLGGAQ GSAVDIEIQA KEIMHHKANL NRLMAFHTGQ
DVKTIDEDTD RDRYMSPLEA KQYGVIDIVI GGDDAGLKIE GSFTEKLKTK KDYVAWGNDG
NDGSSSARFT GDTQDAKLN
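
The gene and protein sequences above can be related to each other by translating the gene sequence codon by codon. The following is a minimal sketch, not the database's own method. It assumes the standard genetic code (NCBI translation table 1), since the record leaves the translation table field blank, and the helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons ordered by base T, C, A, G.
# Using this table is an assumption; the record does not state one.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence and the last twelve bases.
print(translate("ATGGGACTCG TGACGCGCGC GACGACGACG"))
print(translate("AAGCTCAACTAA"))
```

Applied to the first 30 bases, this gives `MGLVTRATTT`, the first block of the protein sequence shown above; the final twelve bases give `KLN` followed by the stop codon `TAA`. Passing the full gene sequence to `gc_content` would allow the GC content value in the record to be checked in the same way.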