Gene OSTLU_25048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_25048 
SymbolClpR1 
ID5003780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp366933 
End bp368241 
Gene Length1309 bp 
Protein Length381 aa 
Translation table 
GC content58% 
IMG OID640419201 
Productchloroplast Clp protease, subunit of ClpP peptidase complex 
Protein accessionXP_001419790 
Protein GI145350809 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0740] Protease subunit of ATP-dependent Clp proteases 
TIGRFAM ID[TIGR00493] ATP-dependent Clp protease, proteolytic subunit ClpP 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.333156 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGCGACGTTC ACGCCTCGTC GTAAACGCGC TCGATCGATC GAAAGACTTT AAACACGCCG 
AACGAAGGCG CGAACGAGTC CAACGATGCT CGGCGCGCGC GCGACGTCGA CGACGACGCA
GCGAGTAACC CTAAGAGGCC ACATCGCGAA GAATGCGCTC GCGACGCGCG CGGTGACGCC
CGCGCGCGCG AATCGAGGCG CGGTAAATGT GACGAAGAGT ATGAACTTTG GACACGAGGA
CTTTTTCGAC GTCGTGGGCA CGGAGCGGGA CTTGATTTTG GAGGAAAGGT TTCCAGACGT
GAAGGGCGCC GGGCGCGGGC TGACTAAGAA TCAAATCGAA GCGCTCGGGT TGGCGGGAAG
CGAGGCGAGA GAACGATTCA GCGTGAAGGC GCTTGATTTA GGCGCGCGTG CGGCGTACGC
GAAGGAGATG CCGAGCGACA ATAATGAGCC GACGAGGTAC AGAACGGTGA TGAGCGGGGC
GACGATGACG CACGGGGGTA TGGCGCCGGC GGGTGCGGTC GCACCGCCGG ATCTTCCGTC
GCTGTTGTTG AACGCGCGCA TTTGTTACAT CGGTATGTCG CTCGTGCCCG CGGTGACGGA
GTTAGTCGTG GCGGAGTTGT TGTATCTTGG ATACGAGCAG GCGGAAAAGC CGTGCTATGT
GTACATCAAC TCTGGGGGGT CGCTGAACGA GAAGGGCGAA GTTGTGGGGA TGGACAACGA
GGCGTACGCT ATTTTGGACA CCATGCGTTA CATTCGACCG AAAATTCACA CCGTGGCTGT
CGGGAAGTGC CACGGGAACG CGTCGTTGAT TTTGGCGGCG GGCGATAAAG GATGCAGACA
CGCGCTGCCG CACGTGCAAA TTTCCACGCA GCCGCCCAAG TTGAACCGCA CGTTCGACTC
GACGCAAAAC GTTCAAATCC GTGCCAACGA GTGCGCGCTG TACGAAGACA CTTATATGGG
TTTCATGTCC GAGTTCTCTG GAAAGGACAT TGACGTAGTG CGCAAGGACC TCGACCGTAC
GCGTTACTTC ACCCCGAACC AAGCGATCGA GTACGGTTTA ATCGACAAGG TCATCACCAA
GGGCTTGAAC GTGATGGAAG CCCAAGATTA CGAGCGTTTG TTGGCGCAAC AACAAGCACA
ATACGAGTCG GCGGGTGTGC CGATGCCGGG CCAAGAGCAG CAGCAAAACG ATCGCAGTTC
GCACGCCGAC AAGGGGACGG TGCGAAAGTA AACGTCCATC ATGCATGCGC GCGTTCTGGT
AATTTTAGAT TTAGAAGAAA GCTCAACATT GTCATACAAA CGGTACGGA
 
Protein sequence
MLGARATSTT TQRVTLRGHI AKNALATRAV TPARANRGAV NVTKSMNFGH EDFFDVVGTE 
RDLILEERFP DVKGAGRGLT KNQIEALGLA GSEARERFSV KALDLGARAA YAKEMPSDNN
EPTRYRTVMS GATMTHGGMA PAGAVAPPDL PSLLLNARIC YIGMSLVPAV TELVVAELLY
LGYEQAEKPC YVYINSGGSL NEKGEVVGMD NEAYAILDTM RYIRPKIHTV AVGKCHGNAS
LILAAGDKGC RHALPHVQIS TQPPKLNRTF DSTQNVQIRA NECALYEDTY MGFMSEFSGK
DIDVVRKDLD RTRYFTPNQA IEYGLIDKVI TKGLNVMEAQ DYERLLAQQQ AQYESAGVPM
PGQEQQQNDR SSHADKGTVR K