Gene OSTLU_31719 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31719 
SymbolPEX5 
ID5001804 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp420370 
End bp422588 
Gene Length2219 bp 
Protein Length712 aa 
Translation table 
GC content60% 
IMG OID640417225 
Productperoxisomal targeting signal 1 receptor 
Protein accessionXP_001418009 
Protein GI145347087 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4235] Cytochrome c biogenesis factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0456101 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.395958 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAGGT TCATGCGCGA TCTCGCGACC GGCGGCGCGT CGTGCGAACC CAGCGACGGC 
GCCGGTCCGT CGAGCGCCGC GAATCCAATC GCGCGCGTCG CCGACGTCTT ACTCGGCGAT
CGAGGCACGC GCGCGCGCGA ACAACAGATA CAGCGCGCGA GCGAACGAGG TTCGTGCGCG
CGGACGCTCG ACGCGACGCG CGAGACGACG CGCGAGACGA CGTGACTGAC GGTTGGACTC
GGCGCGCAGA CCAACTCGCG CGAGGCGCGG CGAACGCGGC GAATGGGTTA CATTTAGCGA
GAGAGGGGCT GTTGGACGGT GTGGAAGACG TGCGTGAAGG AACGATGCGT GCGAGTGGTC
GATATGCGGC GGCACCGGTG CAGATGGTGC CGGATGCGAG GTTGGACGCG CGCGGGGCGT
CGGCGCTGGA GAAGTCGTTC GCTCGAAATA GGAGCGCTGT GATGGCGAGA CATTTGCAGT
CGCAGGGGGA GATGGAAGAC GGCTTCGAGC GCATGCGCAT GGATGACGCT GAGATGAGCG
CCCGAGCGCA GATGGCGCAT AATGCGGCGA TGCTTCGCGA GATGCAGCGC GGCGACCGCT
GGGCGCGCGG ATTTCAGCAA CCACTTCACG CGCAGGCGTC GCCGAGTTCC TGGGCGGAAG
AATTCGCGAA TGTTGAGACT TCAAGGCACG CGCAGTGGGC AAACGAGTTT CACAACCACC
CGCCGCCGCC ACACACGATG CACGCGCCAC CGCGCGCCGA TGCGTGGGCG CAAGAGTACA
GATCAAATCA AGGCACGCGC TGGGGCGAGG AGTTTGGCGC GGCGCAAAAC AGATGGGCGG
ACGAGTTTTC ACAACAACAC GCCAGTGATG TGCAAGCACC GCTTACTGAC GTTGCGGCAC
AAACTGCGAA ACAAAGTAGT GAGCTTGCGG CGACGTTGAA CGCCGATCCA AAGTTTGCGA
ATTCTAAATT CGCGCAACTC ATGTCCAAAC TTGGTAGCGG GCAAGTCGTC GTGAGAGAAG
ACGGTTTGCA GGAGGTAAAC GCGGCACCTC AGGCGCGGCA TGTGGACCAA GGTGAGCGAT
GGGCGGCCGA GTTCATCGAG CAAACACAGC AACCGCAACC TCAATGGGCG GACCAGTTTA
CAGCGCAGAT GCAGCGTCAA AATCCGACGA GCCAAGCGTG GGCGCAGCAA TTCTCGCAGC
GCGAGATGGA ATCGCAAAGG CAGTCAAACG TCGACGATTG GGCAGAAGAG TTCAAGAACG
TACCGCGCGA GTGGGCGAGC GAGTTTGAAG ATATGCAGCG CAACAACCCA GAATGGATGC
AAAACGTTTG GGACGAAATG CAACAAAATC CGCTTTCAGA ACGGAGCAAC TACAAATTCA
CCGACCCGAA CCCATACCTC GGTCAAAGTG GTTTGCAAGA GAAAACGATG GAATTGGCAA
AGACGGGCGT ACTCGCCGAA GCTGCCCTCG CCGCTGAGGC GTGGGTGCGT CAAGATCAGA
GCAACAGCGA AGCGTGGTAC CATCTGGGAC GCATCCAAGC GGAAAATGAC GATGACCAGC
AAGCCATCGC GGCGATGTCA AAAGCGTACG AGGCGAACCC GCAAAATCCA AATGTTCTAC
TCGCCCTCGC TGTGAGCCAC GCGAACGAGC TGGACCAGGA TGAAGCCCTC GGCCACGCGT
GTGAGTGGCT CGGAAGTCAA GAGCGCTTCA AACACATCGC CGCCGGACAA GCGCCACACA
CGCCCGAAAA CGTCATGGCA ATGTTCAGAG AAGCCGCTCG ACAAGCACCG AACGACGCTG
ACGTGCAAAC TGTGCTCGGT GTCATGGCGC ATTTGACGAG GAATTACGAA GACGCAGTCA
ACGCCTTTCA ACGAGCCGCC AATTTGCGCC CCGACGACCA TTCATTGTGG AACAAAATCG
GCGCCACGCA GGCGAACGGC GCCGAGAGTG CCGACGCAGT AGGGGCATAT CGCCGCGCGC
TGACCATCAA ACCCAACTAC GTTCGCGCGT GGTCAAACAT GGGCATCAGT TATGCCAACC
AAGGCCGGTA CGCCGAGTCG ATGCCTTATT ACATTCGCGC GTTATCGATG AATCCCAACC
CCGAAAGTCC GACCTGGGGC TACGTCCGCA TCAGTCTTGG TTGCACCGGA CGATTAGATC
TCTTAGAACA CGTCGACAAG CACGACATCG GCGCTCTTCG ACGCGAATTT CCACTCTAG
 
Protein sequence
MSRFMRDLAT GGASCEPSDG AGPSSAANPI ARVADVLLGD RGTRAREQQI QRASERDQLA 
RGAANAANGL HLAREGLLDG VEDVREGTMR ASGRYAAAPV QMVPDARLDA RGASALEKSF
ARNRSAVMAR HLQSQGEMED GFERMRMDDA EMSARAQMAH NAAMLREMQR GDRWARGFQQ
PLHAQASPSS WAEEFANVET SRHAQWANEF HNHPPPPHTM HAPPRADAWA QEYRSNQGTR
WGEEFGAAQN RWADEFSQQH ASDVQAPLTD VAAQTAKQSS ELAATLNADP KFANSKFAQL
MSKLGSGQVV VREDGLQEVN AAPQARHVDQ GERWAAEFIE QTQQPQPQWA DQFTAQMQRQ
NPTSQAWAQQ FSQREMESQR QSNVDDWAEE FKNVPREWAS EFEDMQRNNP EWMQNVWDEM
QQNPLSERSN YKFTDPNPYL GQSGLQEKTM ELAKTGVLAE AALAAEAWVR QDQSNSEAWY
HLGRIQAEND DDQQAIAAMS KAYEANPQNP NVLLALAVSH ANELDQDEAL GHACEWLGSQ
ERFKHIAAGQ APHTPENVMA MFREAARQAP NDADVQTVLG VMAHLTRNYE DAVNAFQRAA
NLRPDDHSLW NKIGATQANG AESADAVGAY RRALTIKPNY VRAWSNMGIS YANQGRYAES
MPYYIRALSM NPNPESPTWG YVRISLGCTG RLDLLEHVDK HDIGALRREF PL