Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31719 |
Symbol | PEX5 |
ID | 5001804 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 420370 |
End bp | 422588 |
Gene Length | 2219 bp |
Protein Length | 712 aa |
Translation table | |
GC content | 60% |
IMG OID | 640417225 |
Product | peroxisomal targeting signal 1 receptor |
Protein accession | XP_001418009 |
Protein GI | 145347087 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4235] Cytochrome c biogenesis factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0456101 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.395958 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAGGT TCATGCGCGA TCTCGCGACC GGCGGCGCGT CGTGCGAACC CAGCGACGGC GCCGGTCCGT CGAGCGCCGC GAATCCAATC GCGCGCGTCG CCGACGTCTT ACTCGGCGAT CGAGGCACGC GCGCGCGCGA ACAACAGATA CAGCGCGCGA GCGAACGAGG TTCGTGCGCG CGGACGCTCG ACGCGACGCG CGAGACGACG CGCGAGACGA CGTGACTGAC GGTTGGACTC GGCGCGCAGA CCAACTCGCG CGAGGCGCGG CGAACGCGGC GAATGGGTTA CATTTAGCGA GAGAGGGGCT GTTGGACGGT GTGGAAGACG TGCGTGAAGG AACGATGCGT GCGAGTGGTC GATATGCGGC GGCACCGGTG CAGATGGTGC CGGATGCGAG GTTGGACGCG CGCGGGGCGT CGGCGCTGGA GAAGTCGTTC GCTCGAAATA GGAGCGCTGT GATGGCGAGA CATTTGCAGT CGCAGGGGGA GATGGAAGAC GGCTTCGAGC GCATGCGCAT GGATGACGCT GAGATGAGCG CCCGAGCGCA GATGGCGCAT AATGCGGCGA TGCTTCGCGA GATGCAGCGC GGCGACCGCT GGGCGCGCGG ATTTCAGCAA CCACTTCACG CGCAGGCGTC GCCGAGTTCC TGGGCGGAAG AATTCGCGAA TGTTGAGACT TCAAGGCACG CGCAGTGGGC AAACGAGTTT CACAACCACC CGCCGCCGCC ACACACGATG CACGCGCCAC CGCGCGCCGA TGCGTGGGCG CAAGAGTACA GATCAAATCA AGGCACGCGC TGGGGCGAGG AGTTTGGCGC GGCGCAAAAC AGATGGGCGG ACGAGTTTTC ACAACAACAC GCCAGTGATG TGCAAGCACC GCTTACTGAC GTTGCGGCAC AAACTGCGAA ACAAAGTAGT GAGCTTGCGG CGACGTTGAA CGCCGATCCA AAGTTTGCGA ATTCTAAATT CGCGCAACTC ATGTCCAAAC TTGGTAGCGG GCAAGTCGTC GTGAGAGAAG ACGGTTTGCA GGAGGTAAAC GCGGCACCTC AGGCGCGGCA TGTGGACCAA GGTGAGCGAT GGGCGGCCGA GTTCATCGAG CAAACACAGC AACCGCAACC TCAATGGGCG GACCAGTTTA CAGCGCAGAT GCAGCGTCAA AATCCGACGA GCCAAGCGTG GGCGCAGCAA TTCTCGCAGC GCGAGATGGA ATCGCAAAGG CAGTCAAACG TCGACGATTG GGCAGAAGAG TTCAAGAACG TACCGCGCGA GTGGGCGAGC GAGTTTGAAG ATATGCAGCG CAACAACCCA GAATGGATGC AAAACGTTTG GGACGAAATG CAACAAAATC CGCTTTCAGA ACGGAGCAAC TACAAATTCA CCGACCCGAA CCCATACCTC GGTCAAAGTG GTTTGCAAGA GAAAACGATG GAATTGGCAA AGACGGGCGT ACTCGCCGAA GCTGCCCTCG CCGCTGAGGC GTGGGTGCGT CAAGATCAGA GCAACAGCGA AGCGTGGTAC CATCTGGGAC GCATCCAAGC GGAAAATGAC GATGACCAGC AAGCCATCGC GGCGATGTCA AAAGCGTACG AGGCGAACCC GCAAAATCCA AATGTTCTAC TCGCCCTCGC TGTGAGCCAC GCGAACGAGC TGGACCAGGA TGAAGCCCTC GGCCACGCGT GTGAGTGGCT CGGAAGTCAA GAGCGCTTCA AACACATCGC CGCCGGACAA GCGCCACACA CGCCCGAAAA CGTCATGGCA ATGTTCAGAG AAGCCGCTCG ACAAGCACCG AACGACGCTG ACGTGCAAAC TGTGCTCGGT GTCATGGCGC ATTTGACGAG GAATTACGAA GACGCAGTCA ACGCCTTTCA ACGAGCCGCC AATTTGCGCC CCGACGACCA TTCATTGTGG AACAAAATCG GCGCCACGCA GGCGAACGGC GCCGAGAGTG CCGACGCAGT AGGGGCATAT CGCCGCGCGC TGACCATCAA ACCCAACTAC GTTCGCGCGT GGTCAAACAT GGGCATCAGT TATGCCAACC AAGGCCGGTA CGCCGAGTCG ATGCCTTATT ACATTCGCGC GTTATCGATG AATCCCAACC CCGAAAGTCC GACCTGGGGC TACGTCCGCA TCAGTCTTGG TTGCACCGGA CGATTAGATC TCTTAGAACA CGTCGACAAG CACGACATCG GCGCTCTTCG ACGCGAATTT CCACTCTAG
|
Protein sequence | MSRFMRDLAT GGASCEPSDG AGPSSAANPI ARVADVLLGD RGTRAREQQI QRASERDQLA RGAANAANGL HLAREGLLDG VEDVREGTMR ASGRYAAAPV QMVPDARLDA RGASALEKSF ARNRSAVMAR HLQSQGEMED GFERMRMDDA EMSARAQMAH NAAMLREMQR GDRWARGFQQ PLHAQASPSS WAEEFANVET SRHAQWANEF HNHPPPPHTM HAPPRADAWA QEYRSNQGTR WGEEFGAAQN RWADEFSQQH ASDVQAPLTD VAAQTAKQSS ELAATLNADP KFANSKFAQL MSKLGSGQVV VREDGLQEVN AAPQARHVDQ GERWAAEFIE QTQQPQPQWA DQFTAQMQRQ NPTSQAWAQQ FSQREMESQR QSNVDDWAEE FKNVPREWAS EFEDMQRNNP EWMQNVWDEM QQNPLSERSN YKFTDPNPYL GQSGLQEKTM ELAKTGVLAE AALAAEAWVR QDQSNSEAWY HLGRIQAEND DDQQAIAAMS KAYEANPQNP NVLLALAVSH ANELDQDEAL GHACEWLGSQ ERFKHIAAGQ APHTPENVMA MFREAARQAP NDADVQTVLG VMAHLTRNYE DAVNAFQRAA NLRPDDHSLW NKIGATQANG AESADAVGAY RRALTIKPNY VRAWSNMGIS YANQGRYAES MPYYIRALSM NPNPESPTWG YVRISLGCTG RLDLLEHVDK HDIGALRREF PL
|
| |