Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_24879 |
Symbol | PPR5 |
ID | 5002738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | - |
Start bp | 774010 |
End bp | 775358 |
Gene Length | 1349 bp |
Protein Length | 424 aa |
Translation table | |
GC content | 63% |
IMG OID | 640418159 |
Product | pentatrichopeptide repeat (PPR) protein |
Protein accession | XP_001419038 |
Protein GI | 145349220 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00756] pentatricopeptide repeat domain (PPR motif) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 49 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCGCAACGAC GCGCGTCGAC GCCATGGCGC CGCACCCGCG TCGCGCCGCG GGCGCGAAGA CGCCGCGCGA GGCGAAGAAG AAGCCGTCGT CGGGTAAGCG CGCCGGGTAC AAGCCGCACG GGTCCGTGGT GTACGAGACG ACGGAGGCGC TGTCGCGAAT CGCTGAAAGC GCGCTCGACG CGAACGAGCG GCGCCGCGCG ATTTTCGATG CGCTGGCGCG CGCGATGAGC GCGGACGATA ATCCGCGACC GCTGCGCGCG GACCCGCGGG CGTTCACGAC GTTCATCTCG CGATTGAATC GAGGGAAACG CTTCGACGCG GCGTTGGACG TCTTCGCGGC GCAAAAGGCG CTCGGCGTCG AGCGAAACGC GGTGAATTAC AACGCCGCCA TGGTGGCGAA CGTCAAGGCG GAGAAACCTG AAGAGGCGTT GAAGTTGTTC GAAGAGATGC GCGAGATCGG GCACGAGCCG AGCGTGATAT CTTTCAACGT CGCCATGGGG GCGTGCGCGC GCGCGGGCGA CGGCGCGCGC GCGTTGAAAC TTTTCGACGA GATGGTGGGG CAGAACATGG ACGTGGACGC GGTGAGCATC AACACCGCAA TGGCGGCGGC GGAGTTGGTC GGTGATGAGG CGCGCTTGGA AGAACTGCGC GCTGGGTCGC ATTTCAAGCG CGCGGGCGAC GTCGACGAGC CGGCGCCGAC GCGAGTCGTT AAAGACGACG CGAAGAAAGA CAGCGACGCG AGCGACGACG AAGATAGCGA CGACGACAGC GACGACGACA GCGACGAAGG CGAGCCGGAA GCGGTAGCAG ACGACGTCGG CGACGCGAAT GTGGTTTTGA GTGAAGAAGA AGAGGCGAAG CGCGCGGCGG CGAGAAAGCG CAAGCGCGAG ATTAAGAAAA AGTTATTAGC CGAATACGCC GCGAACGTCG GCGGTGCCCG CGACGACGAG GACGAGGAGA TGGCGAAGCG CCGTCGAAAA GACGAGCGAC GAGCCAAGTT TGAGGAAAAA CGGCGCGCGC GGAAAGAGGC GACGCGCGCG AAGAGACCTT GGTCCGCCGC CGCCGCGCAC GCGAAGAAGA AAGAAAATAA GACGTCCAAG GCGGCAAAGC GCACGGCGAT GAAGGCGGAT CCGAAAGTTT GGATCGAAGA CGTTCCTTCC GAAGATGAAG TTCTGTACGA CGACGATGGT GAACGCGTTT TGCCCGCGAT CAAAGGTCCT TCGTTTCTCG ACGACCCAAA CGCGGCCAAG CTGATGAACG CGTCCGGCGG CGAAGACTTT TGGTCGGTTG GATTCTAGCA ATGGTTACAC TGGTTACACG AGTAAAATAC TTCCTAAAAT AGCGATAGA
|
Protein sequence | MAPHPRRAAG AKTPREAKKK PSSGKRAGYK PHGSVVYETT EALSRIAESA LDANERRRAI FDALARAMSA DDNPRPLRAD PRAFTTFISR LNRGKRFDAA LDVFAAQKAL GVERNAVNYN AAMVANVKAE KPEEALKLFE EMREIGHEPS VISFNVAMGA CARAGDGARA LKLFDEMVGQ NMDVDAVSIN TAMAAAELVG DEARLEELRA GSHFKRAGDV DEPAPTRVVK DDAKKDSDAS DDEDSDDDSD DDSDEGEPEA VADDVGDANV VLSEEEEAKR AAARKRKREI KKKLLAEYAA NVGGARDDED EEMAKRRRKD ERRAKFEEKR RARKEATRAK RPWSAAAAHA KKKENKTSKA AKRTAMKADP KVWIEDVPSE DEVLYDDDGE RVLPAIKGPS FLDDPNAAKL MNASGGEDFW SVGF
|
| |