Gene Information |
Locus tag | OSTLU_2458 |
Symbol | PPR4 |
ID | 5005867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009370 |
Strand | - |
Start bp | 89686 |
End bp | 90816 |
Gene Length | 1131 bp |
Protein Length | 377 aa |
Translation table | |
GC content | 60% |
IMG OID | 640421288 |
Product | pentatricopeptide repeat (PPR) protein |
Protein accession | XP_001421968 |
Protein GI | 145355437 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00756] pentatricopeptide repeat domain (PPR motif) |
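
The coordinate and length fields in the record above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, which the record does not state explicitly; the helper names `span_length` and `codon_count` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 89686
END_BP = 90816
GENE_LENGTH_BP = 1131
PROTEIN_LENGTH_AA = 377


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def codon_count(length_bp: int) -> int:
    """Number of whole codons in a coding sequence of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return length_bp // 3


if __name__ == "__main__":
    # Does the coordinate span match the reported gene length?
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # Does the gene length correspond to the reported protein length?
    print(codon_count(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both checks hold for this record: 90816 - 89686 + 1 = 1131 bp, and 1131 / 3 = 377 codons. Because the codon count equals the protein length exactly, the reported gene length appears not to include a stop codon.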
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0995914 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGGGCGT ACGTCAGCGA AGGGCGCGTG GAGGAGGCGG AACGCGTGCT GAGGGAGATG ACTGCAGAAG GCGTCAGGGC GGGACCGAGA TTGTTTAACA CGCTGATCAC CGGGTACGGG CGAGAGAAGA ATTTACGAGG CGTCGAGGCG AGCTCGATGG CGATGCGTAC GCTCGGCGTG ACGCCGAATC AAGCGACGTG GGGGGCGAAA GTGAACGCCT ACGTGAGCTG CGATAGATTA GATTTAGCCA TGGATACGCT TGAGCAGGGC GTACGATTCT CTCCGCGCGT CGAACGTCGG CCCGGGGTGC AGGCGTACAC GGCGCTCGTG CAAGGATTGG CGCACTCGGG ACGAGTGGTC GAGGCGGACG AGCTCCTTCG TCGGATGGCT CGAGACGGCG TCAAGCCGAA CGTGTACACA TATTCCACAC TCATCGACGG CTTGGCGAAA AGCGCGCAAA TTGGGCTCGC CGAGACGGCG CTGGCGGAGA TGCGCCGCGC AAAAATCAAG CCGAGCGTCG TGACGTACAA TTCCTTGCTC AAGGGCGTCG TGCGCGGCAT CGGCAAGGCG GATCGCACGG ATGAAGTTCT CCGACGTGCG CGTGAGATGT TTGATAGGAT GCGAGACGAC GGCGTGCCAC CGGATTTGGT GACGTACAAC ACGTTGATTG ACGCGTGCAT CAATGCTCGC GCTCCCGCCG AGGCGTGGAA CATTTTGCGC GAGATATCAG AATCTGGTTT GAAGCCCGAT GTCGTGACGT ACACGACGCT GCTGAAGTAT TTCGTGCAAG TCGGGGACGA CTCGGCGACG CAATGGGTGA TCGCCGAATT GGAAACCGAT CCACAAGTCG TCGAGGACGT CGGCGTGTAC AATTGCTTAA TCAACGCCTA CGCGCGTCAA GGCGACATGT GCCGCGCCGT CGAGACGCTC GAAGGCATGA AGGCGAAGAA CATCACGCCC AACGTATCCA CGTACGGCTC GATATTAGAA GGGTACATTC GCCTAGGGAA CGTGGGCGAG GCTTTTAAGG TGTACAATTT GTGCGTAAAG TCGGCTGGAC TGGCGCCGGA CGCTCGCATG CGAAAGTCGC TCATCTACGG TTGCGGTTTG CACGGAATGT CGGACATTGC C
|
Protein sequence | VGAYVSEGRV EEAERVLREM TAEGVRAGPR LFNTLITGYG REKNLRGVEA SSMAMRTLGV TPNQATWGAK VNAYVSCDRL DLAMDTLEQG VRFSPRVERR PGVQAYTALV QGLAHSGRVV EADELLRRMA RDGVKPNVYT YSTLIDGLAK SAQIGLAETA LAEMRRAKIK PSVVTYNSLL KGVVRGIGKA DRTDEVLRRA REMFDRMRDD GVPPDLVTYN TLIDACINAR APAEAWNILR EISESGLKPD VVTYTTLLKY FVQVGDDSAT QWVIAELETD PQVVEDVGVY NCLINAYARQ GDMCRAVETL EGMKAKNITP NVSTYGSILE GYIRLGNVGE AFKVYNLCVK SAGLAPDARM RKSLIYGCGL HGMSDIA
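
The gene and protein sequences above can be compared by translating the nucleotide sequence codon by codon. The sketch below is a minimal illustration that assumes the standard genetic code (NCBI translation table 1), since the Translation table field in this record is empty. It also assumes the gene sequence is already shown in coding orientation, which the first codons are consistent with. The function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers, and only the first three 10-base blocks of the gene sequence are used as sample input.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1; trailing partial codons are ignored."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record.
    prefix = "GTGGGGGCGT ACGTCAGCGA AGGGCGCGTG"
    print(translate(prefix))  # VGAYVSEGRV
    print(gc_fraction(prefix))
```

The translated prefix matches the first ten residues of the listed protein sequence. Note that the sequence begins with GTG, which this plain table lookup renders as V, as in the record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.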