Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_14352 |
Symbol | PPR1 |
ID | 5000287 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | - |
Start bp | 799469 |
End bp | 800803 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | |
GC content | 61% |
IMG OID | 640415708 |
Product | pentatrichopeptide repeat (PPR) protein |
Protein accession | XP_001416498 |
Protein GI | 145343933 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00756] pentatricopeptide repeat domain (PPR motif) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.373458 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGCGG ATGACTCTGA GGTGTGGCCG AACGCGCACG TGTGCACGAC GATGATGTCG TTGGCGACGC GCGCGCGAGA CGCGGATAAG GCGTTGGCGA CGCTGGAGTG GATGAAAAGA CGGAGCGCGG ACGCGCGAGA GGTGGCGCCG ACGACGCACA CGTATACGAC GTGCGTGCAA GCGCTGGACG GCGCTGGACG GTGGGAGGAG GCGCTGGGGA TGCTGGGGGA GATGAAGCGC GCGGGGACGG CGAGAAACGC GCACACGTAC AGCGCGTTGG TCAGAGCCGG GGCGAACGGC GGGCGCGCGG GGGCGCGCGC GGTGGTGAAG TTGATTCCAG AGATGAAACG CGATGGCGTG GAACCGGACT TGGCGATCGC GTCGGCGCTC ATCAGCGCTT TCGGGGTTTT GGGGAGCGAA GACAACGCGA GGGCGATGTT GCGAGCCATC GAGTCGAGCG GGCGGGCGAC GGATAGCAAG CTGTACGTGG ATTACATGAC GGCGATGTGT CGATGTGGGA ACTACGAAGA GGCGATAGAA ACCTTCACGC GCGTGCCTCG CACGACTTTC ACGTGCACGG CGGTCATGAA GGCGTACGCT GAGGCGATGG ATTGGAAGCT CGCCGAGGCT TTGTTCGAGG ACATGCGTCG CGACGGACCG GCGCCGAACG ACCACACGTA TAGCGCCATG TTGCACGCGT ACGAAAAGAG TTTGCAGTGG GAGCGCGCGG TGCGCTTGCT CAATCAGCTC ACGAGCGAAG GAAAGGCTAA GGAGATACAT CACAACATTG TCCTCAGCGT TTTGGGCAAG TGCACGCAGT GGCAGCGCGC GGAGGTTTTA TTTAGAGACA TGCGAGAGTT GTACAACATT CAGCCCTCTC GCGTGACGTA TAGCACGCTG ATTTCGGCGT ACGGACGAAG TGGGAAAACT GATCTAGCGC GCAAGGTGTT CGAGCAAATG CTCGCGCGGC GTATTCCGCC AGACGACTAC ACCTTTGTCG GCTTGATGCT CGGTCCCGCG AGCGAGGGCA ACTTTCGAGA GTGCTCAAAA ATCGCGACCG AGATGAAGGA GTTGTACGAC GTCGAGCACA CCGTGCACAC GTACAACACG CTCATCCAAG CCGCGGACAT CGCCGGTAAT TACGACAAGG CGGTGGAGGT GTACGACGAA CTCTTGCAGC GCGGCATCGA GCCGAACAAC ACGACGAGAG AACTCGTCGT CGCCGTGAGT AAGCGCGGGG CGAACTTTTA CGATCGCCAA CAAAAGACCG CCGCCGTCGC ATCCTACGCC GCAGGTCTCG TCGGCGTCAT GGGCATGGCG CTCGGTCGCT GGTAG
|
Protein sequence | MTADDSEVWP NAHVCTTMMS LATRARDADK ALATLEWMKR RSADAREVAP TTHTYTTCVQ ALDGAGRWEE ALGMLGEMKR AGTARNAHTY SALVRAGANG GRAGARAVVK LIPEMKRDGV EPDLAIASAL ISAFGVLGSE DNARAMLRAI ESSGRATDSK LYVDYMTAMC RCGNYEEAIE TFTRVPRTTF TCTAVMKAYA EAMDWKLAEA LFEDMRRDGP APNDHTYSAM LHAYEKSLQW ERAVRLLNQL TSEGKAKEIH HNIVLSVLGK CTQWQRAEVL FRDMRELYNI QPSRVTYSTL ISAYGRSGKT DLARKVFEQM LARRIPPDDY TFVGLMLGPA SEGNFRECSK IATEMKELYD VEHTVHTYNT LIQAADIAGN YDKAVEVYDE LLQRGIEPNN TTRELVVAVS KRGANFYDRQ QKTAAVASYA AGLVGVMGMA LGRW
|
| |