Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_25441 |
Symbol | PPR3_13 |
ID | 5005213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009367 |
Strand | - |
Start bp | 455138 |
End bp | 457413 |
Gene Length | 2276 bp |
Protein Length | 739 aa |
Translation table | |
GC content | 57% |
IMG OID | 640420634 |
Product | pentatrichopeptide repeat (PPR) protein |
Protein accession | XP_001421156 |
Protein GI | 145353726 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00756] pentatricopeptide repeat domain (PPR motif) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.224466 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.223836 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACGA CGCCGAATTG CGCGCCGAAT ATTCGTTCGT ACAACGGCGT CATATCCGCG GCGACGCGCA AAAAGCACTT CCCGGGCGCG ATGTGGGCTT GGGAAGAGAT CGAGACGGCG AATTTACAGC CAACCATGAT CACATACGGC GCCATGCTCG CCGCGGGCGC GGCGGCGGAC GACGTCGACG TGCGTTGGTC GGAGGACTTG TTCGCTCAAG CGTTGGAATC TGGCGCGTGT GGCAACGCAG GGAACGATCA CATGGTGACT TCTATGCTCC AGACGTATGC TCGCGGCGTC GCGCTCGAGC AAATCGAGCG TGATGTCGCG ATGGAACGCG GAGAGAGCGT CGTGCAAGCG CTCATAGAGG ACGCGCAGTG GGACGAACGG CGCGCCGAGT CAACGCCCAA CGGACGCGTC TGGTCGGCGT TGATAACGCT TTGCGCTCGA TGTGGGCGCG CCGCGCGCGC GATCGAAGTG CTCAAGATTA TGATTTCTAC TCGCTCGCAT CGTGTGGGAC ACGAATGGTT GCACTTGACG TACGCCTTGA CGTCCGCGCT CGAAGCGTCG AAGGAGGGCA TCGAGTACTT TGCCCGCGTG CAGCGCGAAA TCGAACAGAG CCCGGCGATG GTTCGAGACT GCACGGGCGT TCGCAACGGC TTAATCGCCA CACATTTGCA CTTTGGTGAT TTCAAGAGTG CGTTTAAAGT CTACGACGAC TTCAAAAACG ATATTTTCAA GTACCGCAAG GCGCACGAAG AGAATTGGAA CGCGCGCGCG CGGAGAGGGT TGGAACGTCA ACTTCCAGAT ACGATTACGT ACAACTCCCT CATCTACGCG TGCGCTGATG ACGACGTCAA GGCCATGGGA CTGTATCACG ACATGGTCTC GAACGGAATC AATCCTACAG TGCGAACGTA CGTCGCGCTC ATCGTCGCCT TGAGCCGATC AAAGCGTGGA AGTAAAGTCA CCGAGGCGGA AAAAATATTC AAAGCCGCCA TCGACGACGG CGTGACGCCG AACGAGTTTT TGTTCACCGC GCTCATGGAC GCTCAGGTCA AAGGCAACCG GCCGTTATCT GCGTTTGAAA CGTACGCGCG CATGATTGAA GCTGATGTCA ACTGCACGAC TGTGACTTTC GGGTGCGCAC TGCAAGCGTG CTGTTACGTC GAGGATGTCG AGGAGAGCGT CGAGCGCGCG TACTCGGTGC TTCGTGATAT GACGGAGCGT GACGTCCAGA TGAACGACTG GTGCTCTAAC ACGTTCTTGC GCGTCATATC ACGCGCCGGT CGCATTGAAG AAATGCTCGA AGAAGTGAAA AAGACGGTTC GTCGTAAAGG CAAGCTCGAA CAAGAGACGC TTGAAGCCAT CATTCGGGCC TTGTGTTCGG CGGGTTACGT CGAGCGCGCC AATCGCTTCA TCTCTATGAT GAACTCGCGT AACTTGGAAC CGCGTGAGCA AACGTTCAAG GAGTTCATCG TCGCGAGTAG TCGCGATGGA TTCGTCGATT GGGCTTGGGA GTCGTACAAA CGTTTCACAC GTTTAGGACA CAAACTCGAC GCGGGCACGC GGTCGGCGCT AGTCACCGTC TTGAGCGTCG CTTCGACGAG CCCGGACCCC GACGATGCGG AACTCTTGCT CGCTCGCGCG ATCGGCGTCT TCGAAGCCGC GTTTAAACGC GCGGATGAAG ACAATGGGCC ACAAGTCTTG GACGTCATCG ACGCCGAAGC GCGATGCGCG CTCATAGTCG CCATGGCGCG AAGCGAAAAA CTTGACCGAG CATTGGATAT CTGGCGCGAC TCCCCAAAGG CTCAATCATT TTCCAAGGCG CGTAAGCACA CTTCGAGTGA AGGGAACGAT TACATCGGCG ATGTGCGCGC CATGTATGAG TGCCTCATCG AAGTGTGCTG TCACGAAGAT CGCATCGACG ACGCGCTGGA GGTGTTCGAT CACTTGAAGG ACGCCGGCGT GCGCGTCAGT ACGGTGACGC TCGCGTTTTT GGAATCATCG TGCCGCCGAT GTCGCGTGGA GGAATGGCGA ATGTTTGACG TGTGCGCACA AATGCGCGCG CAAGTGGAAC AAAAAAACGA AGGTCGCCTG GCGAAGCCGA CGAAGATGAG CCACCACGTG CGCGACGACG GCAACATCGC GAGCGAACTC GCCACGGATG GACTGGGGGG CGAACAAAAG ACGTCGGCGT GGCGCAAAAA CGTCGATTGA TACACTACAT TTACAAGCAC CGCACGCCGC GCTGCATTTG CTACCACTAC TACTGC
|
Protein sequence | MKTTPNCAPN IRSYNGVISA ATRKKHFPGA MWAWEEIETA NLQPTMITYG AMLAAGAAAD DVDVRWSEDL FAQALESGAC GNAGNDHMVT SMLQTYARGV ALEQIERDVA MERGESVVQA LIEDAQWDER RAESTPNGRV WSALITLCAR CGRAARAIEV LKIMISTRSH RVGHEWLHLT YALTSALEAS KEGIEYFARV QREIEQSPAM VRDCTGVRNG LIATHLHFGD FKSAFKVYDD FKNDIFKYRK AHEENWNARA RRGLERQLPD TITYNSLIYA CADDDVKAMG LYHDMVSNGI NPTVRTYVAL IVALSRSKRG SKVTEAEKIF KAAIDDGVTP NEFLFTALMD AQVKGNRPLS AFETYARMIE ADVNCTTVTF GCALQACCYV EDVEESVERA YSVLRDMTER DVQMNDWCSN TFLRVISRAG RIEEMLEEVK KTVRRKGKLE QETLEAIIRA LCSAGYVERA NRFISMMNSR NLEPREQTFK EFIVASSRDG FVDWAWESYK RFTRLGHKLD AGTRSALVTV LSVASTSPDP DDAELLLARA IGVFEAAFKR ADEDNGPQVL DVIDAEARCA LIVAMARSEK LDRALDIWRD SPKAQSFSKA RKHTSSEGND YIGDVRAMYE CLIEVCCHED RIDDALEVFD HLKDAGVRVS TVTLAFLESS CRRCRVEEWR MFDVCAQMRA QVEQKNEGRL AKPTKMSHHV RDDGNIASEL ATDGLGGEQK TSAWRKNVD
|
| |