Gene Information |
Locus tag | OSTLU_14603 |
Symbol | PPR2 |
ID | 5000950 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 420165 |
End bp | 422543 |
Gene Length | 2379 bp |
Protein Length | 792 aa |
Translation table | |
GC content | 58% |
IMG OID | 640416371 |
Product | pentatricopeptide repeat (PPR) protein |
Protein accession | XP_001416655 |
Protein GI | 145344261 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00756] pentatricopeptide repeat domain (PPR motif) |
|
|
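The length fields above follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon (both assumptions are consistent with the values in this record, but the record does not state them), the arithmetic can be checked as follows. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


# Coordinates taken from the record (Start bp / End bp).
length = gene_length_bp(420165, 422543)
residues = protein_length_aa(length)
print(length, residues)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length rows of the record.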
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0183985 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTCGCG CGGTCGCGCG CGCCTGCGCC CGGGCGCGAT CGCGCGCGGA CGCGCAAACG TGCGGTCGAA CGCGCGCGCG CGACGAGTCG GCGGTGTGGA CGTCGGTGCG CGCGACGACG CCGAGGTGTT GGCTCGGGAC AAAGGTTGAA CTCGCCGAAG ACGACGGGAC GAGCGCGAGC GCGAGAGAGG ACAGCGCGAG AGAGCAGACG CTGCGCCGAC GGCACGTGGC GAGCATGCGA CGCGCGCTTC GCGCGAATGA TTTTGAGGGC GTGCGGAGAG AGTTTGAGGC GCTCGCGGCG TCGACGACGA AACCGGGACG CGCGGCGTGG AACGTAAACG TGCTCGCGGC GGCGCGAGCG CGAGAGCCCG AGGCGGCGCT GTCGATCGTC GAGTCAATGC TCGCGGATAA TATCGGACCG GATGTGAGCA CGCACACGGC GGTGATGCAG GCTTACGTGC GCGCGGGACG ATACGGGGAC GCGTTTAACT GGTTGAAGAT GCACCTGTAC GGCGTACCGA AGGACGTTGA ACAAGGCTTT GATGGCGAAG ATAACGTAGA AATGGACACG AGGTATGATC TATCACATGT GATTGGGGAC ATGATCGATG GCGATGAGGA ACACGTGTCG AAGACGCGCG TGAATAGCAT GATGTTTACA ACCGTGATGA ACGGCGCGGC AAAGGCGGGC GACCGAGCAA CGGTGCAAGA AATCGAGACG ATGATGTTTG AGTTCGATGT GGCACCTTTA GCAGATACGT TACACTGTTT GCTCAAGCTC GAGCGCGTCG CGGGGACGAG TGCATCGGTG GAATCTGTGT GGGATCGTTC CAAGCTCAAA GCGCGCGCGT TGAAAGCACA TCAAGAACGC GTCGTCGCTC ACGCGTTTCT GGGTCGCCAA AGAACGCAGC GCGCCGCCGA ATCGCGTGCG CTCGCAGTGA AAGCATTAGA AGATCTGTAC GATCGCGTGG GTCGGAAGCG CTCGGACGAA GACTTTGCGA AGCTGGAGAA GAACAATCAT TCAGGGTCGA GATATGCTCG CTGGACCGGC GAACGTGGTT TAGAAGAGTA CCGAAACAAC TTGGTGAGCG ACGACGATAT CAAGAAAGTC GGAACGAAGG AAGTTTTAAT AGCCACGAAC GCCGTGATGC TCGCGCATGC TGCGGTCGGT GACACGGACA CAGTCAATAC ATGGTTTGTT CGAATGGAAG AGGATTTGGG CATCAACGCA GACGTGCGAA CGTTCAACGC AGTACTACGC GCCGAATACA TGCGAGATAA GATAAAGCTC GGTGTCGTGG ATCATGAGCA AGCAATGATG CGATTTCAAG AAGTGATGCT TGCGATGGAA GATCGAGAAG TCGAGCCGAC GACGTACACG TTTTCGACAC TTCTCTTGGC GCACGCAGAA CAAAGCCACT TACCCGGTGT CGCAGAGGTT TTGAAGCTCA TGCAAGAGCG AGGACTCGCG CTTGATACTA CGATGTACAA TATTTTATTG GGTGCTTGCG CGCGAGCGGG TGATTTAGAC GCTGCGCTCA ACGCGCGGGC AAGCATGGTC AAGTCTGGCA TTGCCGCCGG CCCAGACACG TTCGTGCCGC TCTTCGCCGC GTGCACGAAA CAGGCGGAGG AACTGGACTT GGACGGTGAT ATGTTTGAAT TAGATGAAGA AGTCGTTGGA CCTCTGTTAG ATCGCACACG AGCGACTTTA GATAGCGTGG AGTTGGATAT GCTCTCAAGC AACGTCGAGC ACAACACGCA GTCGTTCACG GCTTTGGTCA AAGCCAGAGG GGCTTTGGGT 
CAGCCAGACG CCGTGTTCGA CATGATCAGC GAATTGCCAG AAAACATCGA ACTCGACGAA GTCGCCATCG GCCTGAGTGT TTTGGCGGTT GCGAAGCGTG AACCAACCAA GGCTATTGCG CTCGCCGATT CCCTCATGGA CGAGTCGATG GTGCTGGATC CGTGGCTTCT CAACTGCATC ATCACCGCTT ACTCTCATCT CGGTCAAATC AAAGCAGCTA CCGAGCGCGT GCAAGACTTC ATCAAGCGAG GCGGTGTGCC GACTGTCGCA ACGTACAACG CGCTCTTCCG ATGCGCCGCC CAGAGCGGCA GCTTTGCCGA ATACGCCCCG CTCGTCATGA ACGAGATGTC ATCGCGCGAT CTCAAACCGG ACAAGTACAC TCGCAAATTT ATCGCTGCTG TCACCGCCGG TTCATCCGTC CTCGATCGCG AGACGGCGGA AGCGCTTCTC AAAAAGTGTC CCGGCGGCGA CAAGTTCTTC GTCGCCGAAG ACGCCGCCGG ACAGGACGAC TTCGATTTCC TCGCCGACGC CGGCTTCGAC GACGACGACG ACGACGACGA CGACGATCTA TTATTGTAA
|
Protein sequence | MFRAVARACA RARSRADAQT CGRTRARDES AVWTSVRATT PRCWLGTKVE LAEDDGTSAS AREDSAREQT LRRRHVASMR RALRANDFEG VRREFEALAA STTKPGRAAW NVNVLAAARA REPEAALSIV ESMLADNIGP DVSTHTAVMQ AYVRAGRYGD AFNWLKMHLY GVPKDVEQGF DGEDNVEMDT RYDLSHVIGD MIDGDEEHVS KTRVNSMMFT TVMNGAAKAG DRATVQEIET MMFEFDVAPL ADTLHCLLKL ERVAGTSASV ESVWDRSKLK ARALKAHQER VVAHAFLGRQ RTQRAAESRA LAVKALEDLY DRVGRKRSDE DFAKLEKNNH SGSRYARWTG ERGLEEYRNN LVSDDDIKKV GTKEVLIATN AVMLAHAAVG DTDTVNTWFV RMEEDLGINA DVRTFNAVLR AEYMRDKIKL GVVDHEQAMM RFQEVMLAME DREVEPTTYT FSTLLLAHAE QSHLPGVAEV LKLMQERGLA LDTTMYNILL GACARAGDLD AALNARASMV KSGIAAGPDT FVPLFAACTK QAEELDLDGD MFELDEEVVG PLLDRTRATL DSVELDMLSS NVEHNTQSFT ALVKARGALG QPDAVFDMIS ELPENIELDE VAIGLSVLAV AKREPTKAIA LADSLMDESM VLDPWLLNCI ITAYSHLGQI KAATERVQDF IKRGGVPTVA TYNALFRCAA QSGSFAEYAP LVMNEMSSRD LKPDKYTRKF IAAVTAGSSV LDRETAEALL KKCPGGDKFF VAEDAAGQDD FDFLADAGFD DDDDDDDDDL LL
|
| |