Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_29142 |
Symbol | PPR7 |
ID | 4999912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | - |
Start bp | 888728 |
End bp | 890675 |
Gene Length | 1948 bp |
Protein Length | 632 aa |
Translation table | |
GC content | 57% |
IMG OID | 640415333 |
Product | pentatrichopeptide repeat (PPR) protein |
Protein accession | XP_001415964 |
Protein GI | 145341746 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00756] pentatricopeptide repeat domain (PPR motif) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00572015 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGACG ACGCGTTCGA GAGGTTGGAT AAGCAAAAGG CGAGTCGACG GGAGAAGCGC ACGGCGGTGC GGGCGGTGCG CGAGGAGGCC GCGAGTGGTG GTGCGAGAGG CGGGACGGAC GCGCGGTTGT CGTTTTTGTA CGAAGAGGGG GTGAACGCGA ACGACGCGGG GGCGGTGACG AAGTGCAACG ACGTGTTGCG GCGATGCGCG AGCATGGAGG AAGTGCTCGG TTTGGTGAAG GAAATGCGAG ACGCGGGGTT GGAGCCCGTG GAGAGCACGT ACGTAGCGGT GATGCTCGTG TGTCAAAACG TTGGCTCGCC CGAACGTGCG GTGCAGGTGT ACGACGCGAT GACGGAGGCG CAAGTGCGCA TGACGGGACG TACGTTTCAT CTGGCCATCG AGTTGGCGCT CAAGGCTGAG ATGTTCAAGG ACGCGCTTCG TATCAAAGAT GATATGCGCT ATGTCGGGTT GCCGGTGAGC TCGAAGCTTT ACGTGACGCT TCTGCGCGCG TTGGCGGACA ACGACATGGG AAAGCGAAAG GGGCCACAGG AGCGACTCAT TCGCACGTGT CGTCTTTTTG AAGAGATGCT CAACGAAGAT GTCGACCCAG CGCCGGCGGC GTATCACACT CTCATCGTTG CCGCACATCG CGCGAATCAG CACGACCTGG CGGTGCGCAC TTTCGATGAA CTCATCGAAG AGGGGATCAC GCCGGCGAGA CAAACGTACG AGACTGCCCT CGATTCCATG GCACGAAGCG GACTTCTCGC GCAAGCTTTG GAAGTTTTTG TGCAGATGAA GTCGAACGGA CTCATGCCAC GAAAGGTGAC GTACAACACA CTTTTGAGTG CTTGCGTGAA CGCACCGCAA CCGCGCGTGG AGGCGGCGTT TGAAATTTTT GAAGAAATGC AAACGAAAGG CAACGTGACG CCGGACAGGA GCACGTACTC ACTACTCATC GACGCCGCGT GTAAAGCGGG TAAACCCGAG ATGGCGTTTG ACGCGTTCAG CCACATGCGG GATTCTGGTA TTGAAATCCA AGTCGGCACG TTAAATCGAT TGATTCACGC TGCGGGATTG AACGCCAAGG AGGACTCTAC ATCGGTACAA GCAACTCTCG AGCTTTATGA CGCGATGAAA AAGCTTGGCG TTGAACCAGA CGTGATCACG TATGGTAGTT TGATCGCTAC ATGCGCTAAG GCGCGCGATG CGGCAACTGC GATAAAGGTG TATGAGGAAA TGCGCGCCGC GGGCGTCGAG CCAAACCGCA TCTTGTTCAA CGTCTTGATC AACGCCCTCG GGCGTGCGAA CCGTAGCGAA GAGGCGATGG AGTACTTTCG CGTGATGCAG AAGCAATCCG AAGTGAATAG TTCGTTAACT CCGAATCGGG AGACGTACAC GACGGTTTTC GATGCGTTCA TCGGCGGCGG CGGCGCAGAG CTCGCCATGG CGAAACAATC GCTCGAATCA GGCGACGACG CATCCTTCGT GCACAGCGCC CAGGTTGCGA AGTTGCGCGA AATCTACGCT CAGGGCGTTG AACACGGTGT GTACGAAGAC TTGTCAGTCG TACTGTCGAC GCGCGAAGCG GAGTCTGAGT CTTGTACGAT AAACATGAAC CAGCTGTCGC GCACCGAAGC TACGGTGGCG ACGCTCGTCC TGCTCGAACG CATGTCCACG CTTTCGAGCG ATCAAGTACC ATCCGCCATG TTCATCTACG CGGGTAAGGT AAAGCCGGGT AAAAACGGTG CCCAGCGTCG ATTGTTGGCG ATCGAAACTG TCCTTCGTGC GGCTAACGTC AAGTTTGAAA TCGGCGAAGT CGGTTCCGCC GAACTCATCG CCGTGAAGGG CAAGCACTGT CGCGCGTGGA TCGAGAAAAA CGCCGGCACG TTTGTGTAAA TACTCTAGAC CTTAGTCATT CTTTTAACCA TCAGATGTGC TATTTGCT
|
Protein sequence | MDDDAFERLD KQKASRREKR TAVRAVREEA ASGGARGGTD ARLSFLYEEG VNANDAGAVT KCNDVLRRCA SMEEVLGLVK EMRDAGLEPV ESTYVAVMLV CQNVGSPERA VQVYDAMTEA QVRMTGRTFH LAIELALKAE MFKDALRIKD DMRYVGLPVS SKLYVTLLRA LADNDMGKRK GPQERLIRTC RLFEEMLNED VDPAPAAYHT LIVAAHRANQ HDLAVRTFDE LIEEGITPAR QTYETALDSM ARSGLLAQAL EVFVQMKSNG LMPRKVTYNT LLSACVNAPQ PRVEAAFEIF EEMQTKGNVT PDRSTYSLLI DAACKAGKPE MAFDAFSHMR DSGIEIQVGT LNRLIHAAGL NAKEDSTSVQ ATLELYDAMK KLGVEPDVIT YGSLIATCAK ARDAATAIKV YEEMRAAGVE PNRILFNVLI NALGRANRSE EAMEYFRVMQ KQSEVNSSLT PNRETYTTVF DAFIGGGGAE LAMAKQSLES GDDASFVHSA QVAKLREIYA QGVEHGVYED LSVVLSTREA ESESCTINMN QLSRTEATVA TLVLLERMST LSSDQVPSAM FIYAGKVKPG KNGAQRRLLA IETVLRAANV KFEIGEVGSA ELIAVKGKHC RAWIEKNAGT FV
|
| |