Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_25292 |
Symbol | PPR6 |
ID | 5004813 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | + |
Start bp | 68543 |
End bp | 71685 |
Gene Length | 3143 bp |
Protein Length | 775 aa |
Translation table | |
GC content | 59% |
IMG OID | 640420234 |
Product | pentatrichopeptide repeat (PPR) protein |
Protein accession | XP_001420577 |
Protein GI | 145352494 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00756] pentatricopeptide repeat domain (PPR motif) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.025628 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAAGG TATTGGCGTG GACACCGGGG CGCAAGAATG ATGCGGAGAG ACTCTGGGGG GAGATTTTTG AACGCGCGCG GGGCGACGCA AAGAGGCGAT GCGAGGCGCA CTGCGCGTTG TTAGAAGCGC GGTGTTTGCG ACCGACGGCG GTGCATGAAT TCTTACGAGC GTACGATGCA TTTCGAGACG AATTCCGCAA CGACGAGGGA TGGAAGAGTC GATGGATGAC GCCCATGGTG CAACGCGCGT ACACGGCGGC GTGTCGGCTG TGCAACGACT CGCACGACGA GCGACTGGCG AGGCGAGTGA CGTACGATTT ATTACGTGAC CACAAGCAAA TGAAAGCGGC GAATAATTTC GCACTTGCGA CGGAGTATTG CGTGAGTTCG CTACTTTCTG CAGCAACGCC GAGCTCGGCG CACGCGGCGG CTGAACTGTT TGAGGTTGCC CTCGAAGACG GCGTGAAATT CGGCGGCACC GCTTGGTCGT ACGCGGTAAG CATGTACGCG GGAATGAACA AGGCGGACGA GGCAATCGCG CTATTGGACC GTTTAGAAAC TTTGGATTTC ATGCAATCCG GTTCTCTCGG CGCTTTGGCT TTCGCCGCCG CGTTTTCTGG CAAGAAGTCT GCCTCGTCCT CGGATGCGGT GATGAAATTG GAGCGCCGCG CAAACGATTT GAAGATGGTC GAAGCGCGCA ACGCGAAGGA GCGCGCTAAA GTGGAACGCG CTTACGCGCA GGTGATGCAT ATGCTAAACA AGCAAGAGCG ATTTAAGCTC GCACTTCGCA TGTTCACTCG CATGCAATCC TCAGGCGTTC GCCCGCTCGG CGCGCAGACG TACATCAACT TGTACGACGC GCTGAGTGAA ACACCCGCAG ATGACGCACT TTCTAAGAGT AAACGCAGAG ACAATTTGCA TCGCGTCATG CAAACGGTAA AAACACACTT GAAGAGCTTA GAGGCGCTCA TGGTTGGTAT TGAATTGGCG TCGCATGCCG GCGTCGCAGA CGTCTGCGAG GATATTCAAG ACCGTCTACG TTGGAACGGG CACTTACAAA GCGACAGCGT TCGATCGAGG GTGTGGCAGT GTATGATGAT CGTGCGATAC AAAACATACG ACAGACAAGG AACAATGCAG CTCTACGAAG AGTGGCTGCG TTCGGGTGAG GAAAAAATCG CCGTCGGTGA TGACTTTTGG TTTTATCTCA TCAAGTCGTA TTGCGACGAT CCCGTCAACG TGGACGTCGC GGCGCTCCTG CTAGAGGACG CCATTCGTGA GAACGCCATG GTGAGGCGTA AAACGGTCCC AGTACGGACG TTCAATATTG TTATTCAAGC GTGCGCGCAA GATCGACGCC CAGCGCTCGC GCTGACCGTC ATCGATCGTA TGCGCGCCGC CGGCGTCAAT CCCACAGACG TGACGTATTT AGCCGCCCTA CGCGCGTGCG CGTCGGCGGC AAAAATAGAC CGTCGACGGG ATGAAAAGGC TGAAAATGAC TCCGAAGAGT CGCTCGTGAG GGTAGCGCGG ACGTTGATTG TGACCGCGTG CACAGAAGGA ATTCAGCCGA CGAGCAAGAT GTGGAGCGCG GCGCTACACG TCTGCGCTCT GTGCGGCGAT GTCGCCGCGG CGACGGAAAT CTTTGCGGAC ATGCGCGCTA GTGGATCTGC GCCAAACGTG CACAGTTGTA CCGCGCTTAT GCGTGCGTAC GCTAGCGTGC GAGACTTGAG CGGTGCGATT GACGCGTACT GGACCATGAG AAACGAATTC GACGTCGCGC CGGATTCAAG CACGCTACAA ACGATGCTCG AAGTCGTCCG CGCCGCCGGG CACAACAAAT CCCGAGGAAC TAGTGACGTC ATATCGCGAG TGCAAGAAGT GTACGCCGAC ATGCGCGCGC TCAACGTCCG TCCGAACAAC GCCGTCTTAT CCCTGCTCAC GGAATCCGTC GTGGAGGACG TCTTGGCGCT TGGCTCGCCG AATTCAGCCA TCGACGTCAA GCGTCTCACC TTGGCCATCG ATGGCTGCGT CAGCGTGGAA TCCGAGATCA CGCAAACTTG GGACGATATC GCATCGATGA ATCTCAAGGA GTACTCCGTC GCCGAGGCAC GGGTCGCGTT TTTGGGTCTA TTGCAGACGC TTCGCGATCG CCGCGCCGCT CGATTGCGCG TCGGCGACGT CCGCGTCGGT GTTGGTCGCG GTAAAATTCG TGATTCCATC GAAACCTTGG CCAGGGACGT AAGTTTGATG TTGTCTTACG CCGAAGACGA CGATGGTGTC GTGGTGTTGA CGTCCGCGGC GATCGAGCAC TGGTTCGCCA AGCCGTAGCG TCGAATCGGC GCCGATTCGA CGACGTCGAG ACACACCTCG ACGCGCCATG TCGCGAAGAT TTCTTCGTCC CGCGCCCGCG CTCGTCCTGG CGCTCGTCTT AGCGACAATG ATCGACGGCA CGCACGCCGT GAAACGGGCG TACGCGACGA AACCGCGTCG GGAGGACAAC GAAACGTTCA ACGTTTTCGA CGCGAGCGCG CGGCGAGTCG CGCTCCCGAG TGAACTCTCG CGCGCTGTCA CGCGCGTGAC TCGCGTCGTC GTCCCTGGAG ACGTGCACGT GCGCGCGCCA AATGCGCTCG ATCGCGTGTT TCGCGCGTAC GCGCTCGCGA CGAACGAAAC GATGGTTCGT CTGGAAAAGC GGCGGCGGAG GCAGAGGCAG CCGATCGGCG ACGGCGACTC CGCAAAGACG ATCAATCTTC TCGTGCGGAA ATTCGCGAGA ACCAACGATG CGCGACGCGC TAGTTGGCGC GTCGTCGATG CCATCGTCGG TGAGGGCGAA AGCGCGTGGG TCGTGTTCGA GACGTGCGAC GATGATTGGA TCGACGACGA CGAATCGATC GGTATCGGTT TTGACTATCG CCTGCGCGTC GATTTCGTCA ACTCGAGCGC GGCGACGACG GGTGATATCG CGGCGTTCAT CGCCACGACA GATCCCGCGC GCACCGGAGG CAAGTCTCGA CGGTGTGCTC GGAGACGTGC GTACGGGTGA CGCGTCGACG CGCGAATTCC GCCGCGGGGT TCGTTCGCGC GCGCGCTCGC ATTCATCTAT TCATTCATTC ATTCATTCAT TATTCATTCG CCCGCGCGCT CGC
|
Protein sequence | MAKVLAWTPG RKNDAERLWG EIFERARGDA KRRCEAHCAL LEARCLRPTA VHEFLRAYDA FRDEFRNDEG WKSRWMTPMV QRAYTAACRL CNDSHDERLA RRVTYDLLRD HKQMKAANNF ALATEYCVSS LLSAATPSSA HAAAELFEVA LEDGVKFGGT AWSYAVSMYA GMNKADEAIA LLDRLETLDF MQSGSLGALA FAAAFSGKKS ASSSDAVMKL ERRANDLKMV EARNAKERAK VERAYAQVMH MLNKQERFKL ALRMFTRMQS SGVRPLGAQT YINLYDALSE TPADDALSKS KRRDNLHRVM QTVKTHLKSL EALMVGIELA SHAGVADVCE DIQDRLRWNG HLQSDSVRSR VWQCMMIVRY KTYDRQGTMQ LYEEWLRSGE EKIAVGDDFW FYLIKSYCDD PVNVDVAALL LEDAIRENAM VRRKTVPVRT FNIVIQACAQ DRRPALALTV IDRMRAAGVN PTDVTYLAAL RACASAAKID RRRDEKAEND SEESLVRVAR TLIVTACTEG IQPTSKMWSA ALHVCALCGD VAAATEIFAD MRASGSAPNV HSCTALMRAY ASVRDLSGAI DAYWTMRNEF DVAPDSSTLQ TMLEVVRAAG HNKSRGTSDV ISRVQEVYAD MRALNVRPNN AVLSLLTESV VEDVLALGSP NSAIDVKRLT LAIDGCVSVE SEITQTWDDI ASMNLKEYSV AEARVAFLGL LQTLRDRRAA RLRVGDVRVG VGRGKIRDSI ETLARDVSLM LSYAEDDDGV VVLTSAAIEH WFAKP
|
| |