Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_30981 |
Symbol | |
ID | 5001144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | - |
Start bp | 86899 |
End bp | 88445 |
Gene Length | 1547 bp |
Protein Length | 439 aa |
Translation table | |
GC content | 59% |
IMG OID | 640416565 |
Product | predicted protein |
Protein accession | XP_001417423 |
Protein GI | 145345872 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00756] pentatricopeptide repeat domain (PPR motif) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.116585 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCGCGCGCG AGGGAGGGAG GACGAGGGAA GGCGACGACG GAGGTGGATC TGAGGGTGAC GAAGTCGTCG CACGGGGCGC TGCAGAGGCA CGTGAACGCG AGTAAGAGGG AACCGACGCA CGCGCAAAAA AGGGAGACGA GAGGCGGTGG ACGCGGGGAT GGCGCGGCGA GTGGAAGCGG GAGGCCGCCG TTGCGCGCGG GGGAACGGAT CGTGCGGGCG ATGGAATCTA TCGACGCGCC GGCGCAGGGC GAACGCATCG CGTGGGACGT CGTCGCGAGC GCGGTGAATC CCGAGGGTGG GAAGTTTAAA TTCACCACGT CGACGCTTAA CTTTGCCATT AAGGAGCTCG GTGAGCGTGG GAGCTTCGAT CGCGCGCACG CGCTGTATTT GTGGATGTCG CGCAAGCAAG GGCGGTACGC GCCGAATGAG TTTACGTACG TGTCGCTCGC CAGCGCGGCG AAGACGCTGA CGCAAACGCG AACGGTGCAA AACTTGTGGC AGCGCGGTAT CGCGGAGAAT GATGAGAGCT TGATTTGTAA CGAGATCGCG TCTGCGGTGA TCGCCGCACT CAATCGCGTG AGTGATTGGA GCGGGGCGTA TCAAGTGTTT CGGGACATGG GCGACAAGGG CAAACCGAGA AATTTGTACA CTTACACAGC CGTGCTCACG GCGTTGAGAG ATGAGGCGAA GCCGGACGAG GCGTTGGCCG TACTCAACGA GATGGCGCGT GAGCCTGGAG TTCAACCGAC GAGCTTAGCG TTTTCGTTGA CGCTGACGGC GTTCGATAAT TGTCGACGCT GGATCGAAGG TAACGCGGTC GCAAAACGTA TTAAAAAGTA CGACGTGCGT CCGGACGCGA CGTTGATGCA TGCCATCATC ACCATGGCAG GACGTGCCGG CGACATGGCG CACGCGAACG AAGTTTTCGA CGCTATGCGC AACTCAACGA TGATCGTCAC GACGTACACC TTCAACGCGC TCTTGGGCGG ATACGCGAGG TACGGGGACT GGGAAGGATG CACTGAAGTG TATGACGAGA TGAAACGCTC GAAAATCCAG CCAGACTCGT ACACTTTCAC GCAGCTCATA TCTGCTGCCG AGCGAAGCGG AGAGTACATC GCGGCGGACG GCGTTTGGAC GGAGATGTTG CGCAACCGAA TCATCCCGCA CACCGTCATG TGCGGAGCGT ACATTCACTG CTTGGGATGC CAAGGTCGAG ACCTCGAAGC GGAGGCCGTG ATGGAGAAAA TGCGCAACTA TTGGGATGTT CCACGGAACG CTGCGGTGTA CAATGCTCTC ATCGGGGCGC ACGTGCGAAG CGGCGAAGTC ACGCGAGGAC TCAGCGTGCT CGACGATATG CAACGCATCG ATGGACTTAT GCCCACCGAA ATCACATTCG CCGTGCTTAT TCGCGCATGC CAGGAAAGTG CGCTACACAA ACGCGCGGAA GGTTTAGAAG GCATGCGCGC CTCGCTCGCG AACGCTGGAC AACTCATTCA AGATTTGTCT GGCGCTTCAA CCTCGACGGC GAAAGCATAA CCATTGT
|
Protein sequence | MESIDAPAQG ERIAWDVVAS AVNPEGGKFK FTTSTLNFAI KELGERGSFD RAHALYLWMS RKQGRYAPNE FTYVSLASAA KTLTQTRTVQ NLWQRGIAEN DESLICNEIA SAVIAALNRV SDWSGAYQVF RDMGDKGKPR NLYTYTAVLT ALRDEAKPDE ALAVLNEMAR EPGVQPTSLA FSLTLTAFDN CRRWIEGNAV AKRIKKYDVR PDATLMHAII TMAGRAGDMA HANEVFDAMR NSTMIVTTYT FNALLGGYAR YGDWEGCTEV YDEMKRSKIQ PDSYTFTQLI SAAERSGEYI AADGVWTEML RNRIIPHTVM CGAYIHCLGC QGRDLEAEAV MEKMRNYWDV PRNAAVYNAL IGAHVRSGEV TRGLSVLDDM QRIDGLMPTE ITFAVLIRAC QESALHKRAE GLEGMRASLA NAGQLIQDLS GASTSTAKA
|
| |