Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_813 |
Symbol | |
ID | 5001290 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | + |
Start bp | 598332 |
End bp | 600488 |
Gene Length | 2157 bp |
Protein Length | 719 aa |
Translation table | |
GC content | 55% |
IMG OID | 640416711 |
Product | predicted protein |
Protein accession | XP_001417294 |
Protein GI | 145345602 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0197823 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGATGGAGCT TGAGTGAAAT TGTCGAGGTA CATCACATGC GATATCGCCT TCAGCACAAG GCTTTCGAGA TGCATTGCGC GGATCACACC AGTGCCTTTT TTGCGTTCGA TACCAAGAAA ACGGCGCGAT ACGCAGCTAC GCGCGTGGCA TCATCGGCGG GGGCGACTTT GATGAACAGA CGTGCGAAAA CTGAAGCAGC CGAACGCGCG AAGGAGCTCT GGCGCAGACA AAAACTGAGC ACTTTCGATT ATCTTATGGC CTTGAACGTC TTCGCCGGGC GAACGTTGCA CGACTTGAGT CAATATCCCG TTTTCCCTTG GGTGTTGAAA GAATACGAAG CTGAAACAAT AGATCTGGCC GACCCTTCGG TGTATCGAGA TCTGAGAAAA CCCGTTGGCG CGCTCAACGA AGAGCGTCTC AAGAACTTCG TTGAGAGATA CAAGTCGCTG TTGGACGACC CTGACACGCC GCCTTTCCAT TACGGCAGTC ATTACTCATC GTCCCCGATC GTACTTTTCT TCTTGTTACG CCTGGAACCT TACACGAAAC TCGCTCGAGC ATTACAGGGC GACAGATTTG ATCGAGCAGA TCGCTTATTC CATTCGGTCG CGGAAACGTT CAAAGCTTGC GTAGAATCAT CCGCTGATGT CAAGGAACTT ATTCCAGAAT TTTACTACTC GTCCGAGTTC TTGACGAATA CAAATGGACT AAAACTTGGG GTGCGACAGG ACGGATCGAC TATTGATGAT GTAGTTCTCC CGCCTTGGGC CAAGGGTTCG AGGCACGAAT TCACTCGAGT GATGCGGGAA TCGCTAGAAT CGGACTACGT AAGCGAAAAT CTGCACCACT GGGTTGATTT GATTTTCGGG CACGCACAGC GAGGGGCCGC CGCGGTCGAG CGATGCAACG TGTTTTATTA TCTCACATAC GAAGGCGCCG TGGATATCGA CACATTAGAA GATGATGATC AACGGAACGC GATCGAGACA CAAATCATCA ACTTCGGTCA GACTCCCGCG CAAATTTTCA GGCGCGCGCA CCCAGTTCGT CTGCCGCCCC AAGCGACAGA ACACGTGGTT TCAATCTCTC CCGAATCGCT CAAACTCGCG ACAGTTGTCT CATCCGAGTC TAACGGCCTA CCGATGCGGG CGCAAGCGGT GGTGCACGCC ACCGCGTACG ATTCGCGTAT TGCCGTGGTC ACCGCTGGAC GAATGGTTTC CATTCAGCGC TTACAACGTC CTGGAACGAC ATTCGGGCTG GGTGGCGTCG ATCACTCAAC CGCGTACGCG CTCGAACCAG AGACGACATC CCGGCTCATG CTCGAAATTG ACGTCGACGC CGATTCGCTT GCGCATTCGC AATCCGTCAA TGTCGCTCTG AAAGGCAAAG TCTTGCTGAG TGTTGGCCAT TGGGACAGAA GCATGCGTAT TTTCGACATT GAGGAGGGGC GCGAGATGCA GCGTATCAGT GCACATCGAG ATGTGACGAC GTGTCTCGCG TTGTGCGAAC TCGGGAGCTC GAGGAGCTGG GACGAGGCTT CGCACCAAAT GGACCAAGTC ATCGTGGTTA CGGGAAGTCG CGACACCACG CTCGCGATCT GGGAGATGGT GCTCCCTCAA GGAGGCTGGG GCTTTAGTAA AGGCACCAAA GTCCTGAGCG CCGAACCAAA AATGATCTGT TTCGGTCATG ACGAAGCCAT CACGTGCGTT GCTGTGAATT CATCGCTCAA CTTGGTAGCG AGTGGCAGCA TTGATGGCAC CCTCATCCTG CACGACAGCC GGGATGGGCA CATCGTTCGT GCATTGGAAA GCACCCCACC TGGGTGCATC CCATCGTCCA TCGAATTGCT TCCAAAATCA TCTCTCGTCG TGTGCGCGTG CGGCGTCGCC GGCGCGCTGT CCGTGCATGA TGTCAACGGT GCCACACTCG CCAAATCGCT CAGTCGGCAC GAAGCCTTTG ACGCATTTTG CGTCACCCGC GACGAGCGAC ATATTCTCAT CGGCAATCGT CGCGGAGACA TCACTGTCCG CGCCGTGCAT GATTTATCGA TTCGTGCGCA AATCAACGTC GCCAACGCGG GCGTCGTCTC CATTTCACCC GTCGCGCGCG ACGAGTGCCT CGTCGTCGGT CTTGCCGACG GTCGTGTGTG CCTCTGG
|
Protein sequence | RWSLSEIVEV HHMRYRLQHK AFEMHCADHT SAFFAFDTKK TARYAATRVA SSAGATLMNR RAKTEAAERA KELWRRQKLS TFDYLMALNV FAGRTLHDLS QYPVFPWVLK EYEAETIDLA DPSVYRDLRK PVGALNEERL KNFVERYKSL LDDPDTPPFH YGSHYSSSPI VLFFLLRLEP YTKLARALQG DRFDRADRLF HSVAETFKAC VESSADVKEL IPEFYYSSEF LTNTNGLKLG VRQDGSTIDD VVLPPWAKGS RHEFTRVMRE SLESDYVSEN LHHWVDLIFG HAQRGAAAVE RCNVFYYLTY EGAVDIDTLE DDDQRNAIET QIINFGQTPA QIFRRAHPVR LPPQATEHVV SISPESLKLA TVVSSESNGL PMRAQAVVHA TAYDSRIAVV TAGRMVSIQR LQRPGTTFGL GGVDHSTAYA LEPETTSRLM LEIDVDADSL AHSQSVNVAL KGKVLLSVGH WDRSMRIFDI EEGREMQRIS AHRDVTTCLA LCELGSSRSW DEASHQMDQV IVVTGSRDTT LAIWEMVLPQ GGWGFSKGTK VLSAEPKMIC FGHDEAITCV AVNSSLNLVA SGSIDGTLIL HDSRDGHIVR ALESTPPGCI PSSIELLPKS SLVVCACGVA GALSVHDVNG ATLAKSLSRH EAFDAFCVTR DERHILIGNR RGDITVRAVH DLSIRAQINV ANAGVVSISP VARDECLVVG LADGRVCLW
|
| |