Gene Information |
Locus tag | OSTLU_35581 |
Symbol | |
ID | 5002719 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | + |
Start bp | 424056 |
End bp | 425216 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | |
GC content | 65% |
IMG OID | 640418140 |
Product | predicted protein |
Protein accession | XP_001418699 |
Protein GI | 145348528 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
| |
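The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS includes its stop codon (both assumptions agree with the reported 1161 bp and 386 aa, but neither is stated in the record). The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Start bp and End bp copied from the record above.
length_bp = gene_length(424056, 425216)
length_aa = protein_length(length_bp)

print(length_bp)  # 1161
print(length_aa)  # 386
```

Both results match the "Gene Length" and "Protein Length" fields, which suggests the record uses inclusive coordinates and counts the stop codon as part of the gene.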
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0000902697 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.474336 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGCG CGACGCCGGC GCCGGCGCCG GCGCCGGCGA CGGCGCGCGC GCGCGCGGAG GCGCGCGGCG ACGCCGGTCG GCGGTCGGTC GCGGCGCGCG CGCGACGCGC GCGGGCGCGC GCGCCGGTCG GCGCGGTGGT GGACGACGCG ACGACGCGCG CGCGCGGCGC GCGCGGCGTC GACGGCGCGC GGCGAACGCG AGACGCGCGC GACTATTGGT TTCCGGTGTG CTTCTCGGGG AATCTGCGAG ATAAGGACGC GCTGGTGGCG TTTGATTTGT TCAACGTGCC GTGGGTGCTG TTTCGGGGGC GCGACGGCGA GGTGGGGTGC GTGAAGGATG AGTGCTGTCA TCGCGCGTGC CCGCTGTCGC TCGGGAAGGT GGTGGACGGA CGGGTGCAGT GCCCGTATCA CGGGTGGGAG TACGAGACGA ACGGCGAGTG CGCGAAAATG CCTTCGTGTA GTTTTTTGAA GAACGTCTTC GCGGACGAGC TCAGGGTGAT CGAGAGGGAT GGGATGATAT TCGTTTGGGC CGGAGAAAGC GATCCCGCGG ACTTTGTTGG ACCGGAGGCG GCGTGCGAGT CTTGGGACGA GGACGTGTAC GAAGCCAACG AACCGGGGAT GTTCACGACG GGCGAGGGAT TCGTGACGAT GGCGGAAGTC ATCGCGGACG TGAAACTTGA CAGCGATGTC GTGGTGGAAC GGTTGTTAGA CATCACCGAG CGCGCGCGGC GGGAACCGGT GTCGGTGAAG AATCGCGGTC GAGGCGCACT CTTTCCGGTG GACGGCACGC GGTTGATTTC AAAAGTCTTG CGGATCGGTT ACGACGCGGT ACCGCAGAGC GTGGTTTTCA AACCGTCGTG CGTGATCGCG AGCACGATCG CGCTGAGACC GCGCGTAGGG GGTGGAGATG GGACTTCGAT GCAAGTAGAG CAGTTGCACG TGTGCTTGCC CGCCAAGCCG GGGCTCACGC GCGTCCTGTT CCGCATGGCT TTCGATTTCG TCCCGGAGGG TGCGCAAAAC GCGGCGGGTG ACGTGTGGAA GAACTTAGCA ATGCAGGTCC TACAAGAAGA GCTCGAAGAC GTGCGAAGCG CGGGGCTGAA ATCGGAAACG ACATCCATAG CGGTTGAATC GTTTCGCGTC TTCAAGAGAG GAGACAAGTA G
|
Protein sequence | MARATPAPAP APATARARAE ARGDAGRRSV AARARRARAR APVGAVVDDA TTRARGARGV DGARRTRDAR DYWFPVCFSG NLRDKDALVA FDLFNVPWVL FRGRDGEVGC VKDECCHRAC PLSLGKVVDG RVQCPYHGWE YETNGECAKM PSCSFLKNVF ADELRVIERD GMIFVWAGES DPADFVGPEA ACESWDEDVY EANEPGMFTT GEGFVTMAEV IADVKLDSDV VVERLLDITE RARREPVSVK NRGRGALFPV DGTRLISKVL RIGYDAVPQS VVFKPSCVIA STIALRPRVG GGDGTSMQVE QLHVCLPAKP GLTRVLFRMA FDFVPEGAQN AAGDVWKNLA MQVLQEELED VRSAGLKSET TSIAVESFRV FKRGDK
|
| |
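The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute GC content; the helper names are hypothetical, and the example runs only on the first three blocks of the gene sequence, not on the full gene. Applying `gc_percent` to the complete cleaned sequence is one way to compare against the 65% reported in the "GC content" field.

```python
def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three 10-base blocks of the gene sequence shown above.
head = clean_sequence("ATGGCGCGCG CGACGCCGGC GCCGGCGCCG")

print(len(head))         # 30
print(gc_percent(head))  # 90.0
```

The 90% figure applies only to this 30-base prefix, which is more GC-rich than the gene as a whole.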