Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_4520 |
Symbol | |
ID | 5002078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | + |
Start bp | 86761 |
End bp | 87750 |
Gene Length | 990 bp |
Protein Length | 330 aa |
Translation table | |
GC content | 61% |
IMG OID | 640417499 |
Product | predicted protein |
Protein accession | XP_001417668 |
Protein GI | 145346382 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | [TIGR01289] light-dependent protochlorophyllide reductase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.645511 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.871752 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ACTGAGACGA AGAAAAGGGT CGTGATTACC GGGTCGAACT CGGGGATCGG GCTCGACGCG GCGACGAAAC TCGCGGCGAG CGGGGACTGG GTCGTCGTGT TGGCGTGCCG AACGCGCGCG AAGGCGGAGG CGGCCAAGGC GAATATATTG TCTGCGACGA ACGCGGACGG GGCGAACATC GAGTGCGTCG AGTGCGATTT GTCGAGTTTG GACTCGGTGC GCGCTTTCGT GCGCGAGGTG AGGAAAACGG GCGGTGTCGA CGCTTTGTGT CTGAACGCCG GCGTGGAATA CAGCGGCGAT CCCGTGGTGC ATCGAACGAG GGACGGTTTC GAGGAGACGT TCGGTGTGAA CCATTTGGGG CACTTTTTGC TCGCCAACTT GTTATTGGAG GATCTCGAAA AGTCGAGCGA GGCGCATCCG CGAATCGTCG TGACGGCGAG CGAGGTGCAC GACCCGGCGT CGCCGGGAGG ATCGGTGGGC AGCGGCGCGC ACATCGGCGA CTTGCGAGGC CTCGAACGCG ACGGCGCGGC GTTCGAGATG GCGGACGGTG AAGCGTTCGA CGCCGATAAG GCGTACAAAG ACTCTAAGTT GGCGAACATG CTCTTCATGT ACGAGCTCGA GCGACGCCTG CAGGCGAGAA ACTCGAAAAT CACGGTGAAC GCGTTCGGTC CGGGACTCAT CACGCGCACC GGCTTATTTC GCAACCAAAA TCCTCTCTTC GTCAAAGTCT TCGACTTCGC CACGAACGAG ATTTTCCACG TCGCAGAAAC CGTTTCCGGA GGTGGGAACT GCTTAGTCTT CATGCTCACC GACCCTTCGC TCGAGGGCAG CGGGGGCGTG TACTGGAACA ACGATTTGTC GCCCGGCGCG CCGCCGTCCC TCGTCGCCGC CGGACACAAA TTCGCTCAAA CCAACTCTTC TGTCGAATCA AACGATGCCG TCGAAGCGCA AAAGCTTTGG AAGCTCAGCG AATCGCTCGT CGGGTTGGCC
|
Protein sequence | TETKKRVVIT GSNSGIGLDA ATKLAASGDW VVVLACRTRA KAEAAKANIL SATNADGANI ECVECDLSSL DSVRAFVREV RKTGGVDALC LNAGVEYSGD PVVHRTRDGF EETFGVNHLG HFLLANLLLE DLEKSSEAHP RIVVTASEVH DPASPGGSVG SGAHIGDLRG LERDGAAFEM ADGEAFDADK AYKDSKLANM LFMYELERRL QARNSKITVN AFGPGLITRT GLFRNQNPLF VKVFDFATNE IFHVAETVSG GGNCLVFMLT DPSLEGSGGV YWNNDLSPGA PPSLVAAGHK FAQTNSSVES NDAVEAQKLW KLSESLVGLA
|
| |