Gene Information |
Locus tag | OSTLU_36252 |
Symbol | |
ID | 5000143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | + |
Start bp | 96483 |
End bp | 97445 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | |
GC content | 65% |
IMG OID | 640415564 |
Product | predicted protein |
Protein accession | XP_001416071 |
Protein GI | 145341968 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
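The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS includes its stop codon; the record does not state either convention explicitly. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int, includes_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    # The stop codon (assumed present) does not encode an amino acid.
    return codons - 1 if includes_stop_codon else codons


# Values taken from the record: Start bp 96483, End bp 97445.
length = gene_length_bp(96483, 97445)
print(length)                     # 963
print(protein_length_aa(length))  # 320
```

Under these assumptions the result agrees with the listed Gene Length (963 bp) and Protein Length (320 aa), which fits the gene being unspliced.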
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.140786 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTCGAG CGTCCGCGCG CGTCGCGGCG TCGCGCGATG CCGACGGCGC GTCGCGCAAG ACGTGCGTCG TGACGGGCGC GAACACCGGG ATCGGGCTCG CGACGGTGCG CGCGCTGCGG GCGTCGAACG AGTACTCGAA GATTACGCTC GCGTGTCGAG ACGCGAGCAA GGCGCGACGC GCGATCGACG CGCTCGCGCG CGACGGCGCG GCGTCGACGT GCGCGCTGGT GTTTCGAGAG CTCGATCTGG CGAGCGTCGC CAGCGCGCGG GACTTCGCGG CGTCGTACCT GGACGACGAG GGCGACGACG GGTTGGATTG CCTGGTGAAT AACGCGGGCG TCATGGCGGT GCCGAAACTC GAGCGCACGC GCGATGGATT TGAATTGCAA GTCGGCGTGA ATCACTTGGG ACATCACGCG CTCACGGCGG GGCTGATGCC GGCGCTGGCG AAGAGCGAGG ACGCGCGGGT GATTTGCGTG TCGAGCGAGG CGCACAGAAT CGCGGGGAAA GGCTTGGCGC GGGAGGATTT GTTCGGGGAG AAGAATTACA GCGCGTGGGG ACAGTACGGA CAGTCGAAGC TCGCGAACGT GTTGTTTGCG TTCGAGCTCG CGCGAAGGTG CGAAAGGGCG GGACTCGGGA ACGTCACGGC GTCGGCGTTG CACCCGGGCG CCGTCGACAG CGAGTTGGGT CGATATCTCC AACCGCCGGA TGAAGAAATC AAGTGGTGGC AGACGAAGCT GTACGATTTC ATCAGATTGA ATTTTTTGAA GACGACGGAA CAAGGCGCGG CGACGAGCGT GTTTTTGGCT CGAGAGATCG CGCGGGGAGA GGCGCGGGGG AAGTACTACA GCGACTGCGC GGAGAAGACG CCGGCGAAGA ACTGCTTGGA CGTGGACGAC GCTCGTTGGT TGTGGGATCG CTCCGCCGAG CTCACGGGCG TGGGTTTCGA TTCGCTGCTT TGA
|
Protein sequence | MPRASARVAA SRDADGASRK TCVVTGANTG IGLATVRALR ASNEYSKITL ACRDASKARR AIDALARDGA ASTCALVFRE LDLASVASAR DFAASYLDDE GDDGLDCLVN NAGVMAVPKL ERTRDGFELQ VGVNHLGHHA LTAGLMPALA KSEDARVICV SSEAHRIAGK GLAREDLFGE KNYSAWGQYG QSKLANVLFA FELARRCERA GLGNVTASAL HPGAVDSELG RYLQPPDEEI KWWQTKLYDF IRLNFLKTTE QGAATSVFLA REIARGEARG KYYSDCAEKT PAKNCLDVDD ARWLWDRSAE LTGVGFDSLL
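The sequences above are printed in space-separated blocks of 10 characters. A minimal sketch for working with them follows: it strips the spacing, computes the GC fraction, and translates the gene sequence. Because the Translation table field in this record is empty, the standard genetic code is used here as an assumption. `clean`, `gc_fraction`, and `translate` are hypothetical helper names introduced only for illustration.

```python
# Standard genetic code (assumed), in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Codons containing anything other than A, C, G, T become 'X'.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE.get(s[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGCCTCGAG CGTCCGCGCG CGTCGCGGCG"
print(translate(first_blocks))  # MPRASARVAA
```

Passing the full Gene sequence field to `gc_fraction` and `translate` gives values that can be compared against the GC content and Protein sequence fields of the record.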