Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_38463 |
Symbol | |
ID | 5001869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | + |
Start bp | 462465 |
End bp | 463493 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | |
GC content | 59% |
IMG OID | 640417290 |
Product | predicted protein |
Protein accession | XP_001417779 |
Protein GI | 145346610 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.622688 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.36197 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCTTCG ACGTTTTCGC GCGCGTCCGA TGCCGTCTGG ACATTCTGAA CGACATCCTC CTGCAGCGTT ATCACTGCAA GCACCTCCCG AGCCGCGCCG AGGACGTCAT CGTCGACGAC CTGCGCGGCA AGACGTGCGT TGTGACCGGA CCGACGAGCG GAATCGGCGT CACCACCGCG CGTGCGCTCG TCAAACGCGG CGCGCGCGTC GTTTTGGCGT GTCGCACTCC CTCAAAGGCC GAGGCATTGG TCGAGCGTTG GACGAAGGAG GCGGCGGCGG TCGGGACGGC GCCGCCGGAC TGCGCGGTGA TGGCGTTGGA TTTGGACTCG CTCGCGAGCG TCGAAGCGTT CGCGAAGGCG TTTCAACAAC GTGAGAAACG GTTGGACGTG TTGATCAATA ATGCCGGGAT TTTCGATATG TCTGGAGCGT ACGTCAGGAC GAGCGATGGG CGGGAGCAGC ATTTACAGGC GAATTTCCTG GCACCGGCGT TGCTGACGAT GACGCTGTTG AACGCTCTGC GAAAGACTGG GGCGGAGACT GGGGATGCGA GGGTGGTTTT CGTGAGTAGT AAGCTGCACG AATTGTGCAC GGGGTTAAAT TTGAGTGATA TGGACTTTAA ACGCTCGTCG TACAGCTCGC AAGCGGCGTA TGCGTCCAGC AAGCTCGCGG AGGTGCTATT CGTAAAGGCG CTCGACGCGC GTCTGCGCGC AAAGGCGCCG GGCACGCGCG CGCTGGTGTT GCACCCTGGA AACATCGTCA CTGGCGTCGT TCGCACGCTT CCGATGTGGT TACAGTCTTT GTATAAGATA TTATTCGAGC GAATTTTGCT CACTGCTGAC CAAGGCGCGC GATGTTCGCT GTACTGTGCG ACAAGAACCG AGGCGGTGGC GAGCGCCACC ATCGGGCCGT ATTTTACATC CGAGTGCGAG GAGCGTACGC CGAGCAAGTA TGCGCTCGTC GAAGGCGAGG CAGAGAAGCT GTGGCGATAC ACACTGAAGG AATTAGGCTT AGAAGATGAT ATCATCTAG
|
Protein sequence | MLFDVFARVR CRLDILNDIL LQRYHCKHLP SRAEDVIVDD LRGKTCVVTG PTSGIGVTTA RALVKRGARV VLACRTPSKA EALVERWTKE AAAVGTAPPD CAVMALDLDS LASVEAFAKA FQQREKRLDV LINNAGIFDM SGAYVRTSDG REQHLQANFL APALLTMTLL NALRKTGAET GDARVVFVSS KLHELCTGLN LSDMDFKRSS YSSQAAYASS KLAEVLFVKA LDARLRAKAP GTRALVLHPG NIVTGVVRTL PMWLQSLYKI LFERILLTAD QGARCSLYCA TRTEAVASAT IGPYFTSECE ERTPSKYALV EGEAEKLWRY TLKELGLEDD II
|
| |