Gene Information |
Locus tag | OSTLU_34089 |
Symbol | |
ID | 5001030 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 548135 |
End bp | 548941 |
Gene Length | 807 bp |
Protein Length | 268 aa |
Translation table | |
GC content | 59% |
IMG OID | 640416451 |
Product | predicted protein |
Protein accession | XP_001416693 |
Protein GI | 145344340 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
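The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon; neither convention is stated in the record, but both are consistent with the listed 807 bp and 268 aa. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 548135
end_bp = 548941


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # 807 268
```

Because the gene is marked as not spliced, the whole span is treated as coding sequence; a spliced gene would need its exon lengths summed instead.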
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGTCG ACGACACCAC GCTCGCGCGT TCTTTCTCCC TGCGCGGTGC CACCGCCGTC GTCACCGGTG GCACGCAAGG CCTCGGAAAA GCCATCGTCG AAGCGCTTTG CCATCACGGC TGCCGCGTAT TCACCTGCGC ACGCACGGCG GGAGACGTCG AGACGTGCGT CGAGGACTGG CGGCGCCGTG GATACGACGT CGATGGGTGT GTTTGCGACG TCAGTGACGC GAACGCGCGA GAAGAACTCG CGCGGAGGGT GTCGGAAAAG TTTTCGGGCG AGTTGAACAT ACTGGTGAGC AACGTGGGCT TTAACATTCG AAAGCCGACG GTGGAATTCA CGAGCGAAGA TTACCAAAGG CTGATGCGGA CGAACTTAGA GGCATCATTC GAGCTTTGCA AGAGGTTTCA CGCGATGTTG AAGGCGAGCG GTGATGGGAG GATCGTATTT AATTCATCCG TCGCCGGCCT CGTGTCTATC CAGAGCGGCG CTCTGTATGC AATCTCGAAG GGTGCCATGA ATCAACTGAC GAAGAGTCTG GCGTGCGAGT GGGCGAAAGA TAACATTCGT GTGAACGCCG TCGCGCCTTG GTACACCAAC ACGCCGCTCG CGAAGCAGGT GCTGAAAAAC CAAGTCTACC TCAAAGCCGT CGTGGACCGA ACGCCGATGG GTCGCGTCGG CGAGCCTCAC GAAGTCGGCG CCGTCGTTGC GTTTCTATGC ATGCCCGCGT CCTCGTACGT CAACGGTGTC ATCGTGCCAA TAGATGGCGG TTTCACCGTC CACGGGTTCA TCCCGCCCAA ATTGTGA
|
Protein sequence | MSVDDTTLAR SFSLRGATAV VTGGTQGLGK AIVEALCHHG CRVFTCARTA GDVETCVEDW RRRGYDVDGC VCDVSDANAR EELARRVSEK FSGELNILVS NVGFNIRKPT VEFTSEDYQR LMRTNLEASF ELCKRFHAML KASGDGRIVF NSSVAGLVSI QSGALYAISK GAMNQLTKSL ACEWAKDNIR VNAVAPWYTN TPLAKQVLKN QVYLKAVVDR TPMGRVGEPH EVGAVVAFLC MPASSYVNGV IVPIDGGFTV HGFIPPKL
|
| |
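The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. The record leaves the translation table blank, so the standard genetic code (NCBI table 1) is assumed here; the helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of any database tool. The example uses only the first 30 and last 12 bases of the gene sequence, copied from the record with its 10-base spacing.

```python
from itertools import product

# Standard genetic code (assumed), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq):
    """Remove the 10-base spacing used in the record and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


head = "ATGTCCGTCG ACGACACCAC GCTCGCGCGT"  # first 30 bases of the gene
tail = "CCCAA ATTGTGA"                      # last 12 bases of the gene
print(translate(head))  # MSVDDTTLAR
print(translate(tail))  # PKL
print(gc_fraction(head))
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and the 59% GC content field.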