Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_89854 |
Symbol | |
ID | 5006889 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009375 |
Strand | - |
Start bp | 170659 |
End bp | 171750 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | |
GC content | 60% |
IMG OID | 640422310 |
Product | predicted protein |
Protein accession | XP_001422911 |
Protein GI | 145357408 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 49 |
Plasmid unclonability p-value | 0.264825 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0060203 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTCCCGAG CGAAGTGGAT AAAGTCGTCA CTCCGCCGCG ACGTCGCGTC GCGCGCGCGA TCGCGCCCGA CCGTCGTCGA CGTCGTCCAA CGAAGCGCCA TGGGCATTCG CGCCGCGTGC GCGGTGGACA TGCGCGGTAA GATCGCCGTG GTCACCGGCG CGAACACCGG GATCGGGTTG CAGACGGCCA GGTTGCTCGC GGACGCCGGC GCGCGCGTCG TCATGGCGTG TCGCTCGATC GACAGAGCGA GGGCGGCGCT CGAGTACGCG TCAAACGGGG GCGCGAACGA CGTGGCGGTG ATGGCGCTCG ATCTGAGCGA CGCCGCGAGC GTCAGGGCGT TCGCGGAAAA GTTTGGGAAG GAGTATGAAA AATTGGACGT CTTGGTGAAT AACGCGGGAT TGAACGGGGC GAGCGGATAC AGTGGACCGA AGACGACGAA ACAAGGGTAC GACATATGCA TGGGGGTGAA TTATCTCGGA CACTTCATGC TCACGTCGTT GTTGTTGCCG CAGTTGATGA AGAGCGACGG CGCGAGGGTG GTGGCGTTGA GTTCGGTGAC GACGTGGTTC GGGTCAAACA AGTATCAGTA CTACTACAAG GGTGCGAGTA AGACGAAGGG GAACTACGGG TCGAGCAAGT TGGCGTGTTT GGCGATGACG GTGGAGCTGC AGCGACGATT GGATGCGGCG TATCCGGATA ACAAGATCGT CTGCGCCGCC GCAGATCCGG GCTTCGTGGC GAGCAACATT TGGAGAGATT ACAACCCGGT TTTGCGAAAG ATCATGAGCA TCTTGGCCTT GACGCCCGCG CAAGGAGCGA TGACGAGCGT GAACGCGGCG TCGCTGCCGT CCATAACAAA GGCGACGCTT TACATGCCGT TCAAGATTAA AATGTCCAAG CTGTTCAAGA TGCACAAGAG CGCGACGTAC ATGTTCGGCA TGCCTCTCTT ATCCAAAGCG TTCGCCGGTT TCGGCGCCGA TGCCATGGCG CCGCGCGCGA AAAACGCCGA GTCTAACGCG AAGCTTTGGG ATTTGAGCAT CGAGTTTTGC AAGGAGAACG GCGTCAAGAG CTGCGAGGCG TACAATTTGT AA
|
Protein sequence | VSRAKWIKSS LRRDVASRAR SRPTVVDVVQ RSAMGIRAAC AVDMRGKIAV VTGANTGIGL QTARLLADAG ARVVMACRSI DRARAALEYA SNGGANDVAV MALDLSDAAS VRAFAEKFGK EYEKLDVLVN NAGLNGASGY SGPKTTKQGY DICMGVNYLG HFMLTSLLLP QLMKSDGARV VALSSVTTWF GSNKYQYYYK GASKTKGNYG SSKLACLAMT VELQRRLDAA YPDNKIVCAA ADPGFVASNI WRDYNPVLRK IMSILALTPA QGAMTSVNAA SLPSITKATL YMPFKIKMSK LFKMHKSATY MFGMPLLSKA FAGFGADAMA PRAKNAESNA KLWDLSIEFC KENGVKSCEA YNL
|
| |