Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_12863 |
Symbol | HDS |
ID | 5003594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | + |
Start bp | 137320 |
End bp | 139362 |
Gene Length | 2043 bp |
Protein Length | 680 aa |
Translation table | |
GC content | 58% |
IMG OID | 640419015 |
Product | 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase, putative chloroplast precursor (1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase) (ISPG) |
Protein accession | XP_001419500 |
Protein GI | 145350194 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis |
TIGRFAM ID | [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00611793 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGAAC GAGGCGTGCT GATGCCGACG GGGAAGTACT GCGTGAATCA CGCGGAGACG GTGCGAAGGA AGACGCGCAC GGTGCACGTG GGCAACGTGA AGATTGGCTC GGAGCACCCA ATCGTGAAGC AGACGATGAC GACGAGCGAC ACGAGAGACG TCGAGAAGAC GGTGGCGGAG GTGATTCGAT GCGCCGATGC CGGTGCGGAG ATGGTGCGGA TCACGGTTCA GGGGATGCAA GAGGCGAAGG CGTGCAAGAT CATCAAGGAG ACGCTCGTGG CGAAGGGATA CGATACGCCG CTCATCGCGG ATATCCACTT CGCGCCGAAG GTGGCGATGA TGGTGGCTGA ATGCTTCGAT AAGATTCGCG TGAACCCCGG TAACTTTGCG GATGGTCGTA AGACGTTTGA AAACATCCTG TACGAAACGG ATGAAGAGTA CAACGCCGAA CTCGCGGAGA TTGAAGAAAT CTTCACGCCG CTCGTGTTGA AGTGCAAGGA GCGAGGCGTG GCGATGCGCA TCGGTACGAA CCACGGTTCT TTGTCCGCGC GCACGTTGTC GCGCTTTGGT GACACGCCGA TGGGGATGGT GGAATCTGCG TTCGAATTCG CGCGCATTTG CCGCAAGCAC GATTACCACA ACTTTGTCTT CTCCATGAAG GCGTCTAACC CTCTCGTCAT GGTGCAAGCG TATCGCCTGT TGTCCCACGA GATGTACAAG CTCGGCTGGG ACTACCCGCT TCACCTCGGC GTCACCGAAG CCGGCGAAGG CGAAGACGGT CGCATGAAGT CTTCCATCGG TATCGGTGCT CTCTTACTTG ACGGTCTCGG AGACACCATT CGCGTGTCTC TCACCGAAGC ATCCGAGCTC GAAATCGAGC CGTGCACGCG CTTGGCCAAC CTCGGTATGA AGGCGTGCGC CGAAAACATC GGTGTCGAGC AATTCGAGGA CTCTGTCCGT GACTTCAAGA CCTTCACGCG CAGAACTGGT GATTTGCCCG AACAAAAGGA CTCTGATGCG ATTGATTTCC GGAACATTCT CCACCGCGAC GGCTCTGTGC TCGCCGCCGT CACCACGGCG CAACTCGCGT CGGAAGATGG CTTCTACCGC GAATTGGGCG CCAAGCTTGC GGTCGGTATG CCGCTTCGTG ATATCGCCAC CTGTGATTCC ATCTTGCTCA GCGAAGTTCC CGCCGCGTCC GAAGAGAAGG CGCTCCGATC TATCCGTCGC CTCCAAGAAA TCGGCTGCGG CGTTGTCGTG CCGGCTGACC TTCTCCGAGC GACGCCGTTG GAGAACGCCA TCGCCCTCGT CAGCGCCGAC GAAGTCGGTG CTCCGTTGCC GGCCGGTGCC GCGCGAAAGG CTGTCACGTT GAACGGCTTC GAGACCGTCG AACAACTCAA GGCTGTCGCA GCGTCTGACG CCGTCATGGT GCTCATCAAG ACGAAGGATG GCGAGTCTCG TCTCCACTCT TCTCGCCGCA TCGCCGAAGT GATGGCGCAA GTCGGTGCGA AGATGCCGGT CATTCATCAC ATGTACATGC CGACCGGCGA CAAGTCGGAC ATCGTCATTC AATCCGGCTC CCAAGTCGGT GGTTTGCTCG TCGACGGCTT TGGCGACGGC GTGTTGATCC AATACGCCGG CTCCGACAAG GATGTCTCAT TGGACTTCCT TCGTACGACC TCGTTCGGTT TGCTCCAAGG CTCTCGTATG CGTAACACGA AGACGGAGTA CGTCTCTTGC CCGTCGTGCG GACGCACCCT CTTCGACTTG CAAGAAGTCA CGGCGCAAAT CCAAGAGAAG ACTGGACACT TGCCTGGCGT CGCCATCGCC GTTATGGGTT GCATCGTCAA CGGTCCGGGT GAGATGGCGG ATGCGGATTT CGGTTATGTT GGTGGTGCTC CGGGTAAGAT TGACTTGTAC GTCGGTAAGG AAGTCGTCAA GCGAGGTATC GCGATGGAAA CTGCGTGTGA TGAACTCATT CAACTCATCA AGGACAACGA CCGCTGGATC GAAAAAGAAG TCGAGGAAGC CGTCGCCGCG TGA
|
Protein sequence | MDERGVLMPT GKYCVNHAET VRRKTRTVHV GNVKIGSEHP IVKQTMTTSD TRDVEKTVAE VIRCADAGAE MVRITVQGMQ EAKACKIIKE TLVAKGYDTP LIADIHFAPK VAMMVAECFD KIRVNPGNFA DGRKTFENIL YETDEEYNAE LAEIEEIFTP LVLKCKERGV AMRIGTNHGS LSARTLSRFG DTPMGMVESA FEFARICRKH DYHNFVFSMK ASNPLVMVQA YRLLSHEMYK LGWDYPLHLG VTEAGEGEDG RMKSSIGIGA LLLDGLGDTI RVSLTEASEL EIEPCTRLAN LGMKACAENI GVEQFEDSVR DFKTFTRRTG DLPEQKDSDA IDFRNILHRD GSVLAAVTTA QLASEDGFYR ELGAKLAVGM PLRDIATCDS ILLSEVPAAS EEKALRSIRR LQEIGCGVVV PADLLRATPL ENAIALVSAD EVGAPLPAGA ARKAVTLNGF ETVEQLKAVA ASDAVMVLIK TKDGESRLHS SRRIAEVMAQ VGAKMPVIHH MYMPTGDKSD IVIQSGSQVG GLLVDGFGDG VLIQYAGSDK DVSLDFLRTT SFGLLQGSRM RNTKTEYVSC PSCGRTLFDL QEVTAQIQEK TGHLPGVAIA VMGCIVNGPG EMADADFGYV GGAPGKIDLY VGKEVVKRGI AMETACDELI QLIKDNDRWI EKEVEEAVAA
|
| |