Gene OSTLU_12863 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_12863 
SymbolHDS 
ID5003594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp137320 
End bp139362 
Gene Length2043 bp 
Protein Length680 aa 
Translation table 
GC content58% 
IMG OID640419015 
Product4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase, putative chloroplast precursor (1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase) (ISPG) 
Protein accessionXP_001419500 
Protein GI145350194 
COG category[I] Lipid transport and metabolism 
COG ID[COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 
TIGRFAM ID[TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00611793 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGAAC GAGGCGTGCT GATGCCGACG GGGAAGTACT GCGTGAATCA CGCGGAGACG 
GTGCGAAGGA AGACGCGCAC GGTGCACGTG GGCAACGTGA AGATTGGCTC GGAGCACCCA
ATCGTGAAGC AGACGATGAC GACGAGCGAC ACGAGAGACG TCGAGAAGAC GGTGGCGGAG
GTGATTCGAT GCGCCGATGC CGGTGCGGAG ATGGTGCGGA TCACGGTTCA GGGGATGCAA
GAGGCGAAGG CGTGCAAGAT CATCAAGGAG ACGCTCGTGG CGAAGGGATA CGATACGCCG
CTCATCGCGG ATATCCACTT CGCGCCGAAG GTGGCGATGA TGGTGGCTGA ATGCTTCGAT
AAGATTCGCG TGAACCCCGG TAACTTTGCG GATGGTCGTA AGACGTTTGA AAACATCCTG
TACGAAACGG ATGAAGAGTA CAACGCCGAA CTCGCGGAGA TTGAAGAAAT CTTCACGCCG
CTCGTGTTGA AGTGCAAGGA GCGAGGCGTG GCGATGCGCA TCGGTACGAA CCACGGTTCT
TTGTCCGCGC GCACGTTGTC GCGCTTTGGT GACACGCCGA TGGGGATGGT GGAATCTGCG
TTCGAATTCG CGCGCATTTG CCGCAAGCAC GATTACCACA ACTTTGTCTT CTCCATGAAG
GCGTCTAACC CTCTCGTCAT GGTGCAAGCG TATCGCCTGT TGTCCCACGA GATGTACAAG
CTCGGCTGGG ACTACCCGCT TCACCTCGGC GTCACCGAAG CCGGCGAAGG CGAAGACGGT
CGCATGAAGT CTTCCATCGG TATCGGTGCT CTCTTACTTG ACGGTCTCGG AGACACCATT
CGCGTGTCTC TCACCGAAGC ATCCGAGCTC GAAATCGAGC CGTGCACGCG CTTGGCCAAC
CTCGGTATGA AGGCGTGCGC CGAAAACATC GGTGTCGAGC AATTCGAGGA CTCTGTCCGT
GACTTCAAGA CCTTCACGCG CAGAACTGGT GATTTGCCCG AACAAAAGGA CTCTGATGCG
ATTGATTTCC GGAACATTCT CCACCGCGAC GGCTCTGTGC TCGCCGCCGT CACCACGGCG
CAACTCGCGT CGGAAGATGG CTTCTACCGC GAATTGGGCG CCAAGCTTGC GGTCGGTATG
CCGCTTCGTG ATATCGCCAC CTGTGATTCC ATCTTGCTCA GCGAAGTTCC CGCCGCGTCC
GAAGAGAAGG CGCTCCGATC TATCCGTCGC CTCCAAGAAA TCGGCTGCGG CGTTGTCGTG
CCGGCTGACC TTCTCCGAGC GACGCCGTTG GAGAACGCCA TCGCCCTCGT CAGCGCCGAC
GAAGTCGGTG CTCCGTTGCC GGCCGGTGCC GCGCGAAAGG CTGTCACGTT GAACGGCTTC
GAGACCGTCG AACAACTCAA GGCTGTCGCA GCGTCTGACG CCGTCATGGT GCTCATCAAG
ACGAAGGATG GCGAGTCTCG TCTCCACTCT TCTCGCCGCA TCGCCGAAGT GATGGCGCAA
GTCGGTGCGA AGATGCCGGT CATTCATCAC ATGTACATGC CGACCGGCGA CAAGTCGGAC
ATCGTCATTC AATCCGGCTC CCAAGTCGGT GGTTTGCTCG TCGACGGCTT TGGCGACGGC
GTGTTGATCC AATACGCCGG CTCCGACAAG GATGTCTCAT TGGACTTCCT TCGTACGACC
TCGTTCGGTT TGCTCCAAGG CTCTCGTATG CGTAACACGA AGACGGAGTA CGTCTCTTGC
CCGTCGTGCG GACGCACCCT CTTCGACTTG CAAGAAGTCA CGGCGCAAAT CCAAGAGAAG
ACTGGACACT TGCCTGGCGT CGCCATCGCC GTTATGGGTT GCATCGTCAA CGGTCCGGGT
GAGATGGCGG ATGCGGATTT CGGTTATGTT GGTGGTGCTC CGGGTAAGAT TGACTTGTAC
GTCGGTAAGG AAGTCGTCAA GCGAGGTATC GCGATGGAAA CTGCGTGTGA TGAACTCATT
CAACTCATCA AGGACAACGA CCGCTGGATC GAAAAAGAAG TCGAGGAAGC CGTCGCCGCG
TGA
 
Protein sequence
MDERGVLMPT GKYCVNHAET VRRKTRTVHV GNVKIGSEHP IVKQTMTTSD TRDVEKTVAE 
VIRCADAGAE MVRITVQGMQ EAKACKIIKE TLVAKGYDTP LIADIHFAPK VAMMVAECFD
KIRVNPGNFA DGRKTFENIL YETDEEYNAE LAEIEEIFTP LVLKCKERGV AMRIGTNHGS
LSARTLSRFG DTPMGMVESA FEFARICRKH DYHNFVFSMK ASNPLVMVQA YRLLSHEMYK
LGWDYPLHLG VTEAGEGEDG RMKSSIGIGA LLLDGLGDTI RVSLTEASEL EIEPCTRLAN
LGMKACAENI GVEQFEDSVR DFKTFTRRTG DLPEQKDSDA IDFRNILHRD GSVLAAVTTA
QLASEDGFYR ELGAKLAVGM PLRDIATCDS ILLSEVPAAS EEKALRSIRR LQEIGCGVVV
PADLLRATPL ENAIALVSAD EVGAPLPAGA ARKAVTLNGF ETVEQLKAVA ASDAVMVLIK
TKDGESRLHS SRRIAEVMAQ VGAKMPVIHH MYMPTGDKSD IVIQSGSQVG GLLVDGFGDG
VLIQYAGSDK DVSLDFLRTT SFGLLQGSRM RNTKTEYVSC PSCGRTLFDL QEVTAQIQEK
TGHLPGVAIA VMGCIVNGPG EMADADFGYV GGAPGKIDLY VGKEVVKRGI AMETACDELI
QLIKDNDRWI EKEVEEAVAA