Gene WD0116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagWD0116 
SymbolispG 
ID2738965 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameWolbachia endosymbiont of Drosophila melanogaster 
KingdomBacteria 
Replicon accessionNC_002978 
Strand
Start bp109315 
End bp110595 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content38% 
IMG OID637172344 
Product4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase 
Protein accessionNP_965936 
Protein GI42520021 
COG category[I] Lipid transport and metabolism 
COG ID[COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 
TIGRFAM ID[TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAGATA AAGACCTGAT ATTGAATGAT AATACATGTG AATTATCGCA GATTTCCAGG 
CACAAAACTC ATGTTGTAGA AGTTGGAAAA GTAAAGATAG GCGGAAATAA TCCTGTAGTT
GTACAATCTA TGGCGCTTGG CGTGCATATA GATGCTGATA ATATAAAAAG TAGCGCAAAA
CATTATGCAA AAGAGATAAC AGAACTAGCG CGTACAGGTT CAGAATTGGT GCGAATTGCC
TTGAACTCAG AGGAGGTAGC AAGAGCAATA CCTTATATAG TAGAAGAGAT AAATAAAGAA
GGTTTTGACG GTAAAATATT GGTAGGCTGC GGTCAATATG AGCTAGATAA ATTAGTTCAA
GATTATCCAG ACAACATCAA AATGCTGGGT AAAATTAGAA TCAATCCAGG CAATATAGGC
TTTGGCGACA AACGTGATGA GAAGTTTGAA AGGGTGATAG AGTATGCAAT AATGCACGAT
CTTCCGGTTA GAATTGGGGT AAATTGGGGT AGTCTTGATA AATACCTTTT GCAAAAATTG
ATGGATGAAA ACTCTTCGCT TAGTAATTCA AGGCCTTCTG ATGTTATACT GCGCAAAGCA
CTTGTAATGT CTGCTCTTGA TAGTGCAAAA AAAGCTGAAG AAATTGGTCT AAATTCAAAC
AAAATAATCA TTTCATGTAA AGTTAGCAAA GTACAAGATT TAATTTTAGT TTATATGGCA
CTTGCAAAAT CTTCCAATTA TGCGCTGCAT TTGGGTTTAA CTGAGGCTGG TATGGGCAAT
AAAGGTGTGG TAAATACCGC AGCAGGGCTT ACTTATTTAT TGCAAAATGG CATTGGAGAT
ACAATACGGG CTTCTTTGAC TCAACGTCCT GGTGAATCGC GTACTAATGA AGTGGTGGTG
TGTCAGGAAA TATTGCAGTC TATAGGCTTG CGACACTTTA ACCCTCAGGT TAATTCATGT
CCTGGTTGTG GGCGCACGAG TAGCGATCGT TTTCGTATAT TAACTGAAGA GGTGAATGGC
TATATAAAAA CTCATATGCC GATGTGGAAG AAAAAAAATC CAGGTGTAGA GCATATGAAC
ATTGCTGTTA TGGGATGCAT AGTAAATGGT CCTGGAGAAA GCAAACACGC AAATTTGGGA
ATCAGCTTAC CTGGATATGG AGAAAAACCT GTTTCAGCAG TCTACAAAAA CGGCAAATAT
TTCAAAACTT TACAAGGTGA TAATGTCTCT GAAGAATTTA AGGCAATTAT TGATAATTAT
GTGAAGAAGC ATTACACGTA A
 
Protein sequence
MLDKDLILND NTCELSQISR HKTHVVEVGK VKIGGNNPVV VQSMALGVHI DADNIKSSAK 
HYAKEITELA RTGSELVRIA LNSEEVARAI PYIVEEINKE GFDGKILVGC GQYELDKLVQ
DYPDNIKMLG KIRINPGNIG FGDKRDEKFE RVIEYAIMHD LPVRIGVNWG SLDKYLLQKL
MDENSSLSNS RPSDVILRKA LVMSALDSAK KAEEIGLNSN KIIISCKVSK VQDLILVYMA
LAKSSNYALH LGLTEAGMGN KGVVNTAAGL TYLLQNGIGD TIRASLTQRP GESRTNEVVV
CQEILQSIGL RHFNPQVNSC PGCGRTSSDR FRILTEEVNG YIKTHMPMWK KKNPGVEHMN
IAVMGCIVNG PGESKHANLG ISLPGYGEKP VSAVYKNGKY FKTLQGDNVS EEFKAIIDNY
VKKHYT