Gene Dvul_1724 details


Gene Information

Locus tag: Dvul_1724
Symbol: ispG
ID: 4663016
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris DP4
Kingdom: Bacteria
Replicon accession: NC_008751
Strand:
Start bp: 2035324
End bp: 2036484
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 63%
IMG OID: 639819963
Product: 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase
Protein accession: YP_967168
Protein GI: 120602768
COG category: [I] Lipid transport and metabolism
COG ID: [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis
TIGRFAM ID: [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase
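
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for Dvul_1724.
length = gene_length(2035324, 2036484)
print(length, protein_length(length))  # 1161 386
```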


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.694346
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.709473
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTGGCT TGCTGGGTGA AATGCAGTAT GTACCCGGCA TCCATTCACC CAATGAAGGA 
CCGCGCCCCA TGACCATCCA ACGCAAGCAG ACCCGCGAGG TGCGCATAGG CAAGGTGCGT
ATCGGCGGTG CCAATCCCGT TGTCGTGCAG AGCATGACCA ACACCGACAC GCGGGACGTC
GAGCAGACTG TCGAGCAGAT ACGCCAATTG CAGGAAGCCG GCTGCGAGAT CGTACGTCTC
GCCGTCCTCA ACGAGGACGC GGCATGGGCC ATCAAGCCCA TCCGGTCGCA GGTTTCCGTA
CCGCTGGTCG CTGACATCCA TTTCGACCAC AGGCTCGCCG TCTCCGCCCT CGAGGCGGGT
GTGGACGCCC TGCGCATCAA CCCCGGCAAC ATCGGAACGA GGGCTGCGGT CGACCGCGTG
GTGGACGCCG CCAAGGCCCA TAACGCCGTC ATCCGCATCG GCGTGAACTC GGGCTCGCTG
GAGACCGACC TCATCGACCA GTATGGCGGG CCCACTCCCG AAGCCATGGT GGAGAGTGCG
TTCCGTCACA TCAGGATGCT CGAGGATCGC AATTTCGGCG ACATCAAAGT CTCGCTCAAA
TCCTCGTCCG TGTCGCGTTG TATCGAAGCG TATACGCTGC TTTCCGCGAA GTGCGACTAC
CCGCTGCATA TCGGCGTCAC TGAAGCCGGT ACGGTGCTGC GTGGTTCCAT CAAGTCTGCT
GTCGGGCTCG GTGTCCTGCT GTGGCAGGGC ATCGGCGATA CCCTGCGGGT GTCGCTCACC
AGCGACCCCG TGGCCGAGAT GGCGGTGGCG TGGGAGATAC TCCGCTCACT CGGGTTGCGC
TCGCGGGGGC CTGAGATCAT CGCCTGTCCC ACCTGTGGTC GCTGTGAGAT AGGACTCATC
GCCCTCGCGG AAGAGGTCGA GCGACGTCTC GAAGGCGAGA CGGAGAGCTT CAAGGTGGCG
GTGATGGGGT GTGTGGTCAA TGGCCCCGGA GAGGCGCGCG AGGCCGACCT CGGCATCGCG
GGCGGGCGCG ACAAGGGCAT CATCTTCCGC AAGGGTGAGA TTGTGCGCAC CGTCAAGGGC
GGCTCGAACC TGCTTGCCGC CTTCATGGAA GAACTCGACA CTTTTCTGGC CCACCGCAGG
GCCGAACGTA AGGATGACTG A
 
Protein sequence
MLGLLGEMQY VPGIHSPNEG PRPMTIQRKQ TREVRIGKVR IGGANPVVVQ SMTNTDTRDV 
EQTVEQIRQL QEAGCEIVRL AVLNEDAAWA IKPIRSQVSV PLVADIHFDH RLAVSALEAG
VDALRINPGN IGTRAAVDRV VDAAKAHNAV IRIGVNSGSL ETDLIDQYGG PTPEAMVESA
FRHIRMLEDR NFGDIKVSLK SSSVSRCIEA YTLLSAKCDY PLHIGVTEAG TVLRGSIKSA
VGLGVLLWQG IGDTLRVSLT SDPVAEMAVA WEILRSLGLR SRGPEIIACP TCGRCEIGLI
ALAEEVERRL EGETESFKVA VMGCVVNGPG EAREADLGIA GGRDKGIIFR KGEIVRTVKG
GSNLLAAFME ELDTFLAHRR AERKDD
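
The sequences above are displayed in blocks of ten characters, so the whitespace has to be removed before they can be used downstream. The following is a minimal Python sketch of that cleanup plus a GC-fraction calculation. The helper names are hypothetical, and the example input is only the first displayed line of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block gaps, line breaks) and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGCTTGGCT TGCTGGGTGA AATGCAGTAT GTACCCGGCA TCCATTCACC CAATGAAGGA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.2f}")
```

Applied to the full gene sequence, `clean_sequence` should yield a string whose length matches the Gene Length field (1161 bp), and `gc_fraction` can be compared against the listed GC content.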