Gene BURPS1106A_2228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2228 
SymbolispG 
ID4902675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp2216794 
End bp2218044 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content69% 
IMG OID640135457 
Product4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase 
Protein accessionYP_001066492 
Protein GI126452211 
COG category[I] Lipid transport and metabolism 
COG ID[COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 
TIGRFAM ID[TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0113793 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTCGGCG GGCATGCGCC GCGGCGCGTG TCGCATGCGG TCGATGTCCG CTGGGGCGGC 
ACGCTCGTGA CGATCGGCGG CGCGGCGCCC GTGCGCGTGC AGTCGATGAC GAACACCGAT
ACGGCCGACG CGATCGGCAC CGCGATCCAG GTGAAGGAGC TCGCGAACGC GGGCTCCGAG
CTCGTGCGCA TCACCGTGAA CACGCCGGAG GCGGCCGCTG CCGTGCCGGC GATTCGCGAG
CAGCTCGACC GGATGGGCGT GACGGTGCCG CTTGTCGGCG ATTTCCACTA CAACGGCCAC
CTGCTGCTGC GCGACTACCC GGACTGCGCG CAGGCGCTGT CGAAATACCG GATCAACCCG
GGCAACGTCG GCCAGGGCGC GAAGCGCGAT TCGCAGTTCG CGCAGATGAT CGAAGCCGCG
ATCAAGTACG ACAAGCCGGT GCGGATCGGC GTGAACTGGG GCAGCCTCGA TCAGGACCTG
CTCGCGCGGA TGATGGACGA GAACGGCGCG CGCGCCGAGC CGTGGGAGGC GCAGAGCGTG
ATGTACGAGG CGCTGATCCA GTCGGCGATC GGCTCGGCCG AGCGCGCGGT CGAGCTCGGC
CTCGGCCGCG ACAAGATCGT GCTGTCGTGC AAGGTGAGCG GCGTGCAGGA CCTGGTCGCC
GTGTACCGCG AACTGTCACG CCGCTGCGGC TTCGCGCTGC ACCTCGGCCT CACCGAGGCG
GGCATGGGCT CGAAGGGCAT CGTCGCGTCG ACCGCGGCGA TCGGTCTGCT GCTGCAGGAA
GGCATCGGCG ACACGATCCG CATCTCGCTC ACGCCGGAGC CGGGCGCGCC GCGCACGGGC
GAAGTGGTGG TCGGCCAGGA GATCCTGCAG ACGATGGGGC TGCGCTCGTT CGCGCCGATG
GTCGTCGCGT GTCCGGGCTG CGGCCGCACG ACGAGCACGC TGTTCCAGGA GCTCGCGCTG
CGGATCCAGA CCTACCTGCG CGAACAGATG CCCGTGTGGC GCAGCGAATA CCCGGGCGTC
GAGAAGATGA ACGTCGCGGT GATGGGGTGC ATCGTCAACG GCCCGGGCGA GTCGAAGCAC
GCGAACATCG GCATCAGCCT GCCGGGCTCG GGCGAGAATC CGGCCGCGCC GGTGTTCGTC
GACGGCGAGA AAGTGAAGAC GCTGCGCGGC GAGCACATCG CGGAAGAGTT CCAGCAGATC
GTGAGCGACT ACGTCGCGCG CACCTACGGC CGCGCCGCGG CGCAGAATTA A
 
Protein sequence
MFGGHAPRRV SHAVDVRWGG TLVTIGGAAP VRVQSMTNTD TADAIGTAIQ VKELANAGSE 
LVRITVNTPE AAAAVPAIRE QLDRMGVTVP LVGDFHYNGH LLLRDYPDCA QALSKYRINP
GNVGQGAKRD SQFAQMIEAA IKYDKPVRIG VNWGSLDQDL LARMMDENGA RAEPWEAQSV
MYEALIQSAI GSAERAVELG LGRDKIVLSC KVSGVQDLVA VYRELSRRCG FALHLGLTEA
GMGSKGIVAS TAAIGLLLQE GIGDTIRISL TPEPGAPRTG EVVVGQEILQ TMGLRSFAPM
VVACPGCGRT TSTLFQELAL RIQTYLREQM PVWRSEYPGV EKMNVAVMGC IVNGPGESKH
ANIGISLPGS GENPAAPVFV DGEKVKTLRG EHIAEEFQQI VSDYVARTYG RAAAQN