Gene Dole_2059 details


Gene Information

Locus tag: Dole_2059
Symbol: ispG
ID: 5694902
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 2510208
End bp: 2511305
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 60%
IMG OID: 641264660
Product: 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase
Protein accession: YP_001529940
Protein GI: 158522070
COG category: [I] Lipid transport and metabolism
COG ID: [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis
TIGRFAM ID: [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase
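
The coordinate and length fields above are related by simple arithmetic, which can be used as a consistency check. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (the listed gene sequence ends in TAA, which is consistent with that). The function names `span_length` and `expected_protein_length` are hypothetical helpers written for this illustration, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2510208
END_BP = 2511305
GENE_LENGTH_BP = 1098
PROTEIN_LENGTH_AA = 365


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the stop codon


# Both checks pass for this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

A mismatch in either check would suggest 0-based or half-open coordinates, a partial CDS, or a transcription error in the record.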


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCATTTG ATATAACCCG CAGAAAATCA AGACAGATTC ACATCGGTTC CGTTGCCGTG 
GGCGGCGAGG CCCCGGTATC GGTCCAGTCC ATGACCACCA CCGACACAGC GGACGTGGAA
GCCACGGTGG CCCAGATCGC AAGGCTTGAA GCGGCAGGGT GCGAGATTGT GCGGGTGGCG
GTGCCCAACC CGGAGGCGGC AAAGGCCATC GGGCGGATTA AAAATGAGAT CGCTATTCCT
CTGATCGCCG ACATTCATTT CAACTGGCGC CTGGCCGTGG CCGCCATGGA GGCCGGGGCC
GACGGCCTGC GCATCAATCC CGGCAACATC GGGGGCCGGG AGAAAACCCT TAACGTCATC
GACTGCGCCC GGCATCACAA GGTGGCCATT CGCATCGGCG TCAACTCCGG GTCTGTGGAA
AAAGACCTGC TGAAAAAACA CAGCGGGCCG ACGCCCCGGG CCATGGTGGA AAGCGCCCTG
CGCAACATCG CCATTTTCGA AGCGGCAAAT TTTTTTGATA TCAAGCTCTC CCTCAAAGCC
TCGGATGTGG CCCGCACCGT GGAGAGCTAC CGTCTGATCG CCGCCGCCTG CGATTACCCG
CTTCATGTGG GGGTCACTGA AGCCGGGGGG CTGTTTCCCG GGCTGGTCAA GTCTTCTCTG
GGCATCGGCA TGCTGCTGGC CGAAGGCATC GGCGACACCA TTCGCGTGTC GCTCACCCGG
GATCCGGTGG AGGAAATCCG GGCCGGGTTT GAAATTCTCA AGGCTGTGAA CAGGCGGCAC
GTGGGTCCGG ACATTATATC GTGCCCCACC TGCGGCCGGT GCGGCATTGA TCTTTTTGCC
ATCGTGGAAC AGGTGGAACG CCATGCCCTG GCCATGCGGG CCCCGGTAAA AATCGCGGTG
ATGGGGTGCG TGGTCAATGG CCCGGGAGAA GCGGCCGAGG CGGATGTGGG CATTGCCGGC
GGCAGGGGAG AGGGCGTGCT GTTTAAAAAA GGGAAAATTG TTAAGACCAT TCCCGAAGCC
AACCTGGTGA AAAATCTGCT GGCGGAAATG GATAAGCTTG AGACGAATTG GGGTAAAAAA
GGAGCTGGAA AAAAATAA
 
Protein sequence
MAFDITRRKS RQIHIGSVAV GGEAPVSVQS MTTTDTADVE ATVAQIARLE AAGCEIVRVA 
VPNPEAAKAI GRIKNEIAIP LIADIHFNWR LAVAAMEAGA DGLRINPGNI GGREKTLNVI
DCARHHKVAI RIGVNSGSVE KDLLKKHSGP TPRAMVESAL RNIAIFEAAN FFDIKLSLKA
SDVARTVESY RLIAAACDYP LHVGVTEAGG LFPGLVKSSL GIGMLLAEGI GDTIRVSLTR
DPVEEIRAGF EILKAVNRRH VGPDIISCPT CGRCGIDLFA IVEQVERHAL AMRAPVKIAV
MGCVVNGPGE AAEADVGIAG GRGEGVLFKK GKIVKTIPEA NLVKNLLAEM DKLETNWGKK
GAGKK
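
The gene and protein sequences above are linked by translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch of both steps, not the method the database itself uses. It relies on table 11 assigning the same amino acids to codons as the standard code; the alternative start codons that table 11 permits are not handled, which does not matter here because this gene starts with ATG. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first and last fragments of the gene sequence are used as inputs.

```python
import itertools

# Codon table in TCAG order; amino-acid assignments are shared by the
# standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence in this record.
print(translate("ATGGCATTTG ATATAACCCG CAGAAAATCA"))  # MAFDITRRKS
print(translate("GGAGCTGGAA AAAAATAA"))               # GAGKK
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the result to be compared against the protein sequence and the GC content listed in the record.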