Gene GSU1459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1459 
SymbolispG 
ID2686219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1598892 
End bp1599953 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content61% 
IMG OID637126133 
Product4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase 
Protein accessionNP_952510 
Protein GI39996559 
COG category[I] Lipid transport and metabolism 
COG ID[COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 
TIGRFAM ID[TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0367396 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCCA AGACGAGACA GATCCGAGTG GGGAACGTAC CGGTGGGCGG CGATGCGCCC 
TGCTCGGTGC AATCCATGTG CAATACCGAT ACGCGGGACG CGGGCGCCAC CCTTGATCAG
ATCAACGCCC TGGCGGCTGC GGGCTGTGAA ATCGTCCGCT GCGCCGTCCC CGACATGGCC
GCTGCCGAGG CCTTGGGCGC CATCAAGCGC CAAAGCCCCA TCCCGGTTAT TGCCGACATC
CACTTTGACT ACAAACTTGC CCTGAAGGTA CTGGAGGGGG GAATTGACGG CCTGCGTCTC
AATCCCGGCA ATATCGGCGA ACGGTGGAAG GTGGAAGAGG TTGTAGCAGC GGCCCGGGAA
CGGCTGGTAC CCATCCGCAT CGGCGTCAAT GCCGGCTCTC TAGAAAAGGA GCTTCTTCAG
AAATACGGCC ACCCGACCGC CGAGGCCATG GTCGAGTCGG CCCTTGGCCA TGTGCGTATT
CTCGAAGAAC TCGGCTACGA TCAGATCAAG ATATCTCTCA AGGCATCGGA CGTACCCAAG
ACGGTGGCGG CTTACCGGCT TCTGGCCCAA CGGATCGACT ATCCGCTTCA CATCGGCATT
ACCGAAGCGG GCACTATGTT CTCCGGAACC ATCAAGTCAG CCGTGGGGCT TGGCATCCTG
CTTGCCGACG GCATCGGCGA TACGCTCCGG GTATCCCTCA CGGGTGATCC GGTGGACGAG
GTGCGGGTCG GCTTCGAGAT CCTCAAGGCG CTTAATCTCA GACAAAAAGG GATTAATCTT
GTCTCCTGCC CCACCTGCGG CCGTTGCCAG ATCAACCTCA TCGGGGTGGC CGAAGAGGTT
GAGAAGCGTC TCGCCGGCAT CGACGCCCAT CTCACCGTGG CGGTCATGGG ATGCGTCGTG
AACGGACCCG GCGAGGCCCG CGAGGCCGAC GTGGGGATTG CCGGCGGACG GGGCGAGGGG
CTCCTGTTCC GCAACGGGGA AATCGTCCGC AAGGTGCCGG AGGCCGACAT GGCCGATGCG
CTGATTGCGG AAGTCGAAAA GATACTCGCT GAAAAACACT AA
 
Protein sequence
MKAKTRQIRV GNVPVGGDAP CSVQSMCNTD TRDAGATLDQ INALAAAGCE IVRCAVPDMA 
AAEALGAIKR QSPIPVIADI HFDYKLALKV LEGGIDGLRL NPGNIGERWK VEEVVAAARE
RLVPIRIGVN AGSLEKELLQ KYGHPTAEAM VESALGHVRI LEELGYDQIK ISLKASDVPK
TVAAYRLLAQ RIDYPLHIGI TEAGTMFSGT IKSAVGLGIL LADGIGDTLR VSLTGDPVDE
VRVGFEILKA LNLRQKGINL VSCPTCGRCQ INLIGVAEEV EKRLAGIDAH LTVAVMGCVV
NGPGEAREAD VGIAGGRGEG LLFRNGEIVR KVPEADMADA LIAEVEKILA EKH