Gene TM1040_0862 details


Gene Information

Locus tag: TM1040_0862
Symbol: ispG
ID: 4076232
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 915958
End bp: 917091
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 60%
IMG OID: 638006164
Product: 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase
Protein accession: YP_612857
Protein GI: 99080703
COG category: [I] Lipid transport and metabolism
COG ID: [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis
TIGRFAM ID: [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase
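
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are hypothetical and are not part of any database API.

```python
# Sketch only: function names are hypothetical, not part of any database API.


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends with a single stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


if __name__ == "__main__":
    length = gene_length_bp(915958, 917091)
    print(length, protein_length_aa(length))
```

With the values from this record, the sketch gives 1134 bp and 377 aa, which agrees with the Gene Length and Protein Length fields.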


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0125129
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCATGA ATCATATCCG CCCTTGGCGG AACATCGAAC GCCGCAAGAG CCGCCAGATC 
CATGTGGGCA ATGTGCCCGT AGGGGGGGAT GCGCCAATTG CGGTGCAGAC CATGACCAAC
ACGTTGACCA CCGACATCAA AGGCACCATC GCACAGGTGC AGGCCGCCGC CGATGCGGGC
GCCGATATAG TGCGTGTCTC GGTCCCGGAC GAGGCCTCTG CCCGCGCGCT CAAGGAGATC
GTGCGGGAAA GCCCGGTGCC GATCGTGGCC GATATTCATT TCCACTACAA ACGTGGCATC
GAGGCCGCCG AGGCCGGTGC CGCCTGCCTG CGCATCAATC CCGGAAACAT CGGCGACGAA
AAACGCGTGG CCGAGGTGAT CAAGGCCGCC CGCGACCACA ATTGCTCGAT CCGTATTGGC
GTCAATGCTG GCAGCCTCGA AAAACACCTC CTCGAAAAAT ACGGCGAGCC TTGCCCGGAC
GCGATGGTCG AATCCGGCCT TGATCACATC AAGATCCTTC AGGACCACGA CTTTCATGAA
TTCAAGATCT CGGTCAAAGC CTCCGACGTC TTCATGTCTG CCGCCGCTTA TCAGATGCTT
GCGGATGCGA CCGACGCCCC CATCCACCTC GGCATCACCG AGGCTGGTGG GCTGATGTCC
GGCACGATCA AGTCTGCGAT CGGCCTCGGA CAGCTCTTGT GGATGGGCAT TGGCGACACC
CTGCGCGTAA GCCTTTCCGC CGATCCGGTT GAAGAGGTCA AAGTAGGCTT TGAGATCCTC
AAATCCCTCG GCCTGCGTCA CCGGGGCGTC AATATCATCT CCTGCCCGAG CTGCGCGCGT
CAGGGGTTTG ACGTTATCAA AACGGTCGAA ACCCTCGAAG AGCGCCTCGA ACACATCAAA
ACCCCCATGA GCCTCTCGAT CATCGGCTGC GTTGTGAATG GTCCGGGCGA GGCTTTGATG
ACCGATGTCG GCTTCACCGG TGGCGGCGCG GGCTCCGGCA TGGTGTATCT TGCGGGCAAG
GCCAGCCACA AGATGTCCAA CGACCAGATG ATCGATCACA TCGTCGAAGA GGTCGAGAAA
AAAGCCGCCG CTCTCGACGC GCAAGCGGCT GAGGACATGA AAGCGGCGGA ATAA
 
Protein sequence
MSMNHIRPWR NIERRKSRQI HVGNVPVGGD APIAVQTMTN TLTTDIKGTI AQVQAAADAG 
ADIVRVSVPD EASARALKEI VRESPVPIVA DIHFHYKRGI EAAEAGAACL RINPGNIGDE
KRVAEVIKAA RDHNCSIRIG VNAGSLEKHL LEKYGEPCPD AMVESGLDHI KILQDHDFHE
FKISVKASDV FMSAAAYQML ADATDAPIHL GITEAGGLMS GTIKSAIGLG QLLWMGIGDT
LRVSLSADPV EEVKVGFEIL KSLGLRHRGV NIISCPSCAR QGFDVIKTVE TLEERLEHIK
TPMSLSIIGC VVNGPGEALM TDVGFTGGGA GSGMVYLAGK ASHKMSNDQM IDHIVEEVEK
KAAALDAQAA EDMKAAE
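
The sequences above are printed in blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the blocks could be joined, how a GC fraction could be computed, and how the gene sequence could be translated. It assumes only the Python standard library; the helper names are hypothetical. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons), so a standard codon table is used here.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G base order.
# Amino-acid assignments are those of the standard code, which
# translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Join space- and newline-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First row of the gene sequence shown above.
    first_row = clean_sequence("ATGTCCATGA ATCATATCCG CCCTTGGCGG")
    print(translate(first_row))
```

Translating the first 30 bases of the gene sequence yields MSMNHIRPWR, which matches the start of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` would allow the GC content field to be checked in the same way.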