Gene Rleg_4159 details


Gene Information

Locus tag: Rleg_4159
Symbol: ispG
ID: 8014951
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 4243912
End bp: 4245162
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 64%
IMG OID: 644826729
Product: 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase
Protein accession: YP_002977939
Protein GI: 241206843
COG category: [I] Lipid transport and metabolism
COG ID: [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis
TIGRFAM ID: [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase
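
The gene length and protein length listed above follow from the start and end coordinates. The sketch below shows the arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a stop codon; the function names are illustrative and not part of the database.

```python
# Coordinates copied from the record above.
START_BP = 4243912
END_BP = 4245162


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1251
print(protein_length(length_bp))  # 416
```

Both values agree with the Gene Length and Protein Length fields of the record.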


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.834774
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGTCAG CCGCCGATTT TGATCCGAAA CCGCGCCGCG CTTCCGTTGC CGTCGATGTC 
GGCGGCGTCA TCGTCGGCGG CGGGGCGCCG GTCGTCGTGC AGTCCATGAC GAACACCGAT
ACGGCCGATA TCGATTCCAC CGTCGCGCAG GTCGCCGCTC TCCACCGGGC GGGCTCGGAA
CTGGTGCGCA TTACCGTCGA CCGCGACGAG AGTGCAGCCG CCGTGCCGAA GATCCGCGAG
CGGCTTCTGC GCCTCGGCAT GGACGTGCCC TTGATCGGCG ACTTCCACTA TATCGGCCAC
AAGCTGCTCG CCGATCATCC GGATTGCGCC GAAGCGCTGG CGAAATACCG CATCAACCCC
GGCAATGTCG GCTTCAAGGA CAAGAAGGAC AAGCAGTTCG CCGAGATCAT CGAGATGGCG
ATCCGCTATG ACAAGCCGGT GCGCATCGGC GTCAACTGGG GCTCGCTCGA TCAGGATCTG
CTGACGGCGC TGATGGACCG GAACGCCGAA GCCGGATCGC CGCTTTCGGC CCGGCAGGTG
ACGCGCGAGG CGATCGTGCA GTCGGCGCTG CTTTCGGCAG CCCTTGCCGA AGAGATCGGC
CTGCCGCGCA ACCGCATCAT CCTGTCGGCC AAGGTCAGCC AGGTGCAGGA CCTGATCGCC
GTCAATTCCA TGCTTGCCGA ACGCTCCAAT CATGCGCTGC ATCTCGGCCT GACCGAAGCC
GGCATGGGCA CCAAGGGCAT CGTCGCCTCG TCTGCGGCGA TGGGCTTCGT GCTGCAGCAC
GGCATCGGCG ATACGATCCG CGTGTCGCTG ACGCCGGAGC CGAACGGCGA CCGCACGCGC
GAAGTCCAGG TGGCGCAGGA AATCCTGCAG GTCATGGGCT TTCGCCAGTT CATACCCGTC
GTTGCGGCCT GTCCGGGCTG TGGACGCACG ACGTCGACGG TGTTCCAGGA ACTTGCCCAG
AATATCCAGA ACGACATCCG CAAGAACATG CCTGTCTGGC GCGAGAAATA TCCTGGGGTC
GAGGCGCTGA ACGTCGCCGT CATGGGCTGC ATCGTCAACG GGCCGGGCGA AAGCAAACAT
GCCGATATCG GCATTTCGCT TCCGGGCACT GGCGAAACGC CGGCCGCCCC CGTCTTCATC
GACGGCCGGA AGGCGCTGAC TCTGCGCGGT GCCAATATCG CCGCCGATTT CGAGGCGCTG
GTTGTCGACT ATATCGAGAA GCGTTTCGGC CAACGGACGG CGGCGGAATG A
 
Protein sequence
MLSAADFDPK PRRASVAVDV GGVIVGGGAP VVVQSMTNTD TADIDSTVAQ VAALHRAGSE 
LVRITVDRDE SAAAVPKIRE RLLRLGMDVP LIGDFHYIGH KLLADHPDCA EALAKYRINP
GNVGFKDKKD KQFAEIIEMA IRYDKPVRIG VNWGSLDQDL LTALMDRNAE AGSPLSARQV
TREAIVQSAL LSAALAEEIG LPRNRIILSA KVSQVQDLIA VNSMLAERSN HALHLGLTEA
GMGTKGIVAS SAAMGFVLQH GIGDTIRVSL TPEPNGDRTR EVQVAQEILQ VMGFRQFIPV
VAACPGCGRT TSTVFQELAQ NIQNDIRKNM PVWREKYPGV EALNVAVMGC IVNGPGESKH
ADIGISLPGT GETPAAPVFI DGRKALTLRG ANIAADFEAL VVDYIEKRFG QRTAAE
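
The sequences above are printed in space-separated blocks of ten. As a minimal sketch of how the gene sequence relates to the protein sequence, the code below strips the spacing, computes GC content, and translates codons until the first stop. It uses the codon assignments of NCBI translation table 11, which are the same as the standard code, and it does not handle alternative start codons. All function names are illustrative.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from the record above.
first_line = "ATGTTGTCAG CCGCCGATTT TGATCCGAAA CCGCGCCGCG CTTCCGTTGC CGTCGATGTC"
dna = clean(first_line)
print(translate(dna))  # MLSAADFDPKPRRASVAVDV
```

The printed residues match the first twenty residues of the protein sequence in the record. Passing the full cleaned gene sequence to `gc_content` and `translate` allows the same check against the GC content and Protein Length fields.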