Gene Rleg2_3831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3831 
SymbolispG 
ID6982594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp3963288 
End bp3964538 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content63% 
IMG OID643398553 
Product4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase 
Protein accessionYP_002283319 
Protein GI209551402 
COG category[I] Lipid transport and metabolism 
COG ID[COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 
TIGRFAM ID[TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCCAA CTGCCGATTT TGATCCGAAA CCGCGCCGCG CGTCCGTTGC CGTCGATGTC 
GGCGGCGTCA TCGTCGGCGG CGGGGCGCCG GTGGTCGTGC AATCGATGAC GAACACTGAC
ACGGCCGATA TCGATTCGAC CGTCGCCCAG GTCGCCGCTC TCCACCGGGC GGGCTCGGAG
CTGGTACGCA TCACCGTCGA CCGTGACGAG AGTGCGGCCG CCGTGCCCAA GATCCGCGAG
CGGCTGTTGC GGCTCGGCAT GGACGTGCCA TTGATCGGCG ACTTCCATTA TGTCGGCCAC
AAACTGCTTG CCGATCACCC TGATTGTGCC GCAGCGCTCG CGAAATACCG CATCAATCCC
GGCAATGTCG GCTTCAAGGA CAAGAAGGAC AAGCAGTTCG CCGAGATCAT CGAGATGGCG
ATCCGCTACG ACAAGCCGGT GCGCGTCGGC GTCAACTGGG GTTCGCTCGA TCAGGATCTC
TTGACGGCGC TGATGGATGA GAATGCTAGA GCCGGTTCGC CGCTTTCGGC CCGGCAGGTA
ACACGCGAGG CGATCGTGCA ATCGGCGCTC CTTTCGGCAG CCCTTGCCGA AGAGATCGGC
CTGCCGCGCA ACCGCATCAT CCTGTCGGCC AAGGTCAGCC AGGTCCAGGA CCTGATCGCC
GTCAATTCCA TGCTTGCCGA ACGCTCCAAT CATGCGCTTC ATCTCGGCCT GACCGAAGCC
GGCATGGGCA CCAAGGGCAT CGTCGCCTCA TCGGCGGCGA TGGGTTTCGT GCTCCAGCAC
GGCATCGGCG ATACGATCCG CGTATCGCTG ACGCCGGAGC CGAACGGCGA CCGCACGCGC
GAAGTTCAGG TAGCGCAGGA AATCCTGCAG GTCATGGGCT TCCGCCAGTT CATTCCTGTC
GTTGCCGCCT GTCCTGGCTG CGGACGCACG ACGTCGACAG TGTTCCAGGA GCTTGCCCAG
AACATCCAGA ACGACATCCG CAAGAACATG CCGGTCTGGC GCGAGAAATA TCCGGGCGTC
GAGGCGCTGA ATGTTGCCGT CATGGGCTGC ATCGTCAACG GACCGGGCGA AAGCAAACAT
GCCGATATCG GCATTTCGCT TCCCGGCACC GGCGAGACGC CGGCAGCCCC GGTCTTCATC
GACGGGAAGA AGGCGCTGAC ATTGCGCGGT CCCAATATCG CTGCCGACTT CGAGGCGCTC
GTCGTCGACT ATATCGAGAA GCGTTTCGGC CAGCGGACGG CGGCGGAATG A
 
Protein sequence
MSPTADFDPK PRRASVAVDV GGVIVGGGAP VVVQSMTNTD TADIDSTVAQ VAALHRAGSE 
LVRITVDRDE SAAAVPKIRE RLLRLGMDVP LIGDFHYVGH KLLADHPDCA AALAKYRINP
GNVGFKDKKD KQFAEIIEMA IRYDKPVRVG VNWGSLDQDL LTALMDENAR AGSPLSARQV
TREAIVQSAL LSAALAEEIG LPRNRIILSA KVSQVQDLIA VNSMLAERSN HALHLGLTEA
GMGTKGIVAS SAAMGFVLQH GIGDTIRVSL TPEPNGDRTR EVQVAQEILQ VMGFRQFIPV
VAACPGCGRT TSTVFQELAQ NIQNDIRKNM PVWREKYPGV EALNVAVMGC IVNGPGESKH
ADIGISLPGT GETPAAPVFI DGKKALTLRG PNIAADFEAL VVDYIEKRFG QRTAAE