Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4159 |
Symbol | ispG |
ID | 8014951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4243912 |
End bp | 4245162 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644826729 |
Product | 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase |
Protein accession | YP_002977939 |
Protein GI | 241206843 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis |
TIGRFAM ID | [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.834774 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGTCAG CCGCCGATTT TGATCCGAAA CCGCGCCGCG CTTCCGTTGC CGTCGATGTC GGCGGCGTCA TCGTCGGCGG CGGGGCGCCG GTCGTCGTGC AGTCCATGAC GAACACCGAT ACGGCCGATA TCGATTCCAC CGTCGCGCAG GTCGCCGCTC TCCACCGGGC GGGCTCGGAA CTGGTGCGCA TTACCGTCGA CCGCGACGAG AGTGCAGCCG CCGTGCCGAA GATCCGCGAG CGGCTTCTGC GCCTCGGCAT GGACGTGCCC TTGATCGGCG ACTTCCACTA TATCGGCCAC AAGCTGCTCG CCGATCATCC GGATTGCGCC GAAGCGCTGG CGAAATACCG CATCAACCCC GGCAATGTCG GCTTCAAGGA CAAGAAGGAC AAGCAGTTCG CCGAGATCAT CGAGATGGCG ATCCGCTATG ACAAGCCGGT GCGCATCGGC GTCAACTGGG GCTCGCTCGA TCAGGATCTG CTGACGGCGC TGATGGACCG GAACGCCGAA GCCGGATCGC CGCTTTCGGC CCGGCAGGTG ACGCGCGAGG CGATCGTGCA GTCGGCGCTG CTTTCGGCAG CCCTTGCCGA AGAGATCGGC CTGCCGCGCA ACCGCATCAT CCTGTCGGCC AAGGTCAGCC AGGTGCAGGA CCTGATCGCC GTCAATTCCA TGCTTGCCGA ACGCTCCAAT CATGCGCTGC ATCTCGGCCT GACCGAAGCC GGCATGGGCA CCAAGGGCAT CGTCGCCTCG TCTGCGGCGA TGGGCTTCGT GCTGCAGCAC GGCATCGGCG ATACGATCCG CGTGTCGCTG ACGCCGGAGC CGAACGGCGA CCGCACGCGC GAAGTCCAGG TGGCGCAGGA AATCCTGCAG GTCATGGGCT TTCGCCAGTT CATACCCGTC GTTGCGGCCT GTCCGGGCTG TGGACGCACG ACGTCGACGG TGTTCCAGGA ACTTGCCCAG AATATCCAGA ACGACATCCG CAAGAACATG CCTGTCTGGC GCGAGAAATA TCCTGGGGTC GAGGCGCTGA ACGTCGCCGT CATGGGCTGC ATCGTCAACG GGCCGGGCGA AAGCAAACAT GCCGATATCG GCATTTCGCT TCCGGGCACT GGCGAAACGC CGGCCGCCCC CGTCTTCATC GACGGCCGGA AGGCGCTGAC TCTGCGCGGT GCCAATATCG CCGCCGATTT CGAGGCGCTG GTTGTCGACT ATATCGAGAA GCGTTTCGGC CAACGGACGG CGGCGGAATG A
|
Protein sequence | MLSAADFDPK PRRASVAVDV GGVIVGGGAP VVVQSMTNTD TADIDSTVAQ VAALHRAGSE LVRITVDRDE SAAAVPKIRE RLLRLGMDVP LIGDFHYIGH KLLADHPDCA EALAKYRINP GNVGFKDKKD KQFAEIIEMA IRYDKPVRIG VNWGSLDQDL LTALMDRNAE AGSPLSARQV TREAIVQSAL LSAALAEEIG LPRNRIILSA KVSQVQDLIA VNSMLAERSN HALHLGLTEA GMGTKGIVAS SAAMGFVLQH GIGDTIRVSL TPEPNGDRTR EVQVAQEILQ VMGFRQFIPV VAACPGCGRT TSTVFQELAQ NIQNDIRKNM PVWREKYPGV EALNVAVMGC IVNGPGESKH ADIGISLPGT GETPAAPVFI DGRKALTLRG ANIAADFEAL VVDYIEKRFG QRTAAE
|
| |