Gene Information |
Locus tag | M446_5049 |
Symbol | ispG |
ID | 6134777 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 5530899 |
End bp | 5532185 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641645185 |
Product | 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase |
Protein accession | YP_001771810 |
Protein GI | 170743155 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis |
TIGRFAM ID | [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase |
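
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported gene length includes the terminal stop codon; both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(5530899, 5532185)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1287 428
```

Both results agree with the Gene Length (1287 bp) and Protein Length (428 aa) fields.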
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00821783 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAACCCA TGGAAGCTTC CGCCGCGCCC GAGATCGCAG GCCCGGCGCC CCGGCACGGC ACCGTCGGCG TGCGGATCGG CGAGGGCGAG GGCGCCGTCA CGATCGGCGG CGGCGCGCCC ATCGTCGTCC AGTCGATGAC GAACACCGAC ACGGCGGACA TCGACGCCAC CGTGGCGCAG GTCGCCGCGC TCGCCCGCGC CGGCTCCGAG ATCGTGCGCA TCACCGTCGA CCGCGACGAG GCCGCCGCCG CGGTGCCGAA GATCCGCGAG CGCCTGGACC GCATCGGCGT CCACGTGCCC CTCGTCGGCG ACTTCCACTA CATCGGCCAC AAGCTCCTCT CCGACCACCC GGCCTGCGCC GAGGCGCTCG CCAAGTACCG GATCAATCCG GGCAACGTCG GCTTCAAGGA GAAGAAGGAC CTCCAGTTCT CGACCATCGT CGAGATGGCG GCCCGGTACG GCAAGGCCGT GCGCATCGGC GCGAACTGGG GCTCCCTCGA CGAGGCGCTC CTCACCCGCC TGATGGACGA GAACGCCCGC AGCGAGCGAC CGGTCGATGC CCGCGCGGTG ATGCGCGAGG CCATGGTGCA GTCCGCCCTC CTCTCCGCCG ACCGGGCGGT CGAGATCGGC CTGCCGAAGG ACCGCATCGT GCTCTCCGCC AAGGTCTCGG CGGTGCAGGA CCTGATCGCG GTCTACCGCG AGGTCGCCCG CCGCTCGGAC TACGCGATCC ACCTCGGCCT GACCGAGGCC GGCATGGGCA CCAAGGGCAT CGTCGCGGCC TCGGCCGCCA TGGGCGTGCT CCTGCAGGAG GGCATCGGCG ACACGATCCG CTACTCGCTC ACCCCGGAGC CCGGCGGCGA CCGCACCGTG GAGGTGAAGG CCGCCCAGGA ACTGCTCCAG ACGATGGGCT TCCGCACCTT CGTGCCCCTC GTCGCCGCCT GCCCGGGCTG CGGGCGCACC ACCTCGACGG TGTTCCAGGA ACTCGCCCGC GACATCCAGA ACTGGATCGC GACCTCGATG CCGGAATGGC GCAAGACCTA TCCGGGCGTC GAGACCCTGA ACGTGGCGGT GATGGGCTGC ATCGTGAACG GCCCGGGCGA ATCGAAGCAC GCCGACATCG GCATCTCGTT GCCGGGAACC GGCGAGAGCC CCTCCGCCCC CGTCTTCATC GACGGCAAGA AGGCGATGAC CTTGCGCGGG GCCACCCTTG CCAAGGATTT CGAAACCATC GTCATCGACT ACATTGAGCG CCGATTCGGC CAGGGCCGGC GGAGCGCGGC GGAGTAA
|
Protein sequence | MQPMEASAAP EIAGPAPRHG TVGVRIGEGE GAVTIGGGAP IVVQSMTNTD TADIDATVAQ VAALARAGSE IVRITVDRDE AAAAVPKIRE RLDRIGVHVP LVGDFHYIGH KLLSDHPACA EALAKYRINP GNVGFKEKKD LQFSTIVEMA ARYGKAVRIG ANWGSLDEAL LTRLMDENAR SERPVDARAV MREAMVQSAL LSADRAVEIG LPKDRIVLSA KVSAVQDLIA VYREVARRSD YAIHLGLTEA GMGTKGIVAA SAAMGVLLQE GIGDTIRYSL TPEPGGDRTV EVKAAQELLQ TMGFRTFVPL VAACPGCGRT TSTVFQELAR DIQNWIATSM PEWRKTYPGV ETLNVAVMGC IVNGPGESKH ADIGISLPGT GESPSAPVFI DGKKAMTLRG ATLAKDFETI VIDYIERRFG QGRRSAAE
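
The relationship between the two sequence fields can be sketched in a few lines: the gene sequence is read codon by codon and translated until the first stop codon. Translation table 11 uses the same codon-to-residue assignments as the standard code (it differs in which start codons are permitted), so a standard table is used here. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the sketch assumes the sequence is given 5' to 3' on the coding strand, in 10-base blocks separated by spaces as shown above.

```python
# Standard codon assignments, in TCAG order for each codon position.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES),
    AMINO,
))


def clean(seq: str) -> str:
    """Drop the whitespace between blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
prefix = "ATGCAACCCA TGGAAGCTTC CGCCGCGCCC"
print(translate(prefix))  # MQPMEASAAP
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.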