Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0132 |
Symbol | ispG |
ID | 4710655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 150784 |
End bp | 152025 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639854590 |
Product | 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase |
Protein accession | YP_001001728 |
Protein GI | 121996941 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis |
TIGRFAM ID | [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGCAG AAACCCGTCC CATTGAAGCC GAGCCGTCGC CCCGGCGCCA CAGCCGAACA GTCCACATCG GTGGCCTGCG TATGGGCGGC GACGCCCCGA TCGTGGTCCA GTCGATGACC GACACCGACA CCGCCGACGA GGTGGCCACT GCGGTCCAGG CCGCGGATCT GGCTCGCGCC GGATCCGAAC TGGTCCGGGT GACCATCAAC AACGAGGAAG CCGCGGCGGC CGTGCCGCGC ATCCGCGAGC GGCTAGCCCG GATGGGGGTC GAGGTGCCCA TCGTCGGCGA CTTCCACTTC AACGGCCACA AGCTGCTGCG TCGCCACCCC GAGTGCGCCG AGGCCCTCGC CAAGATGCGC ATCAACCCGG GCAACGTCGG CAAGGGCAGC CGGCGCGATC CGCAGTTCGC GGAGATGATC GAGATCGCCT GCACCTACGA CCTTCCGGTA CGCATTGGCG TCAACTGGGG AAGCCTGGAC GACAGCGTGC TCACCCGGCT GATGGACGCC AACGCCGCAC GCCAGCAGCC GCTGCCGCCG GAGACCGTCA TGCGCGACGC GGTGGTCACC TCGGCGGTGG AGAGTGCGCA GCGCGCCGAG GAACTCGGGT TGCCGGGTGA TCGCATCGTG CTCTCCTGCA AGATGAGCGG CGTCCAGGAT CTGGTGGCGG TCTATCGCGA CCTGGCCCGC CGCTGTGACT ACCCCCTGCA CCTGGGCCTG ACCGAGGCCG GCATGGGGGT CCCTGGTGTG GTCGCCTCCT CCGCGGCCCT GGCCATCCTG TTGCAGGAGG GGATCGGCGA CACCATCCGG GTCTCGCTCA CACCGGATCC CGGCGGAGCC CGCACCCGGG AGGTCGAGGT TGCCCAGCAG GTCCTGCAGA GCATGGGCCT GCGCGATTTC ACCCCGCGGG TCACCGCCTG CCCGGGCTGC GGGCGCACCT CCAGCGACTT CTTCCAGCAC CTGGCCGAGC ACATCCGCAA CCATATCGAC AAGCGCATGC CGGAATGGCG CGAGCGCTAC CCCGGCGTCG AGGGGCTGCG GATCGCCGTC ATGGGGTGCG TAGTCAACGG CCCCGGCGAG AGCCGCCACG CCGATATCGG CATCAGCCTG CCCGGCGCCG GTGAGCAGCC CTCGGCACCG GTGTTCATCG ACGGCGAGCG CTCGGTCACC CTCAAGGGCG ATCAGATCGC CGAAGAGTTC GAGCAGATCG TCGAGTCCTA CGTGCAACGG CGGTTCGGCT AG
|
Protein sequence | MDAETRPIEA EPSPRRHSRT VHIGGLRMGG DAPIVVQSMT DTDTADEVAT AVQAADLARA GSELVRVTIN NEEAAAAVPR IRERLARMGV EVPIVGDFHF NGHKLLRRHP ECAEALAKMR INPGNVGKGS RRDPQFAEMI EIACTYDLPV RIGVNWGSLD DSVLTRLMDA NAARQQPLPP ETVMRDAVVT SAVESAQRAE ELGLPGDRIV LSCKMSGVQD LVAVYRDLAR RCDYPLHLGL TEAGMGVPGV VASSAALAIL LQEGIGDTIR VSLTPDPGGA RTREVEVAQQ VLQSMGLRDF TPRVTACPGC GRTSSDFFQH LAEHIRNHID KRMPEWRERY PGVEGLRIAV MGCVVNGPGE SRHADIGISL PGAGEQPSAP VFIDGERSVT LKGDQIAEEF EQIVESYVQR RFG
|
| |