Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0957 |
Symbol | ispG |
ID | 5898412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1005773 |
End bp | 1006912 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641561439 |
Product | 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase |
Protein accession | YP_001682585 |
Protein GI | 167644922 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis |
TIGRFAM ID | [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.838599 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.683477 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACCG ACCACACCCA CCTCCGTCCG TGGCGCTCGA TCGAGCGCCG CAAGTCGCGC AAGATCCGCG TCGGCAATGT CGAGGTGGGC GGCGACGCGC CGATCACCGT CCAGTCGATG ACCAACACCC TGACCAGCGA CGCCGCCGCG ACGCTGGAGC AGATCCGCCA ACTGGAAGAG GCCGGCGCCG ACATTGTCCG CGTTTCGTGC CCCGACACCG ACTCGACGGC GGCCTTCAAG ACCATCGCCC GCGAGAGCCG GGTGCCGCTC GTGGCCGACA TCCACTTCCA CTACAAGCGC GGCATCGAGG CGGCGCAAAA CGGCGCGGCC TGCCTGCGGA TCAATCCGGG CAATATCGGC AGCCCCGACC GCGTGCGCGA CGTCATCCAG GCGGCCCGCG ACCACGGCTG CTCGATGCGG ATCGGCGTCA ACGCCGGCTC GCTGGAGCGC GAACTGCTGG AAAAGTACGG CGAGCCTTGC CCCGACGCGA TGGTCGAGAG CGCCCTCAAC CACGCCCGCA TCCTGCAGGA CCACGACTTC CACGAGTTCA AGATCAGCGT GAAGGCGTCC GACCCGTTCA TGACGGTGGC GGCCTATCAC CAGCTGTCCG AGCGCATCGA CTGCCCGCTG CACCTGGGGG TCACCGAGGC CGGCGCCCTG CGGACCGGCA CGGTGAAGTC GTCGATCGGC ATCGGCTCGA TGCTGTGGGC CGGCATCGGC GACACCATCC GGGTGTCCCT GGCCGCCGAC CCGGTCGAGG AGATCAAGGT CGGCTTCGAT ATCCTCAAGT CGCTGGGCCT GCGCCATCGC GGCGTCAACA TCATCGCCTG CCCGTCCTGC GCCCGTCAGG GCTTCAACGT CATCAAGACG GTGGAGGCCT TGGAGCAGCG GCTGGCCCAC ATCTCGCAAC CGATGTCGCT GTCGATCATC GGCTGCGTGG TCAACGGTCC CGGCGAGGCG CTGATGACCG ACCTGGGTTT CACCGGCGGC GGGGCCGGGT CGGGCATGGT CTACATGGCC GGCAAGCCCG ACCACAAGCA GTCCAACGAC GGCATGATCG ACCACATCGT CGAGCTGGTG GAACAGCGCG CGGCCCTGCT GAAGGCCGCG GCCGACGCCG AGGCGATCGC GGCGGAGTAG
|
Protein sequence | MATDHTHLRP WRSIERRKSR KIRVGNVEVG GDAPITVQSM TNTLTSDAAA TLEQIRQLEE AGADIVRVSC PDTDSTAAFK TIARESRVPL VADIHFHYKR GIEAAQNGAA CLRINPGNIG SPDRVRDVIQ AARDHGCSMR IGVNAGSLER ELLEKYGEPC PDAMVESALN HARILQDHDF HEFKISVKAS DPFMTVAAYH QLSERIDCPL HLGVTEAGAL RTGTVKSSIG IGSMLWAGIG DTIRVSLAAD PVEEIKVGFD ILKSLGLRHR GVNIIACPSC ARQGFNVIKT VEALEQRLAH ISQPMSLSII GCVVNGPGEA LMTDLGFTGG GAGSGMVYMA GKPDHKQSND GMIDHIVELV EQRAALLKAA ADAEAIAAE
|
| |