Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMASAVP1_A2583 |
Symbol | gph-1 |
ID | 4680577 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei SAVP1 |
Kingdom | Bacteria |
Replicon accession | NC_008785 |
Strand | + |
Start bp | 2566116 |
End bp | 2566841 |
Gene Length | 726 bp |
Protein Length | 241 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639846844 |
Product | phosphoglycolate phosphatase |
Protein accession | YP_993885 |
Protein GI | 121600878 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01449] 2-phosphoglycolate phosphatase, prokaryotic [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCCTT CGTCGCCCTC CTTCGCCGCC CCGCTGTCCG ACGCGCCGCG CCTCGACGCA TGCGAGGCCG TGCTGTTCGA TCTCGACGGC ACGCTCGCCG ACACCGCGCC CGATCTCGCC GCCGCGGTCA ACAAGATGCA GCGCTCGCGC GGCATCGCAC AAACGCCGCT CGACGCGCTG CGCCCGCTCG CGTCGGCGGG CGCGCGCGGC CTGATCGGCG GCGCGTTCGG CATCGCGCCC GCGGACGCCG AATTCGACGC GCTGCGCGAC GAATTCCTCG CGAACTACGC GACGGATCTG TGCGTGCACA CGACGCTCTT TCCGGGCATC GGCGTGCTGC TCGACGACCT CGACGCGCGC GGCGTGCGCT GGGGCATCGT GACCAACAAG GCTGCGCGGT TCACCGATCC GCTCGTTGCG CTGCTCGGCC TCGCGGCGCG CGCGGCGTGC GTGGTCAGCG GTGACACGGC ATCGCACCCG AAGCCGCATC CGGCGCCGCT GCTGTACGCG GCCGACCGCC TCTCGCTCGC CCCCGAGCGG ATCGTGTACG TCGGCGACGA CCTTCGCGAC ATCCAGGCGG GCAGCGCGGC CGGCATGCCG ACGGTCGCCG CCGCGTACGG CTATTGCGGC GACGGCGCCG CCCCCGCCGA CTGGCGGGCG CAGCATCTCG TCGAGACGAC GGACGACCTG CAGCGGCTGC TGCGCGTGTT GCGCTATAAT GCTTGA
|
Protein sequence | MSPSSPSFAA PLSDAPRLDA CEAVLFDLDG TLADTAPDLA AAVNKMQRSR GIAQTPLDAL RPLASAGARG LIGGAFGIAP ADAEFDALRD EFLANYATDL CVHTTLFPGI GVLLDDLDAR GVRWGIVTNK AARFTDPLVA LLGLAARAAC VVSGDTASHP KPHPAPLLYA ADRLSLAPER IVYVGDDLRD IQAGSAAGMP TVAAAYGYCG DGAAPADWRA QHLVETTDDL QRLLRVLRYN A
|
| |