Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_3555 |
Symbol | gph |
ID | 4882005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 3487987 |
End bp | 3488724 |
Gene Length | 738 bp |
Protein Length | 245 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640129483 |
Product | phosphoglycolate phosphatase |
Protein accession | YP_001060560 |
Protein GI | 126441118 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01449] 2-phosphoglycolate phosphatase, prokaryotic [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.221388 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCACGA TGACGACGTC GCCTCTCAAT GGCGCGCCGC GCATCGAAGC GGCGCTCATC GATCTCGACG GCACGATGGT CGATACCGCA GACGATTTCA CGGCCGGCCT GAACGCGATG CTCGCGCAGC TCGATGCCGA GGAGACGACG CGCGAGGAAG TGATGCGCTA TGTCGGCAAG GGTTCGGAGA ACCTGATCCA GTGCGTGCTG ACGCCGCGCT TTTCCGCAGA CGACGCGAAC GCGCGCTTCG ACGAGGCGCT CGCGCTCTAT CAGGCCGAAT ACGCGAAGAT CAACGGCCGC CACACGCGGC TCTACCCGGA CGTCGACGCA GGCTTGCGGG CGATGCGCGA AGCGGGCGTC AAGCTCGCAT GCGTGACGAA CAAGCCGTGC CGGTTCGCGG TCGAGCTGCT CGCGCAGTAC GGCCTGTCCG GCCATTTCTC CGCGGTGTTC GGCGGCGACA GCGTGCCGCG CAAGAAGCCC GATCCGGCGC CGATGCTCGC CGCATGCGCC GCGCTCGGCG TCGCGCCGCG CGCGGCGGTG GCGATCGGCG ATTCGGAGAA CGACGCGCTC GCGGGCCGCG CGGCCGGGAT GGCGACGCTG ACGGTGCCGT ACGGCTACAA CCACGGCAAC GCTATACAAA CGATCGAATC GGATGGTATA GTCGATTCGC TTCTCGCCGC CGCACGGCTC ATCGCCGCGC ACAATTCGGC AGGATCAGCG GCAAGATCAG CCATCTGA
|
Protein sequence | MSTMTTSPLN GAPRIEAALI DLDGTMVDTA DDFTAGLNAM LAQLDAEETT REEVMRYVGK GSENLIQCVL TPRFSADDAN ARFDEALALY QAEYAKINGR HTRLYPDVDA GLRAMREAGV KLACVTNKPC RFAVELLAQY GLSGHFSAVF GGDSVPRKKP DPAPMLAACA ALGVAPRAAV AIGDSENDAL AGRAAGMATL TVPYGYNHGN AIQTIESDGI VDSLLAAARL IAAHNSAGSA ARSAI
|
| |