Gene Information |
Locus tag | BTH_I1629 |
Symbol | gph-1 |
ID | 3848352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | - |
Start bp | 1835171 |
End bp | 1835896 |
Gene Length | 726 bp |
Protein Length | 241 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637841298 |
Product | phosphoglycolate phosphatase |
Protein accession | YP_442166 |
Protein GI | 83718948 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01449] 2-phosphoglycolate phosphatase, prokaryotic; [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTCTT CGTCGCCCTC CTTCGCCGCC TCGCAGTCCG GCGCGCCGCG CCTCGAAGCG TGCGAGGCCG TGCTGTTCGA TCTCGACGGC ACGCTCGCCG ATACGGCGCC CGACCTCGCG GCCGCGGTCA ACAAGATGCA GCGCTCGCGC GGGGCCGCCC CAACGCCCCT CGATGCGCTG CGCCCGCTCG CGTCGGCCGG CGCCCGGGGG CTCATCGGCG GCGCGTTCGG CATCGTGCCC GCGGATGCGG AATTCGATGC GCTGCGCGAC GAATTCCTCG CCAACTACGC GACCGACCTG TGTGTGCACA CGACGCTCTT TCCGGGCATC GGCGCGCTGC TGGACGACCT GGACGCGCGC GGCGTGCGCT GGGGCATCGT GACCAACAAG GCCGCGCGCT TCACCGACCC GCTCGTCGCA CTGCTCGGCC TCGCGGCGCG CGCGGCATGC GTGGTCAGCG GCGACACGGC GTCGCACCCG AAACCGCATC CGGCCCCGCT GCTGCATGCC GCGCAGAGCC TGTCGCTCGC GCCCGAGCGG ATCGTGTATG TCGGCGACGA TTTGCGCGAC ATCCAGGCGG GCAGCGCCGC CGGCATGCCG ACGGTTGCGG CCGCATACGG CTATTGCGGC GACGGCGTCG CGCCCGCCGA TTGGCAGGCG CAGCATCTCG TCGAAACGAC GGACGACCTG CAGCGACTAT TGCGCGTGTT GCGCTATAAT GATTGA
|
Protein sequence | MSSSSPSFAA SQSGAPRLEA CEAVLFDLDG TLADTAPDLA AAVNKMQRSR GAAPTPLDAL RPLASAGARG LIGGAFGIVP ADAEFDALRD EFLANYATDL CVHTTLFPGI GALLDDLDAR GVRWGIVTNK AARFTDPLVA LLGLAARAAC VVSGDTASHP KPHPAPLLHA AQSLSLAPER IVYVGDDLRD IQAGSAAGMP TVAAAYGYCG DGVAPADWQA QHLVETTDDL QRLLRVLRYN D
|
| |
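The length fields in this record can be cross-checked against the coordinates with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the gene sequence ends with a stop codon (the sequence above ends in TGA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Values taken from the Gene Information table above.
length = gene_length(1835171, 1835896)
print(length)                  # 726, matching the Gene Length field
print(protein_length(length))  # 241, matching the Protein Length field

# gc_fraction accepts the Gene sequence field exactly as printed,
# spaces included; only the first two blocks are shown here.
print(gc_fraction("ATGAGCTCTT CGTCGCCCTC"))
```

Passing the full Gene sequence field to `gc_fraction` gives a value that can be compared with the GC content field.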