Gene BURPS668_2893 details


Gene Information

Locus tag: BURPS668_2893
Symbol: gph
ID: 4885375
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 2852350
End bp: 2853075
Gene Length: 726 bp
Protein Length: 241 aa
Translation table: 11
GC content: 73%
IMG OID: 640128821
Product: phosphoglycolate phosphatase
Protein accession: YP_001059912
Protein GI: 126440696
COG category: [R] General function prediction only
COG ID: [COG0546] Predicted phosphatases
TIGRFAM ID: [TIGR01449] 2-phosphoglycolate phosphatase, prokaryotic
    [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
    [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E
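
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. Both are assumptions about how this record was built, and the function names are hypothetical.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2852350, 2853075)
print(length)                  # 726, matching the Gene Length field
print(protein_length(length))  # 241, matching the Protein Length field
```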


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCCTT CGTCGCCCTC CTTCGCCGCC CCGCTGTCCG ACGCGCCGCG CCTCGACGCA 
TGCGAGGCCG TGCTGTTCGA TCTCGACGGC ACGCTCGCCG ACACCGCGCC CGATCTCGCC
GCCGCGGTCA ACAAGATGCA GCGCTCGCGC GGCATCGCAC AAACGCCGCT CGACGCGCTG
CGCCCGCTCG CGTCGGCGGG CGCGCGCGGC CTGATCGGCG GCGCGTTCGG CATCGCGCCC
GCGGACGCCG AATTCGACGC GCTGCGCGAC GAATTCCTCG CGAACTACGC GGCGGATCTG
TGCGTGCACA CGACGCTCTT TCCGGGCATC GGCGCGCTGC TCGACGACCT CGACGCGCGC
GGCGTGCGCT GGGGCATCGT GACCAACAAG GCTGCGCGGT TCACCGATCC GCTCGTTGCG
CTGCTCGGCC TCGCGGCGCG CGCGGCGTGC GTGGTCAGCG GCGACACGGC ATCGCACCCG
AAGCCGCATC CGGCGCCGCT GCTGTACGCG GCCGACCGCC TCTCGCTCGC CCCCGAGCGG
ATCGTGTACG TCGGCGACGA CCTTCGCGAC ATCCAGGCGG GCAGCGCGGC CGGCATGCCG
ACGGTCGCCG CCGCGTACGG CTATTGCGGC GACGGCGCCG CCCCCGCCGA CTGGCGGGCG
CAGCATCTCG TCGAGACGAC GGACGACCTG CAGCGGCTGC TGCGCGTGTT GCGCTATAAT
GCTTGA
 
Protein sequence
MSPSSPSFAA PLSDAPRLDA CEAVLFDLDG TLADTAPDLA AAVNKMQRSR GIAQTPLDAL 
RPLASAGARG LIGGAFGIAP ADAEFDALRD EFLANYAADL CVHTTLFPGI GALLDDLDAR
GVRWGIVTNK AARFTDPLVA LLGLAARAAC VVSGDTASHP KPHPAPLLYA ADRLSLAPER
IVYVGDDLRD IQAGSAAGMP TVAAAYGYCG DGAAPADWRA QHLVETTDDL QRLLRVLRYN
A
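
The gene and protein sequences can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the tool that produced this record. It assumes the codon assignments of NCBI translation table 11 are the same as those of the standard code for internal codons (the two tables differ in their permitted start codons), and it strips the spaces that separate the 10-character blocks. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the gene sequence shown above.
print(translate("ATGAGCCCTT CGTCGCCCTC CTTCGCCGCC"))  # MSPSSPSFAA
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence, or passing it to `gc_fraction` and comparing with the GC content field, gives a quick consistency check of the record.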