Gene Mpe_A2787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2787 
SymbolcbbZ 
ID4785037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2967884 
End bp2968576 
Gene Length693 bp 
Protein Length230 aa 
Translation table11 
GC content72% 
IMG OID640091358 
Productphosphoglycolate phosphatase 
Protein accessionYP_001021976 
Protein GI124267972 
COG category[R] General function prediction only 
COG ID[COG0546] Predicted phosphatases 
TIGRFAM ID[TIGR01449] 2-phosphoglycolate phosphatase, prokaryotic
[TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
[TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E
[TIGR01662] HAD-superfamily hydrolase, subfamily IIIA 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.485008 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAACT TCGACGCCAT CGCCTTCGAC CTCGACGGCA CGCTGGTCGA CAGCGCACCC 
GACATTGCGC ACGCGCTGAA TGCCGGCCTG GACGAGGTGC GCCTGCAGCG CTTCGATCTC
GCCCGCGTGC GCGGCTGGAT CGGCGACGGC CCGGACGCGC TCATCGCGCG AGCGATGAAT
GCGCAGGACC TCGACGCGGT CAGCACCGCG CTGCTGACGC CGCGCGTGCG CGCCGCATTC
GATCGCGCCA CGCTCGCGGC GCCGCTGCAG CATGGCCAGG TCTACGACGG CATCGCCGAG
CTGCTGGCGC AGCTGAAGCC GCACCGACCA CTCGCCGTCG TGACCAACAA GCCGACCCGG
CTCGCCCGTG CCGTGCTGGA GGCTGCCGGG CTGCTGGACT GCTTCGCCAC CGTGCACGGC
GCCGACACGA AGGCGCAACG CAAGCCTTCG CCACTGCTGC TGGAAAACGC GGCAGACCAG
CTCGGCGTCT CGACGGGTCG CCTGCTGATG GTGGGCGACA GCATCCTCGA TCTGCGAGCG
GCCCACGCGG CGGGCGCCCA GGCCGCGCTG GTGCAGTGGG GCTACGGCCA CCTGACCGTG
CCCGAGACCC TCGACGCCTG GCGCGTGGCG ACGCCGGCGC AGCTGGCGGC CCGACTGCAC
CTGCAGTCCG CCCCCTGCGC CCAGAACACC TAA
 
Protein sequence
MMNFDAIAFD LDGTLVDSAP DIAHALNAGL DEVRLQRFDL ARVRGWIGDG PDALIARAMN 
AQDLDAVSTA LLTPRVRAAF DRATLAAPLQ HGQVYDGIAE LLAQLKPHRP LAVVTNKPTR
LARAVLEAAG LLDCFATVHG ADTKAQRKPS PLLLENAADQ LGVSTGRLLM VGDSILDLRA
AHAAGAQAAL VQWGYGHLTV PETLDAWRVA TPAQLAARLH LQSAPCAQNT