Gene Information |
Locus tag | Mext_2296 |
Symbol | |
ID | 5834595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 2546007 |
End bp | 2546708 |
Gene Length | 702 bp |
Protein Length | 233 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641368095 |
Product | phosphoglycolate phosphatase |
Protein accession | YP_001639762 |
Protein GI | 163851719 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01449] 2-phosphoglycolate phosphatase, prokaryotic; [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
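The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not counted in the protein length. The record does not state either convention, so both are labeled as assumptions in the code.

```python
# Values copied from the record above.
start_bp = 2546007
end_bp = 2546708
gene_length_bp = 702
protein_length_aa = 233

# Assumption: Start bp and End bp are 1-based, inclusive coordinates.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that adds no residue.
codons = span_bp // 3
residues = codons - 1

print(span_bp, codons, residues)
```

Under these assumptions the span is 702 bp, which is 234 codons, or 233 residues plus a stop codon, in agreement with the Gene Length and Protein Length fields.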
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.75758 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.0287022 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCGC ACAGCTCCCC GATCGTCGTG TTCGATCTCG ACGGAACCCT CGCCGAGACC GCGGGCGACC TCATCGGCAC CCTGAACGTC GTCCTCGCTC GGGACGGCCA AGCGCCGCTT CCCCTCGAAC AGGCCCGCGA CCTGCTCGGC GCGGGCGCCC GCGCCCTGAT CCAGCGTGGC TTCACGGTCG CGGGTGCGAG CCTGACGCCG GAGCGGCTGG AGACGCTGTT CCAGGATTTC CTCGTCTATT ACGGCGAGCA TCTCACCGAC AATTCCTACC TCTTCCCCGG CGTGGTCGAG GCGCTGGACC GGCTGGAGGC GGCGGGCTTC CGCCTCGCGA TCTGCACCAA CAAGGTCGAG TCGCACGCCG TGGCTTTGCT CGACGCGCTG GGCATCGGCC ACCGCTTCTG CACGATCGTC GGCAAGGAGA CCTTCGCCTT CTCGAAGCCG GACCCGCGCC ACATCACCGC GACCATCGAG CGCGCCGGGG GTGATCTCCA CCGCGCCGTG ATGGTGGGTG ATTCGAAGGC CGACGTCGCC GCGGCCAAGG CGGCCGGCAT CCCCGTCGTC GGCGTGACCT TCGGCTACAC GCCCGTGCCG ATGCGCGAGC TGGCGCCCGA CTGGGTCATC GAGCATTTCG ACGCCCTGCC CGACGCGGTC GACGCCCTCC TCGCCCGCGA GACCGTGAAG CCGGCCGCCT GA
|
Protein sequence | MNAHSSPIVV FDLDGTLAET AGDLIGTLNV VLARDGQAPL PLEQARDLLG AGARALIQRG FTVAGASLTP ERLETLFQDF LVYYGEHLTD NSYLFPGVVE ALDRLEAAGF RLAICTNKVE SHAVALLDAL GIGHRFCTIV GKETFAFSKP DPRHITATIE RAGGDLHRAV MVGDSKADVA AAKAAGIPVV GVTFGYTPVP MRELAPDWVI EHFDALPDAV DALLARETVK PAA
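The GC content and protein sequence can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is the standard one. This works because translation table 11 assigns the same amino acid to each codon as the standard code and differs only in which codons may act as start codons. The gene sequence is assumed to be listed 5' to 3' on the coding strand, which is consistent with it beginning with ATG and ending with TGA.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments (it differs only in start codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three and last two blocks of the gene sequence in the record.
print(translate("ATGAACGCGC ACAGCTCCCC GATCGTCGTG"))  # MNAHSSPIVV
print(translate("CCGGCCGCCT GA"))                     # PAA
```

The two printed fragments match the start and end of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.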