Gene Information |
Locus tag | Rmet_3177 |
Symbol | |
ID | 4040011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007973 |
Strand | + |
Start bp | 3449372 |
End bp | 3450112 |
Gene Length | 741 bp |
Protein Length | 246 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637978582 |
Product | phosphoglycolate phosphatase |
Protein accession | YP_585318 |
Protein GI | 94312108 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01449] 2-phosphoglycolate phosphatase, prokaryotic; [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCTCG AAATACTGCG CCGCACGGAC TGGCACGCCA TCCGGGCGGT GATCGTCGAC CTCGATGGCA CCATGGTCGA CACCGCTGGC GATTTTCACG CCGCCGTCAA CGCGATGCTG CTGGCCCTGG CGCACAAGCA TCCCAACCTG GGCCCGGTCG AGCCGATGTC CCAGGAGGAC ATTGTCAGCT TCGTCGGCAA GGGGTCGGAG AACCTGATCC GCCGCGTGCT GGACGCGCGT TTCTCGCCAC TGCACGCCAA CGGGCTGTTC GCTGATGCCT ACGCGCTGTA CGACCGCGAG TACGTGCGGA TCAACGGCCA GTTTTCGAAC GTCTACCCGA ATGTCCGCGA GGGCCTGACG GCGCTCAAGG CGATGGGCCT GCGCATGGCC TGCGTGACCA ACAAGCCGTG GAACTTCACC GAGCCGCTGC TGGCCCGTAC CGGCCTGGCC CAGTATTTCG ATCTGGTCTA CGGCGGCGAC GCGTTCGCCC TGCGCAAACC CGATCCGTAT CCGCTGCTCA AGGTGGCCGA GGCATTCCGG GTCGATCCGG AGGCCGTGCT GGCCATCGGC GACTCCGAAA ATGACGCGCG GGCGGCACGT GCGGCGGGCA TGGGAGTGCT CTTGATGCCA TATGGCTACA ACCATGGCAA TCCTGTACAA GACGTCGACG CCGATGGTAT AGTCGCCGAC ATTGCCCGCG TGGCTGCGCT TCTTGCCGCA CATCGGGCAA CACACCGCTA A
|
Protein sequence | MSLEILRRTD WHAIRAVIVD LDGTMVDTAG DFHAAVNAML LALAHKHPNL GPVEPMSQED IVSFVGKGSE NLIRRVLDAR FSPLHANGLF ADAYALYDRE YVRINGQFSN VYPNVREGLT ALKAMGLRMA CVTNKPWNFT EPLLARTGLA QYFDLVYGGD AFALRKPDPY PLLKVAEAFR VDPEAVLAIG DSENDARAAR AAGMGVLLMP YGYNHGNPVQ DVDADGIVAD IARVAALLAA HRATHR
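The length fields in this record follow from its coordinates, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names `gene_length`, `protein_length`, and `clean` are hypothetical helpers introduced for illustration, not part of any database tool.

```python
# Coordinates copied from the "Start bp" and "End bp" fields of this record.
START_BP = 3449372
END_BP = 3450112


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Amino-acid count, assuming one terminal stop codon that is not translated."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def clean(seq: str) -> str:
    """Strip the spaces that split a sequence into 10-character groups."""
    return "".join(seq.split())


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 741, matching "Gene Length"
    print(protein_length(length))  # 246, matching "Protein Length"
```

To check the sequences themselves, pass the full Gene sequence or Protein sequence field through `clean` and compare `len(...)` of the result with the corresponding length field.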