Gene Information |
Locus tag | Rru_A3031 |
Symbol | |
ID | 3836476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 3490378 |
End bp | 3491148 |
Gene Length | 771 bp |
Protein Length | 256 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637827145 |
Product | 2-phosphoglycolate phosphatase |
Protein accession | YP_428113 |
Protein GI | 83594361 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01449] 2-phosphoglycolate phosphatase, prokaryotic; [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
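The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information rows above.
start_bp = 3490378
end_bp = 3491148

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is an unspliced run of complete codons whose last
# codon is a stop, which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 771
print(protein_length)  # 256
```

Under those assumptions the computed values agree with the Gene Length (771 bp) and Protein Length (256 aa) rows.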
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGCCT CCGTCCATCC CCCCCCGCCG CCCCGGCCCT TCCCCTTGCC CAAGGCCGTG ATCTTCGATC TCGACGGCAC CCTGGTCCAC AGCCTGCCCG GGCTGACCGA CGCCCTGAAC AAGACCCTGG CCGAGGATGA TCTGGCGCCG CTGGACGAAG CGGCGGTCAA ACGCATGGTC GGCGAAGGGG CCGGATTGTT GGTGGCGCGC GCCTTCGCCG CCTATGGCCT TGGCCGGGCC GACGATGCCG ATGACACGGC AACGCAAGCC CGGCTCGCGC GCTTTCTCGC CCATTACGCC CCCGACCCGC TGGCCGGCGC CAGCGTCTAT CCTGGCGCCT TGGCCCTGCT CGGCGCCCTG GCGGCGCGCG GCATCCGCCT GGGGGTTTGC ACCAACAAGC CCGAAGGCCC GGCCCGCGCC CTGCTGGAAG GCCTGGGCCT CGCCGATCCG ATCATGGATG TGGTCGGCGG CGACACCTTG GCCCAGCGCA AACCCGACCC GGCGCCGCTG CGCGCCCTGC TCGACAGCCT GGGCGTGGAG GCCGATCAGG CGCTGATGGT CGGTGACAGC CCCACCGATG TCGCCACCGC CAAGGCGGCG GGCGTGCCGG TGGTGGTGAT GTCCTATGGC TATAGCCGCG AGCCGGTGGC CAGCCTGGGC GCCCTCGCCG TCTTCGATGA TTTCGCCAGC CTGGGTGATT GGCTGGGATT TCCCCAGCCT GGGGGCGATC GACTGGGGGC AACTCCGGCT TTGAGCGAGA ATCCGGCTTG A
|
Protein sequence | MSASVHPPPP PRPFPLPKAV IFDLDGTLVH SLPGLTDALN KTLAEDDLAP LDEAAVKRMV GEGAGLLVAR AFAAYGLGRA DDADDTATQA RLARFLAHYA PDPLAGASVY PGALALLGAL AARGIRLGVC TNKPEGPARA LLEGLGLADP IMDVVGGDTL AQRKPDPAPL RALLDSLGVE ADQALMVGDS PTDVATAKAA GVPVVVMSYG YSREPVASLG ALAVFDDFAS LGDWLGFPQP GGDRLGATPA LSENPA
|
| |
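The sequences above are displayed in space-separated blocks of 10 characters, so the spaces have to be removed before any computation. As a minimal sketch, the hypothetical helpers below (`clean_sequence` and `gc_fraction` are names chosen for illustration, not part of any database tool) strip the grouping and compute the G+C fraction, the quantity reported in the GC content row.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for 10-character grouping and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First displayed block of the gene sequence above: 6 of its 10 bases are G or C.
first_block = "ATGTCCGCCT"
print(gc_fraction(first_block))  # 0.6
```

Passing the full Gene sequence string, spaces included, to `gc_fraction` gives the value for the whole gene, which can then be compared with the GC content row.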