Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3387 |
Symbol | |
ID | 3911189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 3871733 |
End bp | 3872410 |
Gene Length | 678 bp |
Protein Length | 225 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637885290 |
Product | phosphoglycolate phosphatase |
Protein accession | YP_486994 |
Protein GI | 86750498 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01449] 2-phosphoglycolate phosphatase, prokaryotic [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0781845 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACAT CCCGCACCGT TGTCTTCGAC CTGGACGGCA CGCTGATCGA CACCGCGCCG GACCTGATCA ATGCCCTGAA TTACATCCTC GTCCGCGAAG GCATGCCGGC GGTCCCGCTC GCAAAGGCCC GCAATATGAT CGGGCAAGGC GCGCGGCGAT TGCTGGAGCG CGGCCTCGAA CTCGACGGCC GCGTCATCAG CCCGGACGAC GTCAACCGCC TCGCGGTCGA TTTCATCGAC TATTACGCCG CCAATATCGC CGTCGAGTCG CGGCCGTTCG AGGGACTGGA ACAGACGCTC GATGCGCTCG CCGGCCAGGG CTACCAGTTC GCGGTGTGCA CCAACAAGCT GGAATGGCTG TCGAAGCTGC TGCTCGACCA GCTCGGCCTG AGCTCCCGCT TCGCCGCGAT CTGCGGCGCC GACACCTTCG GCGTCGCCAA ACCCGACCCG GCGATCCTGC GCGAGACCAT TGCCAAGGCC GGCGGCGCAC TGGCCTCGGC GGTGATGGTC GGCGACGCCG GACCGGATGT CGGCGTCGCG CGGCGCGCCG GCATCCCGGT GATCGGCGTC GAGTTCGGCT ACACCGAGGT TCCGATCGCC GAGCTCAAGC CGGACCGGCT GGTCGGCCAT ATGCGCGACT TGCCGAACGC CGTCACGCAG CTGCTGCCGC CGGCCTGA
|
Protein sequence | MSTSRTVVFD LDGTLIDTAP DLINALNYIL VREGMPAVPL AKARNMIGQG ARRLLERGLE LDGRVISPDD VNRLAVDFID YYAANIAVES RPFEGLEQTL DALAGQGYQF AVCTNKLEWL SKLLLDQLGL SSRFAAICGA DTFGVAKPDP AILRETIAKA GGALASAVMV GDAGPDVGVA RRAGIPVIGV EFGYTEVPIA ELKPDRLVGH MRDLPNAVTQ LLPPA
|
| |