Gene Information |
Locus tag | Rpal_2193 |
Symbol | |
ID | 6409853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 2376296 |
End bp | 2376973 |
Gene Length | 678 bp |
Protein Length | 225 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642712077 |
Product | phosphoglycolate phosphatase |
Protein accession | YP_001991189 |
Protein GI | 192290584 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01449] 2-phosphoglycolate phosphatase, prokaryotic; [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
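The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (the convention under which the listed values agree) and that the gene length includes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2376296, 2376973)
print(length, protein_length_aa(length))
```

With the values from the table, 2376973 - 2376296 + 1 gives the listed 678 bp, and 678 / 3 - 1 gives the listed 225 aa.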
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.83571 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGCAT CTCCCATTGT TGTCTTCGAT CTCGACGGCA CGCTGATCGA CACCGCGCCC GACCTGATCA ACGCCCTGAA TTTCATCCTC GTTCGCGAAG GCATGCCGGC CGTTCCGATG GCCGTCGCGC GCAACATGAT CGGACAAGGC GCTCGCCGCC TGCTCGAACG TGGCCTCGAA CTCGACGGCC GTGTGATCGC GCAGGACGAC GTCAATCGCC TGACGGTCGA TTTCATCGAC TATTACGCCG CCCACATCGC CGACGAGTCG CGCCCGTTCG AAGGGCTGGA AGCGACGCTC GACGAGCTGT CGGAGAGCGG CTACCGGTTT GCGGTCTGCA CCAACAAGCT CGAATGGCTG TCGAAGCTGC TGCTCGACCG GCTCGGGCTC AGCCCGCGCT TTGCGGCGAT CTGCGGCGCG GACACTTTCG GTGTTGCCAA GCCTGACCCG GCGATCCTGC GCGAGACCGT GGCGAAGGCC GGCGGCGACC TGTCGGCGGC CATCATGGTC GGCGACGCGG GACCGGATGT CGGTGTCGCA CGCCGGGCCG GCGTGCCGGT GATCGGCGTC GAATTCGGCT ACACCGAAGT GCCGATCGCC GAGCTCCAGC CGGACCTGTT GGTCGGCCAT ATGCGCGAGT TGCCGCATGC GGTCGGTCGG CTGCTGCCGC GACGCTGA
|
Protein sequence | MTASPIVVFD LDGTLIDTAP DLINALNFIL VREGMPAVPM AVARNMIGQG ARRLLERGLE LDGRVIAQDD VNRLTVDFID YYAAHIADES RPFEGLEATL DELSESGYRF AVCTNKLEWL SKLLLDRLGL SPRFAAICGA DTFGVAKPDP AILRETVAKA GGDLSAAIMV GDAGPDVGVA RRAGVPVIGV EFGYTEVPIA ELQPDLLVGH MRELPHAVGR LLPRR
|
| |
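The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is given in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the codon assignments of translation table 11, which for elongation are the same as the standard genetic code. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example.

```python
from itertools import product

# Codon assignments in TCAG order; table 11 shares these with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing used in the record
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGACCGCAT CTCCCATTGT TGTCTTCGAT"))
```

Table 11 also permits alternative start codons such as GTG and TTG, which are read as methionine at initiation. This gene starts with ATG, so the sketch does not special-case the first codon.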