Gene Information |
Locus tag | SbBS512_E3762 |
Symbol | gph |
ID | 6270055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 3487890 |
End bp | 3488648 |
Gene Length | 759 bp |
Protein Length | 252 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641727625 |
Product | phosphoglycolate phosphatase |
Protein accession | YP_001882060 |
Protein GI | 187732490 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01449] 2-phosphoglycolate phosphatase, prokaryotic; [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
| |
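The start, end, gene length, and protein length values listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the Gene Information table.
START_BP = 3487890
END_BP = 3488648

length_bp = gene_length(START_BP, END_BP)
protein_aa = expected_protein_length(length_bp)

print(length_bp)   # 759
print(protein_aa)  # 252
```

Both results agree with the listed Gene Length (759 bp) and Protein Length (252 aa).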
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 0.40481 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAGT TTGAAGATAT TCGCGGCGTC GCTTTTGATC TCGACGGTAC GCTGGTTGAC AGTGCTCCCG GTCTTGCTGC TGCGGTAGAT ATGGCGCTGT ATGCGCTGGA GTTACCCGTC GCAGGTGAAG AACGCGTTAT TACCTGGATT GGTAACGGCG CAGATGTTCT GATGGAGCGT GCATTGGCCT GGGCGCGTCA GGAACGTGCG ACTCTGCGTA AAACAATGGG TAAACCGCCC GTTGATGACG ACATTCCGGC AGAAGAACAG GTACGTATTC TGCGTAAACT GTTCGATCGC TACTATAGCG AGGTTGCCGA AGAGGGGACG TTTTTGTTCC CGCACGTTGC CGATACGTTG GGCGCGTTGC AGGCTAAAGG CCTGCCGCTA GGCCTGGTCA CCAACAAACC GACGCCGTTC GTCGCGCCAC TGCTCGAAGC CTTAGATATT GCCAAATACT TTAGCGTGGT TATCGGCGGC GATGATGTGC AAAACAAAAA ACCGCATCCG GACCCGCTGT TACTGGTGGC TGAGCGGATG GGAATTGCCC CACAACAGAT GCTGTTTGTC GGCGACTCAC GCAATGATAT TCAGGCGGCA AAAGCGGCAG GTTGCCCATC AGTTGGCTTA ACCTACGGAT ATAACTACGG CGAGGCTATC GATCTCAGCC AGCCTGATGT AATTTATCAG TCTATAAATG ACCTTCTGCC CGCATTAGGG CTTCCGCATA GCGAAAATCA GGAATCGAAA AATGACTAA
|
Protein sequence | MNKFEDIRGV AFDLDGTLVD SAPGLAAAVD MALYALELPV AGEERVITWI GNGADVLMER ALAWARQERA TLRKTMGKPP VDDDIPAEEQ VRILRKLFDR YYSEVAEEGT FLFPHVADTL GALQAKGLPL GLVTNKPTPF VAPLLEALDI AKYFSVVIGG DDVQNKKPHP DPLLLVAERM GIAPQQMLFV GDSRNDIQAA KAAGCPSVGL TYGYNYGEAI DLSQPDVIYQ SINDLLPALG LPHSENQESK ND
|
| |
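As a rough illustration of how the protein sequence relates to the gene sequence, the sketch below translates the first 30 bases and the last 9 bases of the gene sequence shown above. It assumes the sequence is written 5' to 3' on the coding strand (the leading ATG suggests this) and uses the standard codon assignments, which translation table 11 shares with the standard code for non-start codons. The helper names are hypothetical.

```python
# Standard codon assignments, ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks and final block of the gene sequence in the record.
first_30 = "ATGAATAAGT TTGAAGATAT TCGCGGCGTC"
last_9 = "AATGACTAA"

print(translate(first_30))  # MNKFEDIRGV
print(translate(last_9))    # ND
```

The two outputs match the first ten and last two residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the listed GC content.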