Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_3854 |
Symbol | gph |
ID | 5588271 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 3828318 |
End bp | 3829076 |
Gene Length | 759 bp |
Protein Length | 252 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640927478 |
Product | phosphoglycolate phosphatase |
Protein accession | YP_001464839 |
Protein GI | 157158678 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01449] 2-phosphoglycolate phosphatase, prokaryotic [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.00525883 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAGT TTGAAGATAT TCGCGGCGTC GCTTTTGATC TTGATGGTAC GCTGGTCGAC AGTGCTCCCG GTCTTGCTGC TGCGGTAGAT ATGGCGCTGT ATGCGCTGGA GTTGCCCATC GCAGGTGAAG AACGCGTTAT TACCTGGATT GGTAACGGCG CAGATGTTCT GATGGAGCGC GCATTGACCT GGGCGCGTCA GGAACGTGCG ACTCTGCGTA AAACAATGGG TAAACCGCCC GTTGATGACG ACATTCCGGC AGAAGAACAG GTACGTATTC TGCGTAAACT GTTCGATCGC TACTATAGCG AGGTTGCCGA AGAGGGGACG TTTTTGTTCC CGCACGTTGC CGATACGTTG GGCGCGTTGC AGGCTAAAGG CCTGCCGCTA GGCCTGGTCA CCAACAAACC GACGCCGTTC GTCGCGCCGC TGCTCGAAGC CTTAGATATT GCCAAATACT TTAGCGTGGT TATCGGCGGC GATGATGTGC AAAACAAAAA ACCGCATCCG GACCCGCTGT TACTGGTGGC TGAGCGGATG GGAATTGCCC CACAAGAGAT GCTGTTTGTC GGCGACTCAC GCAATGATAT TCAGGCGGCA AAAGCGGCAG GTTGCCCATC AGTTGGCTTA ACCTACGGAT ATAACTACGG CGAGGCTATC GATCTCAGCC AGCCTGATGT AATTTATCAG TCTATAAATG ACCTTCTGCC CGCATTAGGG CTTCCGCATA GCGAAAATCA GGAATCGAAA AATGACTAA
|
Protein sequence | MNKFEDIRGV AFDLDGTLVD SAPGLAAAVD MALYALELPI AGEERVITWI GNGADVLMER ALTWARQERA TLRKTMGKPP VDDDIPAEEQ VRILRKLFDR YYSEVAEEGT FLFPHVADTL GALQAKGLPL GLVTNKPTPF VAPLLEALDI AKYFSVVIGG DDVQNKKPHP DPLLLVAERM GIAPQEMLFV GDSRNDIQAA KAAGCPSVGL TYGYNYGEAI DLSQPDVIYQ SINDLLPALG LPHSENQESK ND
|
| |