Gene Information |
Locus tag | SNSL254_A3754 |
Symbol | gph |
ID | 6483401 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 3618525 |
End bp | 3619283 |
Gene Length | 759 bp |
Protein Length | 252 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642739021 |
Product | phosphoglycolate phosphatase |
Protein accession | YP_002042732 |
Protein GI | 194445749 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01449] 2-phosphoglycolate phosphatase, prokaryotic; [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 79 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAAGT TACAGAATAT TCGGGGCGTC GCCTTTGATC TTGACGGTAC GCTGGTGGAT AGCGCGCCGG GTCTTGCCGC GGCGGTGGAT ATGGCGCTGT ATGCGCTGGA ACTGCCGGTC GCGGGCGAGG AGCGCGTGAT TACCTGGATT GGTAACGGCG CAGACGTATT GATGGAACGC GCGCTGGCCT GGGCTCGCGA GGAGCGCGCC ACGCTGCGTA AGACGATGGG GAAACCGCCC GTTGATGAAG ATATTCCTGC CGAGGAACAG GTACGCATTC TGCGTAAACT GTTCGACAGG TATTATGGCG AAGTGGCGGA AGAGGGCACT TTTTTATTTC CGCATGTCGC CGACACGCTG GGCGCGCTGC ACGCCAGCGG ATTGTCATTA GGTCTGGTGA CGAATAAGCC GACGCCGTTC GTCGCGCCGT TGCTGGAATC GCTTGATATC GCCAAATACT TTAGTGTGGT TATCGGCGGC GATGATGTGC AAAATAAGAA GCCGCATCCG GAGCCGCTGT TGCTGGTGGC AAGCCGGCTG GGCATGATGC CGGAGCAGAT GCTTTTTGTC GGCGATTCGC GTAATGATAT TCAGGCTGCA AAAGCGGCGG GCTGCCCTTC GGTTGGCCTG ACATACGGCT ACAATTATGG CGAAGCGATC GCTCTTAGCG AGCCGGACGT CATTTATGAC AGTTTTAACG ATCTTTTGCC CGCACTTGGG CTTCCGCATA GCGATAACCA GGAAATAAAA AATGACTAA
|
Protein sequence | MDKLQNIRGV AFDLDGTLVD SAPGLAAAVD MALYALELPV AGEERVITWI GNGADVLMER ALAWAREERA TLRKTMGKPP VDEDIPAEEQ VRILRKLFDR YYGEVAEEGT FLFPHVADTL GALHASGLSL GLVTNKPTPF VAPLLESLDI AKYFSVVIGG DDVQNKKPHP EPLLLVASRL GMMPEQMLFV GDSRNDIQAA KAAGCPSVGL TYGYNYGEAI ALSEPDVIYD SFNDLLPALG LPHSDNQEIK ND
|
| |