Gene Information |
Locus tag | SeSA_A3679 |
Symbol | gph |
ID | 6518378 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | - |
Start bp | 3534251 |
End bp | 3535009 |
Gene Length | 759 bp |
Protein Length | 252 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642748661 |
Product | phosphoglycolate phosphatase |
Protein accession | YP_002116425 |
Protein GI | 194735884 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01449] 2-phosphoglycolate phosphatase, prokaryotic; [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.966867 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAAGT TGCAGAATAT TCGGGGCGTC GCCTTTGATC TTGACGGCAC GCTGGTGGAT AGCGCGCCGG GTCTTGCCGC GGCGGTGGAT ATGGCGCTGT ATGCGCTGGA ACTGCCGGTC GCGGGCGAGG AGCGCGTGAT TACCTGGATT GGTAACGGCG CAGACGTATT GATGGAACGC GCGCTGGCCT GGGCTCGCGA GGAGCGCGCC ACGCTGCGTA AGACGATGGG GAAACCGCTC GTTGATGAAG ATATTCCTGC CGAGGAACAG GTACGCATTC TGCGTAAACT GTTCGACAGG TATTATGGCG AAGTGGCGGA AGAAGGCACC TTTTTATTTC CGCATGTCGC CGACACGCTG GGCGCGCTGC ACGCCAGCGG ATTGTCATTA GGTCTGGTGA CGAATAAGCC GACGCCGTTC GTCGCGCCGT TGCTGGAATC GCTTGATATC GCCAAATACT TTAGTGTGGT TATCGGTGGC GATGATGTGC AAAATAAGAA GCCGCATCCG GAGCCGCTGT TGCTGGTGGC AAGCCGGCTG GGCATGATGC CGGAGCAGAT GCTTTTTGTC GGCGATTCGC GTAATGATAT TCAGGCTGCA AAAGCGGCGG GCTGCCCTTC GGTTGGCCTG ACATACGGCT ACAATTATGG CGAAGCGATC GCTCTTAGCG AGCCGGACGT CATTTATGAC AGTTTTAACG ATCTTTTGCC CGCACTTGGG CTTCCGCATA GCGATAACCA GGAAATAAAA AATGACTAA |
Protein sequence | MDKLQNIRGV AFDLDGTLVD SAPGLAAAVD MALYALELPV AGEERVITWI GNGADVLMER ALAWAREERA TLRKTMGKPL VDEDIPAEEQ VRILRKLFDR YYGEVAEEGT FLFPHVADTL GALHASGLSL GLVTNKPTPF VAPLLESLDI AKYFSVVIGG DDVQNKKPHP EPLLLVASRL GMMPEQMLFV GDSRNDIQAA KAAGCPSVGL TYGYNYGEAI ALSEPDVIYD SFNDLLPALG LPHSDNQEIK ND |
| |
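
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the source database. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable and function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3534251
end_bp = 3535009
gene_length_bp = 759
protein_length_aa = 252

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and its final codon is a stop
# codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1


def record_is_consistent() -> bool:
    """Return True if the coordinate span, gene length, and protein length agree."""
    return (
        span_bp == gene_length_bp
        and remainder == 0
        and expected_protein_aa == protein_length_aa
    )


if __name__ == "__main__":
    print("span (bp):", span_bp)
    print("codons:", codons)
    print("consistent:", record_is_consistent())
```

The check is independent of strand: a gene on the minus strand still occupies the same inclusive coordinate span on the replicon.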