Gene Information |
Locus tag | Saro_1198 |
Symbol | |
ID | 3916495 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 1246715 |
End bp | 1247386 |
Gene Length | 672 bp |
Protein Length | 223 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640443934 |
Product | phosphoglycolate phosphatase |
Protein accession | YP_496477 |
Protein GI | 87199220 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01449] 2-phosphoglycolate phosphatase, prokaryotic; [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
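The coordinate and length fields in the Gene Information table can be checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1246715
END_BP = 1247386
GENE_LENGTH_BP = 672
PROTEIN_LENGTH_AA = 223


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    # Both checks are expected to hold for this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span 1246715 to 1247386 covers 672 bp, which corresponds to 224 codons, that is, 223 residues plus one stop codon.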
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.042458 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCAAAT TTCCTTATGC GATCGTCGGC TTCGACCTCG ACGGGACGCT GCTCGATACC TCCCGCGATC TCGGCACCGC GCTCAATCAC GCGCTGGCGC TCGCCGGTCG CCCGCCGGTT CTCCTGGAAG AGGTAACCCG CCACATCGGC GGCGGCGCGG CCCAGATGCT GCGCTCGGCG CTCACGGAAT CGGGCGGGGT CGACGAGGAA GCCTTCCCCC GCCTCCAGGC CGAACTCATC GCCTTCTACG CAAGCAACAT CGCCCACCAC ACCACGCTGT TTCCGGGCGG CGAAGCCATG CTCGACGCGC TCGATGCACG CGGCGTGAAA GTCGCCATCG CCACGAACAA GAAGGAATCG CTGGCGGTCC GCCTGTTCGA GGAACTGGGC ATGACCCACC GCTTCGCGAC GATCATCGGC GGGGACACGC TCGGCCCCGG CACCGCCAAG CCGCGCCCGG ACATGCTCCA CGCCATGGTC GAACGCTGCG GCGGCGGCCC GGCCGCCTTC GTCGGCGACA CCACGTTCGA TGTCGGCGCG GCCCGCGCGG CGGGCCTGCC CGTCGTCGCC GTCCGCTTCG GCTTCAACGA CCTTCCGGCT GACGAGTTGA ACGCCGATGC AGTGATCGAT CACTTCGACG AACTCGTCCC CGCGCTGGAA CAACTCGCCT GA
|
Protein sequence | MAKFPYAIVG FDLDGTLLDT SRDLGTALNH ALALAGRPPV LLEEVTRHIG GGAAQMLRSA LTESGGVDEE AFPRLQAELI AFYASNIAHH TTLFPGGEAM LDALDARGVK VAIATNKKES LAVRLFEELG MTHRFATIIG GDTLGPGTAK PRPDMLHAMV ERCGGGPAAF VGDTTFDVGA ARAAGLPVVA VRFGFNDLPA DELNADAVID HFDELVPALE QLA
|
| |
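The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The following is a rough sketch, not the database's own method. It assumes the gene sequence is already given in coding orientation (it starts with ATG and ends with TGA even though the strand is listed as "-"), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11
# uses these same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence; stop codons are shown as '*'."""
    seq = clean(seq)
    codons = (seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record.
    prefix = "ATGGCCAAAT TTCCTTATGC GATCGTCGGC"
    print(translate(prefix))  # MAKFPYAIVG, the first protein block
    print(round(gc_fraction(prefix), 2))
```

Pasting the full Gene sequence value into `translate` and removing the trailing `*` should reproduce the Protein sequence field, and `gc_fraction` on the same input can be compared with the listed GC content of 68%.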