Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_2083 |
Symbol | |
ID | 4710087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 2286876 |
End bp | 2287556 |
Gene Length | 681 bp |
Protein Length | 226 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639856557 |
Product | phosphoglycolate phosphatase |
Protein accession | YP_001003649 |
Protein GI | 121998862 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01449] 2-phosphoglycolate phosphatase, prokaryotic [TIGR01493] Haloacid dehalogenase superfamily, subfamily IA, variant 2 with 3rd motif like haloacid dehalogenase [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.609164 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCTCT CGCTAACCCG CGCGATTCTC TTCGATCTGG ACGGGACGCT GGTGGACAGC GCCCCGGATC TGACCGTCGC CATCAACCAG GTGCTCGCCG AGCGGGACCA CGCCCCGGTG ACCGAGGAGC AGGTCCGCGG TTGGGTGGGC AATGGTGCCC GTCGGCTGGT GGCCCGCGCC CTCACCGGCG CGGACGACGG CAACCCGCCA GAAGAGGAGC TGGACGCCGC GCTCGAGCGC TTCTTCGAGT GCTACGGCGA GGCCGTCTAC GTACACAGCC GCCCCTACCC GGAGGCCGTC GAGACCCTGC AGGCCCTGGC CCAGGCCGGC ATGCGTCTGG CCGTGGTCAC CAACAAGCCG CGGCGCTTCG CCGAGCCGAT CCTCCAGGGC ATGGGGGTGA CGGATGCCAT CGACGTGGTC GTTGGCGGCG AGTGCACCGA GGCCCGCAAG CCCGACCCGG AGCCGCTCCG GCTGGCCATG GAGCGCCTGG GGGCGGCGTC GCGGACCGTG CTGATGGTCG GTGACTCGCG AACCGACGTG GAGGCGGCGC GCAACGCCGG TATTCCGGTG GTGTGTGTGC CCTACGGCTA CCGCCGCGGG GTGGCGCTGG AAGACCTGGG CGCCGACGCC ATCGTGGATG ATCTCAGCGG CGTGGTCGCG CTGCTACGCG AGGCGGCCTG A
|
Protein sequence | MDLSLTRAIL FDLDGTLVDS APDLTVAINQ VLAERDHAPV TEEQVRGWVG NGARRLVARA LTGADDGNPP EEELDAALER FFECYGEAVY VHSRPYPEAV ETLQALAQAG MRLAVVTNKP RRFAEPILQG MGVTDAIDVV VGGECTEARK PDPEPLRLAM ERLGAASRTV LMVGDSRTDV EAARNAGIPV VCVPYGYRRG VALEDLGADA IVDDLSGVVA LLREAA
|
| |