Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3547 |
Symbol | |
ID | 5901002 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3825857 |
End bp | 3826573 |
Gene Length | 717 bp |
Protein Length | 238 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641564054 |
Product | phosphoglycolate phosphatase |
Protein accession | YP_001685172 |
Protein GI | 167647509 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01449] 2-phosphoglycolate phosphatase, prokaryotic [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATGCTC CGAATGAGAT CCTGAACGGC GCCGTCATCG CCTTCGACCT GGACGGCACC CTGGTCGACA CCGCCCCCGA CCTCGTGGTC TCGCTGAACA TCATCCTCGC CGAGGAGGGC CTGCCGCCCC TGCCGTTCGA CGACGTGCGC AAGATGGTCG GCCGGGGCGC CAAGGCCTTG CTGGAACGCG GCTTCGCCGC GGCCGGCGCG CCGCTCGACG CCGATCAGGC CCCAACCCTG GTCGAGCGGT TCATCGCCCT CTATCTGGGC CGCATCGCCC ACGAGAGCGC CCCCTTTCCT GGCGTGGTCG ACACCCTGAT CGCCCTGCGC GCCAGCGGCG CCAAGCTGGC GGTCTGCACC AACAAGCTGA CCCACCTGTC GGTCGCCCTG CTCGACGCCC TGGACCTGAC GCAACATTTC GACGCCGTGG TCGGGGCCGA CAGCGCCCCC GCCGCCAAGC CCGATCCGCG CCACGTGCTG GCCGCGATCA CGGCGGTCGG CGGCGATCCG GCCCGCGCCG TGATGATCGG CGACAGCATC AATGACGCCC TGGCCGCCAG GGCCGCCAAC GTCCCGACCG TGCTGGTGAC GTTCGGCTAC ACCGAAGCCC CGGTCGAGAC CCTGGGCGGA GATCTGTTGA TAGACGCCTT CTCCGACGTG CCATCGGCCT GCATCACGCT TTTGACCTCT TGCGGTCCCG GAACGACCGG GCTATAG
|
Protein sequence | MHAPNEILNG AVIAFDLDGT LVDTAPDLVV SLNIILAEEG LPPLPFDDVR KMVGRGAKAL LERGFAAAGA PLDADQAPTL VERFIALYLG RIAHESAPFP GVVDTLIALR ASGAKLAVCT NKLTHLSVAL LDALDLTQHF DAVVGADSAP AAKPDPRHVL AAITAVGGDP ARAVMIGDSI NDALAARAAN VPTVLVTFGY TEAPVETLGG DLLIDAFSDV PSACITLLTS CGPGTTGL
|
| |