Gene Information |
Locus tag | Rleg_2233 |
Symbol | |
ID | 8013239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 2237414 |
End bp | 2238130 |
Gene Length | 717 bp |
Protein Length | 238 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644824819 |
Product | phosphoglycolate phosphatase |
Protein accession | YP_002976049 |
Protein GI | 241204953 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01449] 2-phosphoglycolate phosphatase, prokaryotic; [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
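The coordinate and length fields above are consistent with each other if the start and end positions are read as 1-based and inclusive, and if the protein length excludes the terminal stop codon. Both readings are assumptions made for this check, not statements from the record itself. The helper names `gene_length` and `protein_length` are illustrative only. A minimal sketch:

```python
# Values copied from the record above.
START_BP = 2237414
END_BP = 2238130


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 717
print(protein_length(length_bp))  # 238
```

Under these assumptions the results match the Gene Length (717 bp) and Protein Length (238 aa) fields.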
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.25252 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.513173 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCCTTGA CCCCTGCCCC CACTCGCCCG GCGCTCGTCG TCTTCGATCT CGACGGCACC CTTCTCGATA CGCATGTGGA CCTGGTCGAG AGCCTGAACC ACACGATCGC GGCCCTTGAC CTCGAGCCGG TCAGCTACGA CGACCTCACC CATCTCGTCG GCCAGGGCGC GCGCGTCATG ATCGAGCGCG CCTGCCGGCT GCGCGGCCAT CCGCTCGAAA GCGACGCCCT GCCGCCGCTG GTCGAGCGCT TCGTCGCTCA TTATGCCGGC AACATGCCCG GCCGGACCGA ACCCTATCCC GGCCTGGTCG CCGCCATGGA CCGGCTGAAA TCTCAGGGTT ACCGCCTCGC CGTCTGCACC AACAAGATGG AAAGCCTGGC AGTCCGCCTG CTCGACAAGC TCGACCTCGT CAGATATTTC GACACCATCA CCGGCGGCGA CAGCTTCGAA TACCGCAAGC CCGACGCCCG CCACCTCACC GGCACCATCG AACGCGCCGG CGGCGACATC GCCCGCACCG TGATGATCGG TGACAGCGTC AACGATATTG CCGTAGCAAG AAACGCCGGC ATACCGTCGA TCGCCGTGCC TTTCGGTTAT TCCGACGTGC CGGTTTCGAG CCTCGATCCG GATCTTATCA TCACCCATTT CGATGAGTTG ACGCCGGATC TGGTGGAAAC GCTGTTGCGG GAATATGCGG AGAAGGTTGC CGTCTGA
Protein sequence | MPLTPAPTRP ALVVFDLDGT LLDTHVDLVE SLNHTIAALD LEPVSYDDLT HLVGQGARVM IERACRLRGH PLESDALPPL VERFVAHYAG NMPGRTEPYP GLVAAMDRLK SQGYRLAVCT NKMESLAVRL LDKLDLVRYF DTITGGDSFE YRKPDARHLT GTIERAGGDI ARTVMIGDSV NDIAVARNAG IPSIAVPFGY SDVPVSSLDP DLIITHFDEL TPDLVETLLR EYAEKVAV
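The gene and protein sequences can be cross-checked by translating the nucleotide sequence, and the GC content field can be recomputed from the same string. The sketch below uses only the first 30 bases of the gene sequence. The function names `clean`, `gc_fraction`, and `translate` are hypothetical helpers written for this illustration. It builds the codon table from the standard NCBI amino-acid string, which translation table 11 shares with the standard code for elongation (the two differ in permitted start codons, which does not matter here because this gene begins with ATG).

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(dna: str) -> str:
    """Remove the spaces between 10-base blocks and normalise case."""
    return "".join(dna.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in the record.
fragment = "ATGCCCTTGA CCCCTGCCCC CACTCGCCCG"
print(translate(fragment))  # MPLTPAPTRP
print(round(100 * gc_fraction(fragment)))  # GC percentage of this fragment only
```

Pasting the full gene sequence in place of `fragment` gives the whole translation, for comparison with the protein sequence, and a GC percentage to compare with the GC content field.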