Gene Information |
Locus tag | Rleg2_4436 |
Symbol | |
ID | 6977530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 68972 |
End bp | 69622 |
Gene Length | 651 bp |
Protein Length | 216 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643393614 |
Product | HAD-superfamily hydrolase, subfamily IA, variant 3 |
Protein accession | YP_002278432 |
Protein GI | 209546514 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E; [TIGR02247] Epoxide hydrolase N-terminal domain-like phosphatase |
|
|
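The length fields in this table are consistent with the coordinates: with inclusive positions, gene length is End bp minus Start bp plus one, and a complete CDS encodes one residue fewer than it has codons because the last codon is a stop. The following is a minimal Python sketch of that check. It assumes the coordinates are 1-based and inclusive, which the record does not state explicitly, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(68972, 69622)
print(length, protein_length(length))  # 651 216
```

Both results match the Gene Length (651 bp) and Protein Length (216 aa) rows.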
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.00258568 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAGCG AACGCGCCCT CATTCTCGAT TTCGGCGGCG TTGTCACCCG CACCCTGTTC GAGACGCATG AGATCACCGA GCGCACGCTC GGCCTTCCCC AGGGATCGCT GACCTGGCTC GGTCCGTTCG ATACAGTGAC CGACCCGCTC TGGGTGGCGA TGCAGAATCG CGAGATCACC GAACGCGATT ACTGGCTGAC GCGGGCCGCC GAGATCGGCC GAATGGTCGG CGAGAATTGG AGCGACATGC AGACCTTCGT GCGCCGCGCC CGCGGCGCCG AACCCGAACT GGTGCTTCGG CCAGAGGCGC GCGACGCCAT CCTGCGGGCC AAGGAAGCGG GATTGAAGCT CGCGATCCTG TCGAACGAAC TCGATCTCTT CTACGGAGCC GAATTCCGCA AGCGCTTCCC GCTGATCGAC CTCTTCGACG TCATCGTCGA CGCCACCTAC ACCAAGATCC TCAAACCCGA TCCGCGCGCC TATGAGCAGG TGCTGTCCGA GCTCGGCCTG CCGCGCGAAG CCTGCGTTTT CGTCGACGAC CAGAAGAAGA ACATCGAAGG CGCCGAAGCC GTCGGATTGC CGCATGTGCA TTTCGACGTC ACCCGCCCTG CCGAAAGTTA TGCCCGCGCA CTTGCGATGC TGGGCCTCTG A
|
Protein sequence | MTSERALILD FGGVVTRTLF ETHEITERTL GLPQGSLTWL GPFDTVTDPL WVAMQNREIT ERDYWLTRAA EIGRMVGENW SDMQTFVRRA RGAEPELVLR PEARDAILRA KEAGLKLAIL SNELDLFYGA EFRKRFPLID LFDVIVDATY TKILKPDPRA YEQVLSELGL PREACVFVDD QKKNIEGAEA VGLPHVHFDV TRPAESYARA LAMLGL
|
| |
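The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a minimal Python illustration under stated assumptions, not the tool that produced this record. It uses the standard codon assignments, which NCBI translation table 11 shares with table 1 (the two differ in which codons may act as starts), and it is run only on the first 30 bases of the gene sequence. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' is a stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(seq.count(b) for b in "GC") / len(seq)


# First three 10-base blocks of the gene sequence above.
fragment = "ATGACAAGCG AACGCGCCCT CATTCTCGAT"
print(translate(fragment))             # MTSERALILD
print(round(gc_percent(fragment), 1))  # 53.3
```

The translated fragment matches the first ten residues of the protein sequence. The GC value printed here describes only the 30-base fragment; the 63% in the record refers to the whole gene.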