Gene Information |
Locus tag | Rleg_4859 |
Symbol | |
ID | 8007247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 239182 |
End bp | 240381 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644821789 |
Product | Mandelate racemase/muconate lactonizing protein |
Protein accession | YP_002973049 |
Protein GI | 241113214 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
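The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported gene length) and that the final codon is a stop codon that does not contribute a residue. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 239182
end_bp = 240381

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon,
# which is not translated into an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the "Gene Length" field
print(protein_length, "aa")   # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the Gene Length (1200 bp) and Protein Length (399 aa) fields.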
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.156633 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.785793 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCA CCGATCTGCG CTGCGCCGTT ATCGGCAAAC ATCCGATCGT CCGCATCGTC ACGGACGAGG GCCTCTATGG CTTGGGCGAA GTCGAGTTCA CCAAGTCCTA CCTCAAGCCC TTTGTGCTGC ATTTCCGCGA AGCGCTGATC GGCGAGGACC CGACCGACGT CGAGAGAGTG ATGCTGAAGA TCCGCCAACG CGGCTCTTTC AAGCCCTACG GCGCGGCGGT AAGCGCGATC GAGCATGCGC TGTGGGATAT TGCCGGTAAG GCGGCGGGCG TGCCCGCCTA TAAACTGCTC GGCGGCAAGG TGCGCGACAA GGTGCGCGTC TACAACGGCT CGATCCGCCA GAAACGCACC GGCGACCGGC CGGAGGATTA CGCCGCTGAC GTCAAATGGA TGATGGAGCA GCCGCAGAAC TTCTTCATGG TCAAACAAGG GATCTCGTTC CACTCCAACA TGAAGGACAC CATCGAGGAT TTCCACTACG GCGTGACGCA GAAGAAGGCC GGCTATCACG GTGCCATGGA TCAGGGCGTA ATCAGCGAGC GCGGCTTCAA TCACATGCTC GACTGCGTGA TCGCGATGAA GGAAGTGCTG GGCGACAAAG TCAGCCTGGC GCTCGACTGC GGTCCGGGCT GGATGCTGCC CGATGCGATC AAGTTCGCGC GCGCGGTAGA GAAGTACAAT TTGATGTGGC TCGAGGACAT GCTGACCGGC GACTACGTGC CGTGGGTCAA TCCGCAGGCC TATCGGGAAC TGACAATCTC CACCTCGACG CCGATCCACA CTGGTGAGCA GATCTACCTG CGGCACAATT TCAAGGAACT GATCGAGACG CAGGCGGTAC GCGTCATCGG CCCCGATCCA GCCGATATTG GCGGTATTGC CGAGCTCAAA TGGGTCGCCG AGCGCGCCTA CATGCACTCG ATCCTGATGG CGCCGCACGG CACAGCTAAC GGCCTGCTGG GGCTCGGCGC ATTGATCAAT GTCTGCGCCA CCTTGCCGGC AAATTATATC GCCTTCGAAT ATCCGAGCGC CTCCGACCCC TGGTGGGAGG ATCTGGTCAT CGGCTTGCCG GCGCAGATCG TGAAGGAAAG CATGGTGGAC CTACTGGAAG CGCCGGGGCT CGGCCTCGAT ATCGACGCCG AGGCGGCCAG GCGATATCTC AGGGAAGAGG ATGCTGGCTT CTTCGACTGA
|
Protein sequence | MKITDLRCAV IGKHPIVRIV TDEGLYGLGE VEFTKSYLKP FVLHFREALI GEDPTDVERV MLKIRQRGSF KPYGAAVSAI EHALWDIAGK AAGVPAYKLL GGKVRDKVRV YNGSIRQKRT GDRPEDYAAD VKWMMEQPQN FFMVKQGISF HSNMKDTIED FHYGVTQKKA GYHGAMDQGV ISERGFNHML DCVIAMKEVL GDKVSLALDC GPGWMLPDAI KFARAVEKYN LMWLEDMLTG DYVPWVNPQA YRELTISTST PIHTGEQIYL RHNFKELIET QAVRVIGPDP ADIGGIAELK WVAERAYMHS ILMAPHGTAN GLLGLGALIN VCATLPANYI AFEYPSASDP WWEDLVIGLP AQIVKESMVD LLEAPGLGLD IDAEAARRYL REEDAGFFD
|
| |
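The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It uses only the first 30 bases of the gene sequence above, with the spacing removed, and a codon table built by hand; translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as starts. The helper names `translate` and `gc_fraction` are hypothetical. The displayed sequence begins with ATG, so it is assumed to be given in coding orientation even though the gene lies on the minus strand.

```python
# Build the codon -> amino acid mapping in the conventional TCAG order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)

def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at a stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)

# First three 10-base groups of the gene sequence, joined without spaces.
prefix = "ATGAAGATCACCGATCTGCGCTGCGCCGTT"

print(translate(prefix))               # compare with the start of the protein sequence
print(round(gc_fraction(prefix), 3))   # GC fraction of this 30-base prefix only
```

The translated prefix corresponds to the first ten residues of the protein sequence (MKITDLRCAV). The GC fraction printed here covers only the prefix, so it is not expected to match the 61% reported for the whole gene; passing the full gene sequence to `gc_fraction` would be needed for that comparison.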