Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5408 |
Symbol | |
ID | 6978502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 1051769 |
End bp | 1052959 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643394510 |
Product | Mandelate racemase/muconate lactonizing protein |
Protein accession | YP_002279328 |
Protein GI | 209547410 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00119776 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGTTTGATC CCAATCGACG CACGATCCCC GCACAGCCGC CTAAGGCTGG CTTTGCGGCC GGTCCGGCCA AGCTGGACGC CATCCGATCG GTCACGCTGT CGCTCGCCTA TCTGCCGTTG GCGCGGCCGA TCAGCGACGC CAAGGTTTTG ACCGGCCGGC AGAAGCCGCT GACGGAGGTG GCGTTTCTGT TTTGCGAGAT CGTCTCCGAC GCTGGCCATA GCGGTCTCGG CTTCAGTTAC TCGAAGCGGG CAGGCGGGCC AGCCCTTTAC GCACATGCCT GCGAGATTGC CGACAATCTG ATCGGCGAAG ATCCCAACGA TACAGCGCGG ATATGGGACA AGCTCTGCTG GGCCGGCGCC TCGGTCGGCC GTTCCGGCAT TGCGACACAG GCCATTGCCG CCATCGACAT CTGCCTTTGG GACCTGAAGG CGAAACGCGC CGGCCTGCCG CTGGCCAAAC TGCTTGGCGC CCATCGCGAC AGCGTCGCCT GCTACAACAC GTCGGGCGGC TTCCTTTCGT CGAGCGTCGA GGAAATCCGC GATGCCATCG ATCACTCGAT CGCCTCCGGC ATCGGCGGCA TCAAGATCAA GGTCGGGCAG CCGGACCCGA TGATCGATCT TCGCCGGCTC GATGCGGTGA CCAGCCATAT CGATGGCCGC GTGCCGCTGA TGGTCGATGC CAACCAGCAA TGGGACCGCA CGACGGCGTT GCGCTTCGGC CGGCTGGTCG AGCCGCTCAA TCTCGAATGG ATCGAAGAGC CGCTCGATGC CTATGACGCC GAGGGCCATG CGGCACTGGC GCGTGAACTC GCCACACCAA TCGCCACCGG CGAGATGCTG GCAAGCGCCG ACGAACATAT GGCCCTGATC CGTGCCGACG CGGTCGACTT CATCCAGCCG GATGCGCCGC GGGTCGGGGG CATCACGCCC TTTCTGCGCA TCTGCACGCA AGCCGAGGCG AAACGCATGC GGCTCGCGCC GCATTTCGCC ATGGAAATCC ATCTGCACTT GGCGGCCGCT TATGCGCACG AGCCGTGGGT CGAGCATTTC GATTGGCTGG CGCCGCTGTT CAACGAACAG CTCGATATTA GCGACGGCCG GATGATCGTG CCCGCCCGTC CGGGGCTTGG TTGCAGCCTG ACGGGCAAAG CCCGTGACTG GACCGTGGAA ACGCGCAGCT TCGGCGGCTG A
|
Protein sequence | MFDPNRRTIP AQPPKAGFAA GPAKLDAIRS VTLSLAYLPL ARPISDAKVL TGRQKPLTEV AFLFCEIVSD AGHSGLGFSY SKRAGGPALY AHACEIADNL IGEDPNDTAR IWDKLCWAGA SVGRSGIATQ AIAAIDICLW DLKAKRAGLP LAKLLGAHRD SVACYNTSGG FLSSSVEEIR DAIDHSIASG IGGIKIKVGQ PDPMIDLRRL DAVTSHIDGR VPLMVDANQQ WDRTTALRFG RLVEPLNLEW IEEPLDAYDA EGHAALAREL ATPIATGEML ASADEHMALI RADAVDFIQP DAPRVGGITP FLRICTQAEA KRMRLAPHFA MEIHLHLAAA YAHEPWVEHF DWLAPLFNEQ LDISDGRMIV PARPGLGCSL TGKARDWTVE TRSFGG
|
| |