Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5452 |
Symbol | |
ID | 6978546 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 1097229 |
End bp | 1098416 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643394553 |
Product | Mandelate racemase/muconate lactonizing protein |
Protein accession | YP_002279371 |
Protein GI | 209547453 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.168322 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.041032 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCG ACCGCATGCG GGTTTTCATG ACCCGCGACA AGGACCGTCC GCGCGTGATC GTCGCGCTCG ATACCGATGA CGGGCTGACG GGCTGGGGCG AATGCTACAA TCACGGCCCC GACAAGGCGC TGCCGCCGCT GCTCGATTAT CTCTACGCCT TTGTGGCCGG CCAGGATCCG ACACGCGTCG ATTATCTGGT CAACCTGCTG ATCCAGCAAA GCCGGTTCCC GCCGGGCGCG CTCGGCCTTG CCGCAATCTC CGCGCTCGAT CATTGCCTGT GGGACCTAGC GGCCAAGGCG GCGAACGTCC CGGTCTACAA GCTGCTCGGC GGCGAAGTGC GCGACCGCAT CAAGGTCTAT GCCGGGGTCT ATACCGCACC GGACGCGCCG GCGGCCCGCG ACGAATTCGA TCGCCTGAAG GAGGGATGGG GGTTCACCGC CTTCAAGCTC AGCCCCTGGC GGATCGACAT GCACGCCAAT CGCTGGGGCA ACGTCGTCAA AGCCTCGGCG GATTATTTCC GTTCGCTGCG CGAAACGGTC AACGATGAAT ACGAGATCGC CTTCGACGCC CATGCCAAGA TTTTCGAGCC GATCGCCGCT CGCCAGCTCG GCAACGCGCT AGCGCCCTAC GATCCGCTGT TTTTCGAGGA GCCGCTGCGT CCTGAGAACA TCGAGGCCTG GGGCGACCTG AAACAGGGGC TCAACTGCGT CCTCGCCACC GGTGAGTCAC TTTACAGCAG AAACGAGTTC CTGCGGCTGC TGCAGGTCAA GGGCGCCGAT CTCATCCAGC CGGATATCTG CGTCGTCGGC GGCATCAGCG AAATGCGCCG CATTGCGACG CTTGCCGAAG CCTTCTTCGT CGGCGTTGCG CCGCACAATC CGATGGGGCC GCTGGCAACG GCGGTCAACG TGCATTTTTC GGCCGCGGCG CAGAATTTCC GCATCCTCGA ATACCGGCTG CCAAAGGGCC AGGCCTACGT CTATGGCGGC AACGATATCG AGAAGCGGGA AGGGGAAACC CGTTACGTCG TCGATCCCTA TCTGCCGAAG GACGGTTATC TCGAACTGCG CCCGGATCGG CCCGGCTGGG GCGTCGAGAT GGACGAGAAG GCGATGGAGG AGGAAGGCTA CATCCACTGG CAGCGGCGCG TGCCAAAGCG GCCGGACGGT TCTTACGCCT TCGCTTGA
|
Protein sequence | MKIDRMRVFM TRDKDRPRVI VALDTDDGLT GWGECYNHGP DKALPPLLDY LYAFVAGQDP TRVDYLVNLL IQQSRFPPGA LGLAAISALD HCLWDLAAKA ANVPVYKLLG GEVRDRIKVY AGVYTAPDAP AARDEFDRLK EGWGFTAFKL SPWRIDMHAN RWGNVVKASA DYFRSLRETV NDEYEIAFDA HAKIFEPIAA RQLGNALAPY DPLFFEEPLR PENIEAWGDL KQGLNCVLAT GESLYSRNEF LRLLQVKGAD LIQPDICVVG GISEMRRIAT LAEAFFVGVA PHNPMGPLAT AVNVHFSAAA QNFRILEYRL PKGQAYVYGG NDIEKREGET RYVVDPYLPK DGYLELRPDR PGWGVEMDEK AMEEEGYIHW QRRVPKRPDG SYAFA
|
| |