Gene Rleg_4859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4859 
Symbol 
ID8007247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp239182 
End bp240381 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content61% 
IMG OID644821789 
ProductMandelate racemase/muconate lactonizing protein 
Protein accessionYP_002973049 
Protein GI241113214 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.156633 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.785793 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCA CCGATCTGCG CTGCGCCGTT ATCGGCAAAC ATCCGATCGT CCGCATCGTC 
ACGGACGAGG GCCTCTATGG CTTGGGCGAA GTCGAGTTCA CCAAGTCCTA CCTCAAGCCC
TTTGTGCTGC ATTTCCGCGA AGCGCTGATC GGCGAGGACC CGACCGACGT CGAGAGAGTG
ATGCTGAAGA TCCGCCAACG CGGCTCTTTC AAGCCCTACG GCGCGGCGGT AAGCGCGATC
GAGCATGCGC TGTGGGATAT TGCCGGTAAG GCGGCGGGCG TGCCCGCCTA TAAACTGCTC
GGCGGCAAGG TGCGCGACAA GGTGCGCGTC TACAACGGCT CGATCCGCCA GAAACGCACC
GGCGACCGGC CGGAGGATTA CGCCGCTGAC GTCAAATGGA TGATGGAGCA GCCGCAGAAC
TTCTTCATGG TCAAACAAGG GATCTCGTTC CACTCCAACA TGAAGGACAC CATCGAGGAT
TTCCACTACG GCGTGACGCA GAAGAAGGCC GGCTATCACG GTGCCATGGA TCAGGGCGTA
ATCAGCGAGC GCGGCTTCAA TCACATGCTC GACTGCGTGA TCGCGATGAA GGAAGTGCTG
GGCGACAAAG TCAGCCTGGC GCTCGACTGC GGTCCGGGCT GGATGCTGCC CGATGCGATC
AAGTTCGCGC GCGCGGTAGA GAAGTACAAT TTGATGTGGC TCGAGGACAT GCTGACCGGC
GACTACGTGC CGTGGGTCAA TCCGCAGGCC TATCGGGAAC TGACAATCTC CACCTCGACG
CCGATCCACA CTGGTGAGCA GATCTACCTG CGGCACAATT TCAAGGAACT GATCGAGACG
CAGGCGGTAC GCGTCATCGG CCCCGATCCA GCCGATATTG GCGGTATTGC CGAGCTCAAA
TGGGTCGCCG AGCGCGCCTA CATGCACTCG ATCCTGATGG CGCCGCACGG CACAGCTAAC
GGCCTGCTGG GGCTCGGCGC ATTGATCAAT GTCTGCGCCA CCTTGCCGGC AAATTATATC
GCCTTCGAAT ATCCGAGCGC CTCCGACCCC TGGTGGGAGG ATCTGGTCAT CGGCTTGCCG
GCGCAGATCG TGAAGGAAAG CATGGTGGAC CTACTGGAAG CGCCGGGGCT CGGCCTCGAT
ATCGACGCCG AGGCGGCCAG GCGATATCTC AGGGAAGAGG ATGCTGGCTT CTTCGACTGA
 
Protein sequence
MKITDLRCAV IGKHPIVRIV TDEGLYGLGE VEFTKSYLKP FVLHFREALI GEDPTDVERV 
MLKIRQRGSF KPYGAAVSAI EHALWDIAGK AAGVPAYKLL GGKVRDKVRV YNGSIRQKRT
GDRPEDYAAD VKWMMEQPQN FFMVKQGISF HSNMKDTIED FHYGVTQKKA GYHGAMDQGV
ISERGFNHML DCVIAMKEVL GDKVSLALDC GPGWMLPDAI KFARAVEKYN LMWLEDMLTG
DYVPWVNPQA YRELTISTST PIHTGEQIYL RHNFKELIET QAVRVIGPDP ADIGGIAELK
WVAERAYMHS ILMAPHGTAN GLLGLGALIN VCATLPANYI AFEYPSASDP WWEDLVIGLP
AQIVKESMVD LLEAPGLGLD IDAEAARRYL REEDAGFFD