Gene Smed_5845 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5845 
Symbol 
ID5320147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp809136 
End bp810383 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content62% 
IMG OID640777540 
Productmandelate racemase/muconate lactonizing protein 
Protein accessionYP_001314472 
Protein GI150377877 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGAC CCAGAATCAC CGACATCCGT GCAACCACCG TCACGGTTCC GCTGGAGGCG 
CCGCTGCGCC ATTCCAACGG CGCCCATTGG GGGCGGTTCG TGCGCACCAT CGTCGAGGTA
GAGACCGACG TGGGCATCAT CGGCCTTGGC GAGATGGGCG GCGGCGGGGA GAGCGCCGAG
GCGGCGTTTC GGGCCTTGAA GTCCTATCTC GTCGGCCACG ATCCCTTCGA GCTGGAAAAT
CTCCGCTTCA TGATCTGCAA CCCGACCGCC AGCCTCTACA ACAATCGCAC CCAGATGCAT
GCGGCCATCG AATTCGCCTG TCTGGACATC ATGGGCAAGT TTCTCGGCGT GCCGGTCTGC
GACATCCTGG GCGGTAAGAT GCGCGATGCC GTTCCTTTTG CCAGCTACAT GTTTTTCCGC
CTCGCCAACA AGGATACCGG CGAGGGCGAG ACGCGCACGG CCGATCAGCT TATCGAGCAG
ACCTTGGCGC TGAAAAAGAA GTGCGGCTTC ACCTCGCACA AGCTGAAGAG CGGCGTCTTC
CCGCCGGATT ATGAGCTGGA GGTTTTTCGC GCATGGGCAA AGGCGCTCGG CCCCGACAGC
GTCCGCTACG ACCCCAACGC GGCCTTCAGC GTCGAGGAGG CGATCCGCTT CGCCAGGGGC
ATCGAAGATC TGAACAATGA TTATTATGAG GACCCGACCT GGGGGTTGAA CGGCATGCGG
CGGGTGCGCG AAAACACGAC GATGCCGCTC GCCACCAACA CCGTTGTCGT GAATTTCGAA
CAGCTCGCCG CCAATATTCT CAATCCCGCG GTCGACGTCA TTCTGCTCGA CACCACCTTC
TGGGGCGGCA TCCGGCCATG CGTAAAGGCG GCGGGCGTTT GCGAGACCTT TCAGCTCGGC
ATCGCGGTGC ATTCATCTGG CGAACTTGGC GTTCAGCTCG CCACCATGCT GCATCTCGGC
GCGGTCCTTC CGAACCTCGT TTTCCATGCC GACGCGCATT ATCACCAGCT CACGGACGAC
ATCATCATCG GGGGACCGAT GCGTTACGAG AACGGCGCGA TCAAGGTGCC GACGGCGCCC
GGCCTCGGGG TCGAGCTCGA TCGCAACAGG CTCGGCCAGT ATGCCGATCT TCACAAGACG
CTCGGTGGCT ATTCCTATGA CCGGGATCCG TCCCGCCCCG GCTGGTTCTC GGTCGTCCCG
AACACCCGAT GGGCGGACCC CAGCGATGAT CGCGTCGTGG ACTACTGA
 
Protein sequence
MKRPRITDIR ATTVTVPLEA PLRHSNGAHW GRFVRTIVEV ETDVGIIGLG EMGGGGESAE 
AAFRALKSYL VGHDPFELEN LRFMICNPTA SLYNNRTQMH AAIEFACLDI MGKFLGVPVC
DILGGKMRDA VPFASYMFFR LANKDTGEGE TRTADQLIEQ TLALKKKCGF TSHKLKSGVF
PPDYELEVFR AWAKALGPDS VRYDPNAAFS VEEAIRFARG IEDLNNDYYE DPTWGLNGMR
RVRENTTMPL ATNTVVVNFE QLAANILNPA VDVILLDTTF WGGIRPCVKA AGVCETFQLG
IAVHSSGELG VQLATMLHLG AVLPNLVFHA DAHYHQLTDD IIIGGPMRYE NGAIKVPTAP
GLGVELDRNR LGQYADLHKT LGGYSYDRDP SRPGWFSVVP NTRWADPSDD RVVDY