In real-world applications such as lead optimization, it is often desirable to optimize several different properties at the same time. For example, we may want to optimize the selectivity of a drug while keeping its solubility within a specific range. Formally, in the multi-objective reinforcement learning setting, the environment returns a vector of rewards at each step $t$, with one reward for each objective, i.e. $\mathbf{r}_t = \left(r_t^{(1)}, r_t^{(2)}, \ldots, r_t^{(k)}\right)$, where $k$ is the number of objectives.
Multi-objective optimization can pursue different goals: finding a set of Pareto-optimal solutions, or finding one or several solutions that satisfy the preferences of a decision maker. Similar to the choice in Guimaraes et al.15, we adopted the latter in this paper. Specifically, we implemented a "scalarized" reward framework to realize multi-objective optimization: given a user-defined weight vector $\mathbf{w} = (w_1, w_2, \ldots, w_k)$, the scalarized reward is calculated as

$$ r_t = \sum_{i=1}^{k} w_i \, r_t^{(i)}. $$
The objective of the MDP is then to maximize the cumulative scalarized reward.
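As an illustration, the minimal sketch below computes a weighted-sum scalarized reward of this form in Python. The function name, the use of NumPy, and the specific reward values and weights are illustrative assumptions, not the implementation used in this work.

```python
import numpy as np

def scalarize_reward(reward_vector, weights):
    """Collapse a multi-objective reward vector into a single scalar reward
    via a user-defined weighted sum (illustrative sketch)."""
    reward_vector = np.asarray(reward_vector, dtype=float)
    weights = np.asarray(weights, dtype=float)
    assert reward_vector.shape == weights.shape, "one weight per objective"
    return float(np.dot(weights, reward_vector))

# Example with k = 2 objectives (e.g. selectivity and solubility scores),
# where the decision maker weights selectivity twice as heavily as solubility.
r_t = [0.8, 0.4]   # per-objective rewards returned by the environment at step t
w = [2/3, 1/3]     # user-defined preference weights
print(scalarize_reward(r_t, w))  # ~0.667
```

The scalar returned in this way can then be used directly as the per-step reward in a standard single-objective RL update.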