Stochastic dynamic programming: Difference between revisions

Revision as of 23:25, 22 November 2021

Authors: Bo Yuan, Ali Amadeh, Max Greenberg, Raquel Sarabia Soto and Claudia Valero De la Flor (CHEME/SYSEN 6800, Fall 2021)

Theory, methodology and algorithm discussion

Theory

Stochastic dynamic programming combines stochastic programming and dynamic programming. Therefore, to understand better what it is, it is better first to give two definitions:

Stochastic programming. Unlike in a deterministic problem, where a decision’s outcome is only determined by the decision itself and all the parameters are known, in stochastic programming there is uncertainty and the decision results in a distribution of transformations.
Dynamic programming. It is an optimization method that consists in dividing a complex problem into easier subprobems and solving them recursively to find the optimal sub-solutions which lead to the complex problem optima.

In any stochastic dynamic programming problem, we must define the following concepts:

Policy, which is the set of rules used to make a decision.
Initial vector, p where p∈D and D is a finite closed region.
Choice made, q where q∈S and S is a set of possible choices.
Stochastic vector, z.
Distribution function dG_q (p,z), associated with z and dependent on p and q.
Return, which is the expected value of the function after the final stage.

@@ Line 6: / Line 6: @@
 * '''Stochastic programming.''' Unlike in a deterministic problem, where a decision’s outcome is only determined by the decision itself and all the parameters are known, in stochastic programming there is uncertainty and the decision results in a distribution of transformations.
 * '''Dynamic programming.''' It is an optimization method that consists in dividing a complex problem into easier subprobems and solving them recursively to find the optimal sub-solutions which lead to the complex problem optima.
+In any stochastic dynamic programming problem, we must define the following concepts:
+* Policy, which is the set of rules used to make a decision.
+* Initial vector, p where p∈D and D is a finite closed region.
+* Choice made, q where q∈S and S is a set of possible choices.
+* Stochastic vector, z.
+* Distribution function dG_q (p,z), associated with z and dependent on p and q.
+* Return, which is the expected value of the function after the final stage.

Stochastic dynamic programming: Difference between revisions

Revision as of 23:25, 22 November 2021

Theory, methodology and algorithm discussion

Theory

Navigation menu