Abstract: A general sequential model is defined where returns are in a partially ordered set. A distinction is made between maximal (nondominated) ...
In this paper we investigate the computation of optimal policies in constrained discrete stochastic dynamic programming with the average reward as utility function. The state-space and action-sets are ...