Based on the theory of performance potentials and the method of equivalent Markov process, the performance optimization problem is discussed for a class of semi-Markov decision processes (SMDPs) with parameterized randomized stationary policies and a simulation optimization algorithm is proposed.
基于性能势理论和等价Markov过程方法,研究了一类半Markov决策过程(SMDP)在参数化随机平稳策略下的仿真优化算法,并简要分析了算法的收敛性。
With some concepts associating with multinomial numbers,we obtained the following results:the value of multinomial numbers does not change in identifying process,the value of multinomial numbers decreases in discreting process and increases in meaning process.
由与多重组合数 ( multinomial numbers)有关的几个概念得到 3个结论 :等价化过程不改变组合数的大小、悬殊化过程使组合数变小、平均化过程使组合数变大 。