Q-learning is a typical RL method with a slow convergence speed especially as the scales of the state space and the action space increase.

英美

利用模糊综合决策方法处理专家经验和环境信息得到Q学习的先验知识，对Q学习的初始状态进行优化。

目录

查词历史