Q-learning is a typical reinforcement learning (RL) method, but it converges slowly, especially as the state space and the action space grow.

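  • A fuzzy comprehensive decision-making method is used to process expert experience and environmental information into prior knowledge for Q-learning, which is then used to optimize the initial state of Q-learning.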
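
The idea of seeding Q-learning with fuzzy prior knowledge can be sketched roughly as follows. This is only an illustration under stated assumptions, not the method of the quoted source: it assumes a tabular Q-function, a weighted-average composition as the fuzzy comprehensive evaluation operator (one common choice among several), and membership degrees that have already been derived from expert experience and environmental information. The names `fuzzy_prior_q`, `q_update`, `memberships`, `weights`, and `scale` are hypothetical.

```python
def fuzzy_prior_q(memberships, weights, scale=1.0):
    """Build an initial Q-table from fuzzy membership degrees.

    memberships[s][a] is a list of degrees in [0, 1], one per factor
    (for example: expert preference, environmental favourability).
    weights holds one non-negative weight per factor (assumed given).
    """
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must have a positive sum")
    norm = [w / total for w in weights]  # normalise weights to sum to 1

    q0 = []
    for state_rows in memberships:
        row = []
        for degrees in state_rows:
            if len(degrees) != len(norm):
                raise ValueError("one membership degree per factor is required")
            # Weighted-average fuzzy composition: score = sum_f w_f * r_f
            score = sum(w * d for w, d in zip(norm, degrees))
            row.append(scale * score)
        q0.append(row)
    return q0


def q_update(q, s, a, reward, s_next, alpha=0.1, gamma=0.9):
    """Apply one standard tabular Q-learning update in place."""
    target = reward + gamma * max(q[s_next])
    q[s][a] += alpha * (target - q[s][a])
    return q[s][a]


# Hypothetical toy problem: 2 states, 2 actions, 2 factors per pair.
memberships = [
    [[0.8, 0.6], [0.2, 0.4]],
    [[0.5, 0.5], [1.0, 0.0]],
]
weights = [3, 1]  # the first factor is trusted three times as much

q_table = fuzzy_prior_q(memberships, weights)
new_value = q_update(q_table, s=0, a=1, reward=1.0, s_next=1)
```

Starting from a table like `q_table` instead of all zeros biases early action selection toward choices the prior knowledge favours, which is the intuition behind using prior knowledge to speed up convergence. Learning then proceeds with the ordinary update in `q_update`.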