The algorithm revises the reinforcement signal and improves the exploration policy to overcome the negative effect of limit cycles in the inverted pendulum system.

  • 算法采用修正强化信号和改进探索策略的方法克服极限环对倒立摆系统的影响。
目录 查词历史