无人驾驶铰接式车辆强化学习路径跟踪控制算法
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

基金项目:

国家高技术研究发展计划(863计划)项目(2011AA060404)和中央高校基本科研业务费专项资金项目(FRF-TP-16-004A1)


Reinforcement Learning Algorithm for Path Following Control of Articulated Vehicle
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对无人驾驶铰接式运输车辆无人驾驶智能控制问题,提出了一种强化学习自适应PID路径跟踪控制算法。首先推导了铰接车的运动学模型,根据该模型建立实际行驶路径与参考路径偏差的模型,以PID控制算法为基础,设计了基于强化学习的自适应PID路径跟踪控制器,该控制器以横向位置偏差、航向角偏差、曲率偏差为输入,以转角控制量为输出,通过强化学习算法对PID参数进行在线自适应整定。最后在实车道路试验中验证了控制器的路径跟踪质量并与传统PID控制结果进行了对比。结果表明,相比于传统PID控制器,强化学习自适应PID控制器能够有效减小超调和震荡,实现精确跟踪参考路径,可以较好地实现系统动态性能和稳态误差性能的优化。

    Abstract:

    With the industry 4.0 embraced a number of contemporary automation, data exchange and manufacturing technologies, the autonomous driving system is widespread. In order to enable the autonomous driving, path following strategies are essential to maintain the normal work of the vehicles. The articulated frame steering vehicles (ASV) are flexible, efficient and widely implemented in agriculture, mining, construction and forestry sectors due to their high maneuverability. The articulated vehicle usually composes of two units, a tractor and a trailer, which are connected by an articulation joint. However, as the ASV dynamics are significantly different from the conventional vehicles with front wheel steering, the path following controller derived for conventional vehicles is considered not to be applicable for the ASVs. Thus the path following control is challenging the robustness. A path following strategy is proposed for the ASVs on the basis of reinforcement learning adaptive PID algorithm. The kinematic model of the ASV is derived by neglecting the vehicle dynamics. Three measurable errors are defined to indicate the deviation of real path from reference path, i.e., lateral displacement error, orientation error and curvature error. These errors are served as the inputs in order to synthesize the path following controller and the desired steering angle is served as the output of path following controller. Based on the PID algorithm, the reinforcement learning method is selected for optimizing the parameters of PID online to reduce the overshoot and chattering. Furthermore, the prototype test is conducted to evaluate the performance of the proposed control law. The result shows that compared with the traditional PID, reinforcement learning adaptive PID controller can restrain the overshoot and chattering efficiently and follow the reference path accurately.

    参考文献
    相似文献
    引证文献
引用本文

邵俊恺,赵翾,杨珏,张文明,康翌婷,赵鑫鑫.无人驾驶铰接式车辆强化学习路径跟踪控制算法[J].农业机械学报,2017,48(3):376-382. SHAO Junkai, ZHAO Xuan, YANG Jue, ZHANG Wenming, KANG Yiting, ZHAO Xinxin. Reinforcement Learning Algorithm for Path Following Control of Articulated Vehicle[J]. Transactions of the Chinese Society for Agricultural Machinery,2017,48(3):376-382.

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2016-04-18
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2017-03-10
  • 出版日期: