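Personal Profile
Education
Sep 2000–Jul 2004 Northwestern Polytechnical University, School of Marine Engineering, Bachelor's degree
Sep 2004–Mar 2007 Beijing University of Posts and Telecommunications, Department of Automation, Master's degree
Sep 2007–Jan 2012 Tsinghua University, Department of Automation, Ph.D.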
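Work Experience
Feb 2012–May 2015 China Academy of Space Technology (Fifth Academy), Beijing Institute of Control Engineering, Supervising Designer
Jun 2015–Jul 2018 Tsinghua University, Graduate School at Shenzhen, Division of Information Science, Postdoctoral Fellow / Assistant Researcher
Aug 2018–Jan 2023 Tsinghua University, Department of Automation, Assistant Researcher
Jan 2023–present Tsinghua University, Department of Automation, Associate Researcher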
Academic Service
Reviewer for international conferences including NeurIPS, ICLR, IJCAI, and AAMAS, and for international journals including MSSP, RAL, and AST
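Professional Affiliations
Secretary-General, Technical Committee on Intelligent Decision-Making (in preparation), Artificial Intelligence Society
Member, Technical Committee on Modeling and Simulation of Intelligent Unmanned Systems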
Awards and Honors
First Prize, Military Science and Technology Progress Award
Recent Publications
Kailin Zeng, Qiyuan Zhang, Bin Chen, Bin Liang, and Jun Yang*. APD: Learning Diverse Behaviors for Reinforcement Learning Through Unsupervised Active Pre-Training, IEEE Robotics and Automation Letters, 2022.
Shu Leng, Xianglong Li, Meng Yu, Jun Yang*, Bin Liang. Flexible online planning based residual space object de-spinning for dual-arm space-borne maintenance, Aerospace Science and Technology, 2022, 130:1-13.
Xiaoteng Ma, Yiqin Yang&, Hao Hu, Qihan Liu, Jun Yang*, Chongjie Zhang, Qianchuan Zhao, Bin Liang. Offline Reinforcement Learning with Value-based Episodic Memory, Tenth International Conference on Learning Representations (ICLR), 2022.
Jun Yang, Bin Chen, Yanan Wang, Chunzhu Wang. Crack detection in carbide anvil using acoustic signal and deep learning with particle swarm optimization, Measurement, 2021.
Duo Wang, Ming Zhang, Yuchun Xu, Weining Lu, Jun Yang*, Tao Zhang. Metric-based Meta-learning Model for Few-shot Fault Diagnosis under Multiple Limited Data Conditions, Mechanical Systems and Signal Processing, 2021.
Qiyuan Zhang, Xiaoteng Ma, Yiqin Yang, Chenghao Li, Jun Yang*, Yu Liu, Bin Liang. Learning to Discover Task-Relevant Features for Interpretable Reinforcement Learning, IEEE Robotics and Automation Letters, 2021.
Yiqin Yang, Xiaoteng Ma, Chenghao Li, Zewu Zheng, Qiyuan Zhang, Gao Huang, Jun Yang*, Qianchuan Zhao. Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning, 35th Conference on Neural Information Processing Systems (NeurIPS), 2021.
Chenghao Li, Tonghan Wang, Chengjie Wu, Qianchuan Zhao, Jun Yang*, Chongjie Zhang. Celebrating Diversity in Shared Multi-Agent Reinforcement Learning, 35th Conference on Neural Information Processing Systems (NeurIPS), 2021.
Xiaoteng Ma, Xiaohang Tang, Li Xia, Jun Yang*, Qianchuan Zhao. Average-Reward Reinforcement Learning with Trust Region Methods, Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (IJCAI), 2021.
Xiaoteng Ma, Yiqin Yang, Chenghao Li, Qianchuan Zhao, Jun Yang, Yiwen Lu. Modeling the Interaction between Agents in Cooperative Multi-Agent Reinforcement Learning, 20th International Conference on Autonomous Agents and Multi-agent Systems (AAMAS), 2021.
Xiaoyan Hu, Li Xia, Jun Yang, Qianchuan Zhao. A Fast-Convergence Method of Monte Carlo Counterfactual Regret Minimization for Imperfect Information Dynamic Games, IEEE 9th Data Driven Control and Learning Systems Conference, 2020.
Chenghao Li, Xiaoteng Ma, Li Xia, Qianchuan Zhao, Jun Yang. Fairness Control of Traffic Light via Deep Reinforcement Learning, 16th IEEE International Conference on Automation Science and Engineering (CASE), 2020.