Grasp your direction
Find your place in life
Introduction
Hey, I'm Yi Ma, a third-year PhD candidate in the College of Intelligence and Computing at Tianjin University. I'm a member of Professor Jianye Hao's research group.
My research interests are offline reinforcement learning and combinatorial optimization.
Outside of research, I'm a huge fan of basketball, snowboarding, and orienteering.
Competitions
Honors
- Second Prize in Innovation Pioneer, as an intern at Huawei Central Research Institute
Internships
Papers
Authors with equal contribution are marked with *; corresponding authors are marked with ^.
- SplitNet: A Reinforcement Learning based Sequence Splitting Method for the MinMax Multiple Travelling Salesman Problem
  Hebin Liang, Yi Ma^, Zilin Cao, Tianyang Liu, Fei Ni, Zhigang Li, Jianye Hao
  AAAI 2023 | [paper]
- PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations
  Tong Sang, Hongyao Tang, Yi Ma, Jianye Hao, Yan Zheng, Zhaopeng Meng, Boyan Li, Zhen Wang
  IJCAI 2022 | [paper]
- A Hierarchical Reinforcement Learning Based Optimization Framework for Large-scale Dynamic Pickup and Delivery Problems
  Yi Ma*, Xiaotian Hao*, Jianye Hao, Jiawen Lu, Xing Liu, Xialiang Tong, Mingxuan Yuan, Zhigang Li, Jie Tang, Zhaopeng Meng
  NeurIPS 2021 | [paper]
- A Multi-Graph Attributed Reinforcement Learning Based Optimization Algorithm for Large-scale Hybrid Flow Shop Scheduling Problem
  Fei Ni, Jianye Hao, Jiawen Lu, Xialiang Tong, Mingxuan Yuan, Jiahui Duan, Yi Ma, Kun He
  KDD 2021 | [paper]
- Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
  Xiaotian Hao*, Zhaoqing Peng*, Yi Ma*, Guan Wang, Junqi Jin, Jianye Hao, Shan Chen, Rongquan Bai, Mingzhou Xie, Miao Xu, Zhenzhe Zheng, Chuan Yu, Han Li, Jian Xu, Kun Gai
  ICML 2020 | [paper]
- Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising
  Xiaotian Hao, Junqi Jin, Jianye Hao, Jin Li, Weixun Wang, Yi Ma, Zhenzhe Zheng, Han Li, Jian Xu, Kun Gai
  IJCAI 2020 | [paper]
- KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge
  Peng Zhang, Jianye Hao, Weixun Wang, Hongyao Tang, Yi Ma, Yihai Duan, Yan Zheng
  IJCAI 2020 | [paper]
- Integrating Sequence and Network Information to Enhance Protein-Protein Interaction Prediction Using Graph Convolutional Networks
  Leilei Liu*, Yi Ma*, Xianglei Zhu, Yaodong Yang, Xiaotian Hao, Li Wang, Jiajie Peng
  BIBM 2019 | [paper]
Academic Service
- NeurIPS 2022 Top Reviewer
- Program Chair or Reviewer for NeurIPS, ICML, IJCAI, ICLR, AAAI, and CIKM
Patents
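- A Cooperative Exploration Method for Multi-Agent Sparse-Reward Environments Based on Intrinsic Motivation (基于内在动机的多智能体稀疏奖励环境协作探索方法)
  Jingda Xie, Jianye Hao, Yan Zheng, Yi Ma, Tianpei Yang
- A Deep Reinforcement Learning Method Based on an Environment Dynamics Decomposition Model (基于环境动态分解模型的深度强化学习方法)
  Cong Wang, Tianpei Yang, Jianye Hao, Yan Zheng, Yi Ma
- A Design Method for Automated Negotiation Agents Based on Deep Reinforcement Learning (基于深度强化学习的自动协商智能体设计方法)
  Jie Lin, Siqi Chen, Jianye Hao, Yan Zheng, Yi Ma
- A Multi-Agent Game AI Design Method Based on Attention Mechanisms and Reinforcement Learning (基于注意力机制与强化学习的多智能体游戏AI设计方法)
  Ningning Zhang, Li Wang, Jianye Hao, Yan Zheng, Yi Ma, Weixun Wang
- A Meta-Reinforcement Learning Method Based on Contrastive Learning and Mutual Information (基于对比学习和互信息的元强化学习方法)
  Tong Sang, Jianye Hao, Yan Zheng, Yi Ma, Hongyao Tang
- A Reinforcement Learning Method for Game AI Agents with Hybrid Continuous-Discrete Decision Making (面向连续-离散混合决策的游戏AI智能体强化学习方法)
  Boyan Li, Hongyao Tang, Yi Ma, Jianye Hao, Yan Zheng, Li Wang
- A Complex Game AI Design Method Based on Hierarchical Deep Reinforcement Learning (基于层次深度强化学习的复杂游戏AI设计方法)
  Yudong Zhao, Hongyao Tang, Yi Ma, Jianye Hao, Yan Zheng, Li Wang
- A Logistics Scheduling and Planning Method Based on Graph Neural Networks and Reinforcement Learning (基于图神经网络和强化学习的物流调度规划方法)
  Yi Ma, Zhigang Li, Jianye Hao
- An Order Allocation Method and Apparatus (一种订单分配方法及装置)
  Jiawen Lu, Yi Ma, Mingxuan Yuan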
Contacts