About

Hey, this is MY.

把握方向
定位人生

Introduction

Hey, I'm Yi Ma, a 3rd year PhD candidate of College of Intelligence and Computing in Tianjin University. I'm a member of Professor Jianye Hao's research group. I have an research interest in offline reinforcement learning and combinatorial optimization.

Besides, I'm a huge fan of basketball, snowboarding and orienteering.

Competitions
  • Yi Ma and Hebin Liang won the first place prize of NeurIPS 2022 Driving SMARTS Competition in both Track 1 & 2

Honor
  • Second Prize of Innovation Pioneer as an intern in Huawei Central Research Institute
Internships
Papers
Authors with equal contribution are marked by *.
  • SplitNet: A Reinforcement Learning based Sequence Splitting Method for the MinMax Multiple Travelling Salesman Problem
    Hebin Liang*, Yi Ma*, Zilin Cao, Tianyang Liu, Fei Ni, Zhigang Li, Jianye Hao
    AAAI 2023 | [paper]
  • PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations
    Tong Sang, Hongyao Tang, Yi Ma, Jianye Hao, Yan Zheng, Zhaopeng Meng, Boyan Li, Zhen Wang
    IJCAI 2022 | [paper]
  • A Hierarchical Reinforcement Learning Based Optimization Framework for Large-scale Dynamic Pickup and Delivery Problems
    Yi Ma*, Xiaotian Hao*, Jianye HAO, Jiawen Lu, Xing Liu, Xialiang Tong, Mingxuan Yuan, Zhigang Li, Jie Tang, Zhaopeng Meng
    NeurIPS 2021 | [paper]
  • A Multi-Graph Attributed Reinforcement Learning Based Optimization Algorithm for Large-scale Hybrid Flow Shop Scheduling Problem
    Fei Ni, Jianye Hao, Jiawen Lu, Xialiang Tong, Mingxuan Yuan, Jiahui Duan, Yi Ma, Kun He
    KDD 2021 | [paper]
  • Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
    Xiaotian Hao*, Zhaoqing Peng*, Yi Ma*, Guan Wang, Junqi Jin, Jianye Hao, Shan Chen, Rongquan Bai, Mingzhou Xie, Miao Xu, Zhenzhe Zheng, Chuan Yu, Han Li, Jian Xu, Kun Gai
    ICML 2020 | [paper]
  • Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising
    Xiaotian Hao, Junqi Jin, Jianye Hao, Jin Li, Weixun Wang, Yi Ma,, Zhenzhe Zheng, Han Li, Jian Xu, Kun Gai
    IJCAI 2020 | [paper]
  • KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge
    Peng Zhang, Jianye Hao, Weixun Wang, Hongyao Tang, Yi Ma, Yihai Duan, Yan Zheng
    IJCAI 2020 | [paper]
  • Integrating Sequence and Network Information to Enhance Protein-Protein Interaction Prediction Using Graph Convolutional Networks
    Leilei Liu*, Yi Ma*, Xianglei Zhu, Yaodong Yang, Xiaotian Hao, Li Wang, Jiajie Peng
    BIBM 2019 | [paper]
Academic Service
  • NeurIPS 2022 Top Reviewer
  • Program Chair or Reviewer in NeurIPS, ICML, IJCAI, ICLR, AAAI, and CIKM
Patents
  • 基于内在动机的多智能体稀疏奖励环境协作探索方法
    谢京达;郝建业;郑岩;马亿;杨天培
  • 基于环境动态分解模型的深度强化学习方法
    王聪;杨天培;郝建业;郑岩;马亿
  • 基于深度强化学习的自动协商智能体设计方法
    林杰;陈锶奇;郝建业;郑岩;马亿
  • 基于注意力机制与强化学习的多智能体游戏AI设计方法
    张宁宁;王立;郝建业;郑岩;马亿;王维埙
  • 基于对比学习和互信息的元强化学习方法
    桑桐;郝建业;郑岩;马亿;汤宏垚
  • 面向连续-离散混合决策的游戏AI智能体强化学习方法
    李博研;汤宏垚;马亿;郝建业;郑岩;王立
  • 基于层次深度强化学习的复杂游戏AI设计方法
    赵煜东;汤宏垚;马亿;郝建业;郑岩;王立
  • 基于图神经网络和强化学习的物流调度规划方法
    马亿;李峙钢;郝建业
  • 一种订单分配方法及装置
    陆佳文;马亿;袁明轩
Contacts