Publications and Preprints

2026

  1. Provably Convergent Actor-Critic in Risk-averse MARL
    Yizhou Zhang and Eric Mazumdar
    In Proceedings of the 43rd International Conference on Machine Learning (ICML), Spotlight (2.2%) , 2026
  2. Training Generalizable Collaborative Agents via Strategic Risk Aversion
    Chengrui Qu, Yizhou Zhang, Nicola Lanzetti, and Eric Mazumdar
    In submission , 2026

2025

  1. KL-regularization Itself is Differentially Private in Bandits and RLHF
    Yizhou Zhang, Kishan Panaganti, Laixi Shi, Juba Ziani, and Adam Wierman
    In submission , 2025
  2. CDC
    Convergent Q-Learning for Infinite-Horizon General-Sum Markov Games through Behavioral Economics
    Yizhou Zhang and Eric Mazumdar
    In Proceedings of the 64th IEEE Conference on Decision and Control (CDC), 2025
  3. Learning to Steer Learners in Games
    Yizhou Zhang, Yian Ma, and Eric Mazumdar
    In Proceedings of the 42nd International Conference on Machine Learning (ICML), 2025

2023

  1. Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning
    Yizhou Zhang*, Guannan Qu*, Pan Xu*, Yiheng Lin, Zaiwei Chen, and Adam Wierman
    Proceedings of the ACM on Measurement and Analysis of Computing Systems, 2023