Building an Efficient Poker Agent Using RL

In this paper, we apply variants of Deep Q-Networks (DQN) and Proximal Policy Optimization (PPO) to learn to play heads-up no-limit Texas Hold’em.

May 2022 · Alex Kashi, Vedang Lad, Hakon Grini