Building an Efficient Poker Agent Using RL

In this paper, we apply variations of Deep Q-learning (DQN) and Proximal Policy Optimization (PPO) to learn the game of heads-up no-limit Texas Hold’em.

<span title='2022-05-11 00:00:00 +0000 UTC'>May 2022</span>&nbsp;&middot;&nbsp;Alex Kashi, Vedang Lad, Hakon Grini