Deep Reinforcement Learning
Quite old at this point, but still a very interesting read: https://deepmind.com/research/publications/playing-atari-deep-reinforcement-learning/
I liked how they simplified state space, and a experience replay was a neat idea. Question is how it could be used to store Q-value function for economic agents.