A simple primer for generative models with toy examples
A cheat sheet for implementing transformers.
A trivial attempt to unify the fundamental RL concepts in one place for building intuitions.
Here we take a detailed view of policy gradient methods and their intuitions. This blog discuess how REINFORCE, baseline and actor-critic algorithms came into existence.
This post is part of a series of posts on optimal control theory. We take a detalied look on how classical LQR control is derived. A simple implementation is provided for clarity.