Implementation of Hindsight Differentiable Policy Optimization, as described in the paper Deep Reinforcement Learning for Inventory Networks: Toward Reliable Policy Optimization. We argue that ...
Abstract: In the last few years, the deep learning paradigm has experienced huge success in various machine learning research areas like computer vision, drug discoveries, natural language processing, ...