Adaptive Sampling using Stein's Discrepancy

Project Description

Investigation into the usage of different sampling methods based on Stein’s discrepancy in the context of Curriculum Learning.

Implemented both a gradient-based approach (Stein’s Variational Gradient Descent1) and a direct minimization method (Stein Points2) in jax and evaluated them on different RL baselines.

Heiko Carrasco
Heiko Carrasco
Software Engineer