SmallData Symposium 2024

C04

Learning fast and efficient hyperparameter control for deep reinforcement learning on small datasets

Project summary

Deep reinforcement learning (RL) algorithms are a powerful class of methods for optimizing sequential decision making and control problems, and are the driver behind many real-world applications. We hypothesize that deep RL applications can considerably benefit from a focus on hyperparameter tuning. Specifically, we aim to develop automated techniques for setting the hyperparameters in a sample-efficient manner for small datasets, thus reducing uncertainty. We will create appropriate benchmarks and then investigate the performance of currently existing hyperparameter optimization (HPO) methods, to then develop automated approaches. This will include a dynamic meta-level control with the use of a robust hyperparameter transfer for online HPO for offline RL and neural architecture search (NAS).