SUPR
Best Policy Identification in Reinforcement Learning
Dnr:

NAISS 2023/22-487

Type:

NAISS Small Compute

Principal Investigator:

Alessio Russo

Affiliation:

Kungliga Tekniska högskolan

Start Date:

2023-05-05

End Date:

2024-06-01

Primary Classification:

10201: Computer Sciences

Webpage:

Allocation

Abstract

This project investigates how to drive exploration in reinforcement learning problems using the framework of best policy identification. The goal of the project is to demonstrate that the best policy identification methodology can be used to improve the exploration process in reinforcement learning problems. Results will be obtained by running experiments on several Gym and Mujoco environments.