Best Policy Identification in Reinforcement Learning

SUPR uses JavaScript for certain functions. We cannot guarantee that you will be able to use the system with JavaScript disabled.

Dnr:

NAISS 2023/22-487

Type:

NAISS Small Compute

Principal Investigator:

Alessio Russo

Affiliation:

Kungliga Tekniska högskolan

Start Date:

2023-05-05

End Date:

2024-06-01

Primary Classification:

10201: Computer Sciences

Webpage:

Allocation

Alvis at C3SE: 1000 GPU-h/month
Mimer at C3SE: 500 GiB

Abstract

This project investigates how to drive exploration in reinforcement learning problems using the framework of best policy identification. The goal of the project is to demonstrate that the best policy identification methodology can be used to improve the exploration process in reinforcement learning problems. Results will be obtained by running experiments on several Gym and Mujoco environments.