Smart - Multi-purpose Landing Page Template

Project Objectives

A study on rollout mechanisms in model-based reinforcement learning

1. Extent of Rollouts

How much do we want to use the model to roll out in every step?

2. Directionality of Rollouts

In which direction, forward or backward, would be more beneficial to roll out in every step?

3. Design an automated rollout mechanisms

With insights identified in the first two objectives, can we design an automated rollout mechanism that takes full advantage of the two properties of rollout mechanisms?

Experimental Environments

The set of benchmarks planned to be used to conduct experiments

Grid World

Environments with discrete state space and discrete action space. The goal of an agent is to reach the goal state. This set of environments is ideal for large-scale experimentation

MuJoCo

A more complex set of locomotion control problems with continuous state space and continuous action space. Problems to be experimented on to verify findings in grid world environments

Team

Dr. J. Pan

Supervisor

Yat Long (Richie), Lo

BEng(CS) final year student

Bidirectional Rollouts in Model-Based Reinforcement Learning