On Reinforcement Learning, Effect Handlers, and the State Monad
We study algebraic effects and handlers as a way to support decision-making abstractions in functional programs, whereby a user can ask a learning algorithm to resolve choices without implementing the underlying selection mechanism, and can give feedback by way of rewards. Unlike some recently proposed approaches to the problem based on the selection monad, we express the underlying intelligence as a reinforcement learning algorithm implemented as a set of handlers for some of these algebraic operations, including those for choices and rewards. We show how algebraic operations and handlers — as available in the programming language EFF — can be used in practice to cleanly separate the learning algorithm from its environment, thus allowing for a good level of modularity. We then show how the host language can be taken to be a 𝜆-calculus with handlers, thereby highlighting the essential linguistic features. We conclude by hinting at how type and effect systems could ensure safety properties, while pointing out some directions for further work.
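The abstract above does not include code, but the architecture it describes — choice and reward as algebraic operations performed by the client program, with the learner implemented as a handler for them — can be sketched concretely. Below is a minimal sketch using OCaml 5's Effect module in place of EFF itself; the operation names Choose and Reward, the epsilon-greedy bandit learner, and all identifiers are illustrative assumptions, not the authors' implementation.

```ocaml
(* Illustrative sketch only: Choose, Reward, and the epsilon-greedy
   bandit learner below are our assumptions, not the authors' EFF
   code. OCaml 5's Effect module stands in for EFF's handlers. *)

open Effect
open Effect.Deep

(* The two algebraic operations the client program may perform:
   ask the learner to pick one of n actions, or report a reward. *)
type _ Effect.t += Choose : int -> int Effect.t
type _ Effect.t += Reward : float -> unit Effect.t

(* Learner state: per-action value estimates and visit counts. *)
type learner = {
  mutable last : int;      (* last action chosen *)
  values : float array;    (* running value estimate per action *)
  counts : int array;      (* times each action was rewarded *)
}

let epsilon = 0.1

(* The handler is the "intelligence": it interprets Choose with an
   epsilon-greedy rule and Reward with an incremental mean update,
   keeping the learning algorithm separate from the client program. *)
let with_learner (st : learner) (body : unit -> 'a) : 'a =
  match_with body ()
    { retc = (fun v -> v);
      exnc = raise;
      effc = (fun (type c) (eff : c Effect.t) ->
        match eff with
        | Choose n ->
            Some (fun (k : (c, _) continuation) ->
              let a =
                if Random.float 1.0 < epsilon then Random.int n
                else begin                       (* greedy action *)
                  let best = ref 0 in
                  for i = 1 to n - 1 do
                    if st.values.(i) > st.values.(!best) then best := i
                  done;
                  !best
                end
              in
              st.last <- a;
              continue k a)
        | Reward r ->
            Some (fun (k : (c, _) continuation) ->
              let a = st.last in
              st.counts.(a) <- st.counts.(a) + 1;
              st.values.(a) <-
                st.values.(a)
                +. (r -. st.values.(a)) /. float_of_int st.counts.(a);
              continue k ())
        | _ -> None) }

(* The environment: client code performs choices and rewards without
   knowing how they are resolved. *)
let episode () =
  for _ = 1 to 1_000 do
    let a = perform (Choose 2) in
    perform (Reward (if a = 0 then 1.0 else 0.0))  (* action 0 pays off *)
  done

let () =
  Random.self_init ();
  let st = { last = 0; values = [| 0.; 0. |]; counts = [| 0; 0 |] } in
  with_learner st episode;
  Printf.printf "estimates: %.2f %.2f\n" st.values.(0) st.values.(1)
```

Note how episode mentions no learning machinery at all: it only performs Choose and Reward. Swapping in a different handler changes the learner without touching the client, which is the kind of modularity the abstract claims.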
Sun 11 Sep (displayed time zone: Belgrade/Bratislava/Budapest/Ljubljana/Prague)
11:00 - 12:30 (HOPE)

11:00 (30 min) Talk: Relative Monads in CBPV for Stack-based Effects
               Max S. New (University of Michigan)

11:30 (30 min) Talk: Temporal refinements for Call-By-Push-Value with fixpoint
               Guilhem Jaber (University of Nantes), Kenji Maillard (Inria Nantes & University of Chile), Colin Riba (LIP - ENS de Lyon)

12:00 (30 min) Talk: On Reinforcement Learning, Effect Handlers, and the State Monad
               Ugo Dal Lago (University of Bologna; Inria), Alexis Ghyselen (University of Bologna), Francesco Gavazzo (University of Bologna & INRIA Sophia Antipolis)